ViPE is the first language model designed for assisting in text-to-image generation. It translates any arbitrary piece of text into a visualizable prompt. It helps any text-to-image model in figurative or non-lexical language visualizations. Below is a comparison between SDXL with and without ViPE given infinity as a prompt.
Below is another example of using DALLE 2 with and without ViPE for a highly abstract prompt. The image on the left shows the prompts and the generated image. The images on the right, show ViPE's interpretaion of how the initial prompt could be visualized and the genrated images.
How ViPE is Built?
Building ViPE involves three main steps
Data Collection: Scraping all the English lyrics from the Genius platform, preprocessing and noise removal
Synthetic Label Generation: Applying GPT3.5 Turbo to generate visual translation (elaborations) for the lyrics based on human instructions and the context of the songs. Compiling the LyricCanvas dataset comprising of 10M samples.
Training: Obtaining a robust and lightweight model by training GPT2 on the LyricCanvas dataset with causal language modeling objective conditioned on the lyrics
Down Stream Applications
ViPE's robust generalization capabilities offers a wide range of usage including
Our website uses cookies. Some of them are mandatory, while others allow us to improve your user experience on our website. The settings you have made can be edited at any time.
or
Essential
in2code
Name
in2cookiemodal-selection
Use
Required to save the user selection of the cookie settings.
Lifetime
3 months
Name
be_lastLoginProvider
Use
Required for the TYPO3 backend login to determine the time of the last login.
Lifetime
3 months
Name
be_typo_user
Use
This cookie tells the website whether a visitor is logged into the TYPO3 backend and has the rights to manage it.
Lifetime
Browser session
Name
ROUTEID
Use
These cookies are set to always direct the user to the same server.
Lifetime
Browser session
Name
fe_typo_user
Use
Enables frontend login.
Lifetime
Browser session
Videos
in2code
Name
iframeswitch
Use
Used to show all third-party contents.
Lifetime
3 months
YouTube
Name
yt-player-bandaid-host
Use
Is used to display YouTube videos.
Lifetime
Persistent
Name
yt-player-bandwidth
Use
Is used to determine the optimal video quality based on the visitor's device and network settings.
Lifetime
Persistent
Name
yt-remote-connected-devices
Use
Saves the settings of the user's video player using embedded YouTube video.
Lifetime
Persistent
Name
yt-remote-device-id
Use
Saves the settings of the user's video player using embedded YouTube video.
Lifetime
Persistent
Name
yt-player-headers-readable
Use
Collects data about visitors' interaction with the site's video content - This data is used to make the site's video content more relevant to the visitor.
Lifetime
Persistent
Name
yt-player-volume
Use
Is used to save volume preferences for YouTube videos.
Lifetime
Persistent
Name
yt-player-quality
Use
Is used to save the quality settings for YouTube videos.
Lifetime
Persistent
Name
yt-remote-session-name
Use
Saves the settings of the user's video player using embedded YouTube video.
Lifetime
Browser session
Name
yt-remote-session-app
Use
Saves the settings of the user's video player using embedded YouTube video.
Lifetime
Browser session
Name
yt-remote-fast-check-period
Use
Saves the settings of the user's video player using embedded YouTube video.
Lifetime
Browser session
Name
yt-remote-cast-installed
Use
Saves the user settings when retrieving a YouTube video integrated on other web pages
Lifetime
Browser session
Name
yt-remote-cast-available
Use
Saves user settings when retrieving integrated YouTube videos.
Lifetime
Browser session
Google
Name
ANID
Use
Used for targeting purposes to profile the interests of website visitors in order to display relevant and personalized Google advertising.
Lifetime
2 years
Name
SNID
Use
Google Maps - Google uses these cookies to store user preferences and information when you view pages with Google Maps.
Lifetime
1 month
Name
SSID
Use
Used to store information about how you use the site and what advertisements you saw before visiting this site, and to customize advertising on Google resources by remembering your recent searches, your previous interactions with an advertiser's ads or search results, and your visits to an advertiser's site.
Lifetime
6 months
Name
1P_JAR
Use
This cookie is used to support Google's advertising services.
Lifetime
1 month
Name
SAPISID
Use
Used for targeting purposes to profile the interests of website visitors in order to display relevant and personalized Google advertising.
Lifetime
2 years
Name
APISID
Use
Used for targeting purposes to profile the interests of website visitors in order to display relevant and personalized Google advertising.
Lifetime
6 months
Name
HSID
Use
Includes encrypted entries of your Google account and last login time to protect against attacks and data theft from form entries.
Lifetime
2 years
Name
SID
Use
Used for security purposes to store digitally signed and encrypted records of a user's Google Account ID and last login time, enabling Google to authenticate users, prevent fraudulent use of login credentials, and protect user data from unauthorized parties. This may also be used for targeting purposes to display relevant and personalized advertising content.
Lifetime
6 months
Name
SIDCC
Use
This cookie stores information about user settings and information for Google Maps.
Lifetime
3 months
Name
NID
Use
The NID cookie contains a unique ID that Google uses to store your preferences and other information.
Lifetime
6 months
Name
CONSENT
Use
This cookie tracks how you use a website to show you advertisements that may be of interest to you.
Lifetime
18 years
Name
__Secure-3PAPISID
Use
This cookie is used to support Google's advertising services.
Lifetime
2 years
Name
__Secure-3PSID
Use
This cookie is used to support Google's advertising services.
Lifetime
6 months
Name
__Secure-3PSIDCC
Use
This cookie is used to support Google's advertising services.