Meta's open-source ImageBind AI aims to mimic human perception | Engadget

📆 09/05/2023 18:18:00

Brasil Notícia Notícia

Brasil Últimas Notícias,Brasil Manchetes

📆 09/05/2023 18:18:00
📰 engadget

⏱ Reading Time:
44 sec. here
2 min. at publisher
📊 Quality Score:
News: 21%
Publisher: 63%

Meta's open-source ImageBind AI aims to mimic human perception

pair words with images, allowing you to generate visual scenes based only on a text description, ImageBind casts a broader net. It can link text, images / videos, audio, 3D measurements , temperature data , and motion data — and it does this without having to first train on every possibility. It’s an early stage of a framework that could eventually generate complex environments from an input as simple as a text prompt, image or audio recording .

You could view ImageBind as moving machine learning closer to human learning. For example, if you’re standing in a stimulating environment like a busy city street, your brain absorbs the sights, sounds and other sensory experiences to infer information about passing cars and pedestrians, tall buildings, weather and much more. Humans and other animals evolved to process this data for our genetic advantage: survival and passing on our DNA.

So, while you can use Midjourney to prompt “a basset hound wearing a Gandalf outfit while balancing on a beach ball” and get a relatively realistic photo of this bizarre scene, a multimodal AI tool like ImageBind may eventually create a video of the dog with corresponding sounds, including a detailed suburban living room, the room’s temperature and the precise locations of the dog and anyone else in the scene.

Resumimos esta notícia para que você possa lê-la rapidamente. Se você se interessou pela notícia, pode ler o texto completo aqui. Consulte Mais informação:

Brasil Últimas Notícias, Brasil Manchetes

Similar News:Você também pode ler notícias semelhantes a esta que coletamos de outras fontes de notícias.

Meta open-sources multisensory AI model that combines six types of dataThe ImageBind model combines six types of information: text, audio, visual, movement, thermal, and depth data.
Consulte Mais informação »

Meta is 'accelerating' plans to being more ads to Reels on Facebook and Instagram | EngadgetMeta is bringing more ads to Reels on Facebook and Instagram, and changing up how creators can earn money from their content..
Consulte Mais informação »

Meta is 'accelerating' plans to bring more ads to Reels on Facebook and Instagram | EngadgetMeta is bringing more ads to Reels on Facebook and Instagram, and changing up how creators can earn money from their content..
Consulte Mais informação »

Dota 2's biggest tournament will return to Seattle this year | EngadgetFor the first time since 2017, The International, Dota 2's most prestigious tournament, will take place in Valve's hometown..
Consulte Mais informação »

'Darkest Dungeon II' arrives on Steam next week | EngadgetAfter almost two years of early access and more than five years after it was first announced, Darkest Dungeon II is ready for release..
Consulte Mais informação »

Qualcomm is buying auto-safety chipmaker Autotalks | EngadgetQualcomm has agreed to acquire an Israeli fabless chipmaker called Autotalks, and according to TechCrunch, the deal will cost the company around $350 to $400 million..
Consulte Mais informação »