The bot is designed to protect user data and to take account of scraped information so that it does not breach copyrights.
Meta’s AI assistant is built on a custom large language model (LLM) that blends Llama 2 and Emu to handle text and image tasks.
“We’ve tried to exclude datasets that have a heavy preponderance of personal information,” Clegg said, adding that the “vast majority” of the data used by Meta for training was publicly available. Companies building AI systems are weighing how to handle the private or copyrighted materials vacuumed up during training, which their systems may reproduce, while facing lawsuits from authors accusing them of infringing copyrights.
The product will be able to generate text, audio and imagery and will have access to real-time information through a partnership with Microsoft’s Bing search engine. Interactions with Meta AI may also be used to improve its features, a spokesperson said.