Today OpenAI ChatGPT announced it will work together with organizations to produce public and private datasets for training artificial intelligence (AI) models for training data that are more conversational in style:
We’re particularly looking for data that expresses human intention, across any language, topic and format,” the company said in a blog post.
ChatGPT, a chatbot, generates “poems and prose from simple prompts”, is based on large language models that are trained entirely on open-source data available on the Internet.
OpenAI is seeking organizations to help it create an open-source datasets and private datasets for training language models. The open-source dataset would be public for anyone to use in AI model training.