To look inside this black box, we analyzed Google’s C4 data set, a massive snapshot of the contents of 15 million websites that have been used to instruct some high-profile English-language AIs, called large language models, including Google’s T5 and Facebook’s LLaMA. (OpenAI does not disclose what datasets it uses to train the models backing its popular chatbot, ChatGPT.)
There's a search tool to see what sites are in this data set. Relevant screenshot attached...
Per the Washington Post:
Inside the secret list of websites that make AI like ChatGPT sound smart