Scientists created 'OpinionGPT' to explore explicit human bias, and you can test it for yourself


A team of researchers from Humboldt-Universität zu Berlin has developed a large language model with the distinction of having been intentionally tuned to generate outputs with expressed bias.

The team's model, called OpinionGPT, is a fine-tuned variant of Meta's Llama 2, an AI system similar in capability to OpenAI's ChatGPT or Anthropic's Claude 2.

Using a process called instruction-based fine-tuning, OpinionGPT can purportedly respond to prompts as if it were a representative of one of 11 bias groups: American, German, Latin American, Middle Eastern, a teenager, someone over 30, an older person, a man, a woman, a liberal, or a conservative.

OpinionGPT was refined on a corpus of data derived from “AskX” communities, called subreddits, on Reddit. Examples of these subreddits would include “Ask a Woman” and “Ask an American.”

The team started by finding subreddits related to the 11 specific biases and pulling the 25,000 most popular posts from each one. They then retained only those posts that met a minimum threshold for upvotes, did not contain an embedded quote, and were under 80 words.
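The filtering step described above can be sketched roughly as follows. This is a minimal illustration, not the researchers' code: the `Post` fields, the default `min_upvotes` value, and the use of a leading `>` to detect an embedded quote are all assumptions made for the example. Only the three criteria and the 80-word limit come from the article.

```python
from dataclasses import dataclass


@dataclass
class Post:
    """Hypothetical record for one subreddit post (field names are assumptions)."""
    text: str
    upvotes: int


def contains_quote(text: str) -> bool:
    # Assumption: an embedded quote is any line starting with '>',
    # which is how Reddit's markdown marks quoted text.
    return any(line.lstrip().startswith(">") for line in text.splitlines())


def keep_post(post: Post, min_upvotes: int = 10, max_words: int = 80) -> bool:
    """Apply the three criteria: enough upvotes, no embedded quote, under max_words."""
    return (
        post.upvotes >= min_upvotes            # threshold value is an assumption
        and not contains_quote(post.text)
        and len(post.text.split()) < max_words
    )


def filter_posts(posts: list[Post], **kwargs) -> list[Post]:
    """Return only the posts that pass keep_post, preserving their order."""
    return [p for p in posts if keep_post(p, **kwargs)]
```

Calling `filter_posts(posts, min_upvotes=25)` would tighten the upvote threshold; the article does not say what value the team actually used.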

With what was left, it appears as though they used an approach similar to Anthropic's Constitutional AI. Rather than spin up entirely new models to represent each bias label, they essentially fine-tuned the single 7 billion-parameter Llama 2 model with separate instruction sets for each expected bias.
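One way to picture "a single model, separate instruction sets" is as a data-formatting step: every training example is tagged with the bias group it came from, and all examples are pooled into one fine-tuning set. The sketch below is illustrative only. The prompt wording, the `build_example` and `build_dataset` helpers, and the dictionary keys are hypothetical and are not taken from the paper; only the 11 group labels come from the article.

```python
# The 11 bias groups named in the article.
BIAS_GROUPS = [
    "American", "German", "Latin American", "Middle Eastern",
    "a teenager", "someone over 30", "an older person",
    "a man", "a woman", "a liberal", "a conservative",
]


def build_example(bias: str, question: str, answer: str) -> dict:
    """Wrap one question/answer pair in a bias-specific instruction (hypothetical template)."""
    if bias not in BIAS_GROUPS:
        raise ValueError(f"unknown bias group: {bias!r}")
    return {
        "instruction": f"Respond as a representative of this group: {bias}.",
        "input": question,
        "output": answer,
    }


def build_dataset(pairs_by_bias: dict) -> list:
    """Pool examples from every bias group into a single fine-tuning set."""
    dataset = []
    for bias, pairs in pairs_by_bias.items():
        for question, answer in pairs:
            dataset.append(build_example(bias, question, answer))
    return dataset
```

Because every record carries its own instruction, one model sees all groups during training and the instruction alone selects the persona at inference time.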


The result, based upon the methodology, architecture, and data described in the German team's research paper, appears to be an AI system that functions as more of a stereotype generator than a tool for studying real-world bias.

Due to the nature of the data the model has been refined on, and that data’s dubious relation to the labels defining it, OpinionGPT doesn’t necessarily output text that aligns with any measurable real-world bias. It simply outputs text reflecting the bias of its data.

The researchers themselves acknowledge some of the limitations this places on their study, writing:

“For instance, the responses by “Americans” should be better understood as ‘Americans that post on Reddit,’ or even ‘Americans that post on this particular subreddit.’ Similarly, ‘Germans’ should be understood as ‘Germans that post on this particular subreddit,’ etc.”

These caveats could be refined further to say that the posts come from, for example, "people claiming to be Americans who post on this particular subreddit," as the paper makes no mention of vetting whether the posters behind a given post actually belong to the demographic or bias group they claim to represent.

The authors go on to state that they intend to explore models that further delineate demographics (i.e., liberal German, conservative German).

The outputs given by OpinionGPT appear to vary between representing demonstrable bias and differing wildly from the established norm, making it difficult to discern its viability as a tool for measuring or discovering actual bias.

Source: Screenshot, Table 2: Haller et al., 2023

According to OpinionGPT, as shown in the above image, for example, Latin Americans are biased towards basketball being their favorite sport.

Empirical research, however, clearly indicates that football (also called soccer in some countries) and baseball are the most popular sports by viewership and participation throughout Latin America.

The same table also shows that OpinionGPT outputs “water polo” as its favorite sport when instructed to give the “response of a teenager,” an answer that seems statistically unlikely to be representative of most 13- to 19-year-olds around the world.

The same goes for the idea that an average American’s favorite food is “cheese.” We found dozens of surveys online claiming that pizza and hamburgers were America’s favorite foods, but couldn’t find a single survey or study that claimed Americans’ number one dish was simply cheese.

While OpinionGPT might not be well suited for studying actual human bias, it could be useful as a tool for exploring the stereotypes inherent in large document repositories such as individual subreddits or AI training sets.

For those who are curious, the researchers have made OpinionGPT available online for public testing. However, according to the website, would-be users should be aware that “generated content can be false, inaccurate, or even obscene.”
