If AI can now speak Italian, it can certainly replace us...

lseif@sopuli.xyz · 25 days ago

If AI can now speak Italian, it can certainly replace us...

impure9435@kbin.run · 25 days ago

The thing that I find the most funny about this post, is the fact that you call this Italian

Peepee Confiscator / Kaca Konfiskisto@sh.itjust.works · 6 days ago

Typical 'muricans being unable to comprehend anything besides English.

/s i don't mean to be racist

yes i was a r/2we4u user, how’d you know?

lseif@sopuli.xyz · 25 days ago

how am i supposed to know how italians speak. i’ve never seen one

jballs@sh.itjust.works · 24 days ago

From my experience, they speak mostly with their hands

Terrasque@infosec.pub · 23 days ago

🫰🤙🫵👌✊🫳🫸🤲🤌

jballs@sh.itjust.works · 23 days ago

Prego

thesporkeffect@lemmy.world · 24 days ago

They’re not real, but they can hurt you.

pewpew@feddit.it · 24 days ago

Ne sei sicuro?

velox_vulnus@lemmy.ml · 25 days ago

Blud could’ve chosen Runic, Egyptian, Ancient Romanian used by Vlad the Impaler, Mesapotamian or even Harappan Indic. But Italian is it.

Phoenix3875@lemmy.world · 24 days ago

Let me simplify it: proceeds to print the same expression

ChanchoManco@lemm.ee · edit-2 24 days ago

Typical AI behavior

Edit: and then it will gaslight you if you say the answer is the same.

driving_crooner@lemmy.eco.br · 24 days ago

Fucking hate when do that.

You are repeating the same mistake.

I’m sorry for repeating the same mistake, here’s a new solution with corrections *proceed to write the exactly thing already told it was wrong*

stingpie@lemmy.world · 24 days ago

This might be happening because of the ‘elegant’ (incredibly hacky) way openai encodes multiple languages into their models. Instead of using all character sets, they use a modulo operator on each character, to make all Unicode characters represented by a small range of values. On the back end, it somehow detects which language is being spoken, and uses that character set for the response. Seeing as the last line seems to be the same mathematical expression as what you asked, my guess is that your equation just happened to perfectly match some sentence that would make sense in the weird language.

PlexSheep@infosec.pub · 24 days ago

Do you have a source for that? Seems like an internal detail a corpo wouldn’t publish

stingpie@lemmy.world · 24 days ago

Can’t find the exact source–I’m on mobile right now–but the code for the gpt-2 encoder uses a utf-8 to unicode look up table to shrink the vocab size. https://github.com/openai/gpt-2/blob/master/src/encoder.py

crispy_kilt@feddit.de · 22 days ago

Seriously? Python for massive amounts of data? It’s a nice scripting language, but it’s excruciatingly slow

stingpie@lemmy.world · 22 days ago

There are bindings in java and c++, but python is the industry standard for AI. The libraries for machine learning are actually written in c++, but use python language bindings. Python doesn’t tend to slow things down since machine learning is gpu-bound anyway. There are also library specific programming languages which urges the user to make pythonic code that can be compiled into c++.

NeatNit@discuss.tchncs.de · 24 days ago

I suppose it’s conceivable that there’s a bug in converting between different representations of Unicode, but I’m not buying and of this “detected which language is being spoken” nonsense or the use of character sets. It would just use Unicode.

The modulo idea makes absolutely no sense, as LLMs use tokens, not characters, and there’s soooooo many tokens. It would make no sense to make those tokens ambiguous.

stingpie@lemmy.world · 23 days ago

I completely agree that it’s a stupid way of doing things, but it is how openai reduced the vocab size of gpt-2 & gpt-3. As far as I know–I have only read the comments in the source code– the conversion is done as a preprocessing step. Here’s the code to gpt-2: https://github.com/openai/gpt-2/blob/master/src/encoder.py I did apparently make a mistake, as the vocab reduction is done through a lut instead of a simple mod.

abrahambelch@programming.dev · 25 days ago

Which language uses these signs? It truly looks like some kind of alien language

nimpnin@sopuli.xyz · 25 days ago

APL?

chapapa@discuss.tchncs.de · edit-2 25 days ago

Glagolitic script. Oldest known Slavic alphabet according to Wikipedia.

82cb5abccd918e03@lemmygrad.ml · 25 days ago

I found it! its the Glagolitic script used in the 9th century before Cyrillic took over:

ⰀⰁⰂⰃⰄⰅⰆⰇⰈⰉⰊⰋⰌⰍⰎⰏⰐⰑⰒⰓⰔⰕⰖⰗⰘⰙⰚⰛⰜⰝⰞⰟⰠⰡⰢⰣⰤⰥⰦⰧⰨⰩⰪⰫⰬⰭⰮⰰⰱⰲⰳⰴⰵⰶⰷⰸⰹⰺⰻⰼⰽⰾⰿⱀⱁⱂⱃⱄⱅⱆⱇⱈⱉⱊⱋⱌⱍⱎⱏⱐⱑⱒⱓⱔⱕⱖⱗⱘⱙⱚⱛⱜⱝⱞ

I Cast Fist@programming.dev · 23 days ago

Title mentions speaking italian

Not a single hand gesture anywhere

I’ve been duped

Annoyed_🦀 🏅@monyet.cc · 25 days ago

That’s not italian that’s obviously Unown

Redex@lemmy.world · 25 days ago

Damn, wild Glagolitic script found. I didn’t even realise it was in the Unicode standard.

XEAL@lemm.ee · 25 days ago

Ah, I see you’re using FartGPT instead of ChatGPT

lseif@sopuli.xyz · 25 days ago

is that the new model ?

Blyfh@lemmy.world · 25 days ago

French pronunciation intensifies

Lem Jukes@lemm.ee · 25 days ago

Cat, I farted.

r00ty@kbin.life · 25 days ago

Wow, an alien ion drive formula! Try to get warp drive out of it too!

QuazarOmega@lemy.lol · 23 days ago

You may not understand, but we do.
Questo segreto rimarrà custodito gelosamente dalla stirpe italica. ◉‿◉

MazonnaCara89@lemmy.ml · 23 days ago

No brother non possiamo tenere questo segreto fino alla fine

QuazarOmega@lemy.lol · 23 days ago

Non c’è scelta, se l’ultimo italiano dovesse lasciarci, allora anche questa informazione dovrà lasciare l’umanità

Iheartcheese@lemmy.world · 23 days ago

breaks spaghetti near you

Peepee Confiscator / Kaca Konfiskisto@sh.itjust.works · 6 days ago

calls SISMI

QuazarOmega@lemy.lol · 23 days ago

Rememeber, whenever you break one spaghetto you break one heart 💔

robigan@lemmy.world · 23 days ago

How about go die in a hole?

Iheartcheese@lemmy.world · 23 days ago

Vitaly@feddit.uk · 25 days ago

Kind of looks like the writing system of Georgian language but I’m not sure

Allero@lemmy.today · 25 days ago

No, this is Glagolitic script, an alternative to Cyrillic. Mostly used in old Slavic scriptures, was later replaced by Cyrillic and Latin.

Most Slavs themselves don’t know how to read this