• sp3ctr4l@lemmy.zip
    link
    fedilink
    arrow-up
    32
    arrow-down
    1
    ·
    edit-2
    3 hours ago

    I just tried out Gemini.

    I asked it several questions in the form of ‘are there any things of category x which also are in category y?’ type questions.

    It would often confidently reply ‘No, here’s a summary of things that meet all your conditions to fall into category x, but sadly none also fall into category y’.

    Then I would reply, ‘wait, you don’t know about thing gamma, which does fall into both x and y?’

    To which it would reply ‘Wow, you’re right! It turns out gamma does fall into x and y’ and then give a bit of a description of how/why that is the case.

    After that, I would say ‘… so you… lied to me. ok. well anyway, please further describe thing gamma that you previously said you did not know about, but now say that you do know about.’

    And that is where it gets … fun?

    It always starts with an apology template.

    Then, if its some kind of topic that has almost certainly been manually dissuaded from talking about, it then lies again and says ‘actually, I do not know about thing gamma, even though I just told you I did’.

    If it is not a topic that it has been manually dissuaded from talking about, it does the apology template and then also further summarizes thing gamma.

    I asked it ‘do you write code?’ and it gave a moderately lengthy explanation of how it is comprised of code, but does not write its own code.

    Cool, not really what I asked. Then command ‘write an implementation of bogo sort in python 3.’

    … and then it does that.

    Awesome. Hooray. Billions and billions of dollars for a shitty way to reform web search results into a coversational form, which is very often confidently wrong and misleading.

    • taladar@sh.itjust.works
      link
      fedilink
      arrow-up
      7
      ·
      2 hours ago

      And then more money spent on adding that additional garbage filter to the beginning and the end of the process which certainly won’t improve the results.