Great article on the nuances of Gemini and why many have labeled it (pun intended) as woke.
“Today, the bots are different: they appear to know right from wrong and truth from lies. Simplifying a bit, this is a product of a mechanism known as reinforcement learning from human feedback (RLHF). In RLHF, a largely-formed LLM is fine-tuned by presenting it with a variety of prompts, and then browbeating the model to give the answers that get the highest marks from human reviewers.”
https://lcamtuf.substack.com/p/gemini-how-did-we-end-up-here
