“Bard may display inaccurate or offensive information”
How might Google support users to calibrate trust within the tool itself?
Showing 13 posts tagged "hallucination"
How might Google support users to calibrate trust within the tool itself?
Research summary of FreshLLMs, a paper introducing search engine-augmented prompting to improve LLM factuality and reduce hallucinations on current world knowledge questions.
Sharing an example hallucination test.
Shares response from Bard and brief additional comments re running Will Knight's "hallucination test".
Thinking about hallucination with Klosterman, Leahu, Munk et al., Rettberg, and Powles & Nissenbaum.
How do you share about misinformation without spreading it? How do you link to the outputs of chatbots and generative search engines without deceiving folks?
Claude 2 fails my Claude Shannon hallucination test, producing a summary of a non-existent publication.
Exploring the challenges of differentiating LLM-generated content in search results and proposing possible actions.
Warnings and ClaimReview?
The briefest introduction.
Towards minimizing harm from “hallucinations” and other baloney.
@simonw sharing about finding misleading claims from Claude 2