Finally getting around to adding some notes…
Seems somewhat testable! Randomly assign users (or queries, within subjects) to search-chunk-generate or just generate and look for things like time, computation, and accuracy trade-offs. But these also depend on user skills/practices and prompting/querying might shift. And these depend on and can be compared to changes in the query-rewriting that the system does.
HT:
a tweet from me…
Quick. Pretty.
(Interesting re the big recent @perplexity_ai splash; now available as a default on @browsercompany’s Arc.)
I want more feedback options. Looking forward to Share being enabled on these. Curious what content creators think. Non-‘Browse for Me’ search is Google?!
Hugging Face is thinking of adding “RAG (and web search)” to their new Hugging Chat Assistants: huggingface.co/chat/assistants
a tweet from me…
“Add RAG (and web search) to Assistant”
Looking forward to following this.
@huggingface could provide users choice over a range of web search sources, tools to evaluate both fit-for-purpose & effective performance, and open analytics for researchers, devs, & content creators.
Currently they support a web search option in their HuggingChat: huggingface.co/chat/
via a complaint from Neal Parikh
Query: [NYC government used to have a different structure than used today, possibly in the 1940s, but I can’t remember. Please explain.]
Search intent: New York City Board of Estimate
This includes raw links to AI generated content on LLM and generative web search platforms:
Mention of the 1940s seems to serve as a bit of a distractor for Perplexity and 7 other search tools I tried. ChatGPT 4 got it, as did Bard. Exa had it in the third result.
Perplexity AI w/ distractor / maybe-useful-hint.
Perplexity AI w/o distractor / maybe-useful-hint.
See the thread of replies for a WIDE range of responses…
Multiple attempts may suggest a different pattern.
I’m always looking to explore new approaches to the search experience.
Gao, L., Ma, X., Lin, J., & Callan, J. (2022). Precise zero-shot dense retrieval without relevance labels. http://arxiv.org/abs/2212.10496 [gao2022precise]
Zamfirescu-Pereira, J. D., Wong, R. Y., Hartmann, B., & Yang, Q. (2023). Why johnny can’t prompt: How non-ai experts try (and fail) to design llm prompts. Proceedings of the 2023 Chi Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544548.3581388 [zamfirescu-pereira2023johnny]