It probably uses Retrieval Augmented Generation, which can still hallucinate, but usually does a better job for niche questions and it can even provide a source sometimes depending on how you set it up
Per sci-hub.ru this has been available since March 6th.
"Hear the good news: recent advances in artificial intelligence enabled Sci-Hub to launch a robot that gives scientifically-grounded responses to questions. The robot starts with searching for relevant literature in Sci-Hub database, then turns to selecting and reading most recent studies, and composes the answer based on this information. The answer includes all the references, and each referenced article can be read on Sci-Hub with one click.
Unlike question-answering robots that were based upon the early generation of neural networks, Sci-Hub bot does not hallucinate and is not making up scientific facts and does not cite sources that do not exist. To support its statements, Sci-Bot uses articles from Sci-Hub database. Questions can be asked in any language, and answers can be saved on server and shared.
The alpha version only supports answerig one question, and a more advanced variation that supports conversation mode is coming soon. Right column displays example questions that has been answered by robot - push the question to see the generated answer."
Thanks for doing what I should have done, I actually red that and thought it sounded great.
The claim of “no hallucination” should of course be taken with a grain of salt, as other comments have pointed out.
Sci-hub has been an invaluable resource. I posted a question yesterday at work. There was a queue, and it was time to leave, so I’ll see what the result was when I get over there. I’ve avoided using AI, but this was too tempting. My question was in a area where I have some knowledge, so I’m hoping I’ll be able to spot any problems in the reply.
Speaking of hallucinations, I think the best way to see them is to go to Google Gemini (Reddit is selling them Reddit posts) and start a conversation about Reddit account you have and act as you don’t know anything. It usually starts good but as it progresses you can see how it is making shit up. The more you ask the more insane it gets.
And this is supposedly having all the comments at its disposal.
I also tried Lemmy as I’m sure they are also indexing it. It is telling me that I’m actually admin who created Lemmy.dbzer0.com
I doubt it’s fine-tuned, it’s likely just one of the open-weight LLMs with RAG. I’ve done similar things, and they don’t really work as well as I’d like (the most relevant chunks of text aren’t always ranked the highest/have the least embedding distance, and the models still hallucinate sometimes).
And without hallucinations ??? That sounds freaking awesome
Of course not.
Aye?
You’re them! You’re the person! Holy shit!!
That’s why you hate the internet???
Clearly.
Sorry 'bout that
Yeah they added “Don’t hallucinate” to the prompt.
Seems like the kind of prompt a hallucination would say
Likely not
yeah, no.
It probably uses Retrieval Augmented Generation, which can still hallucinate, but usually does a better job for niche questions and it can even provide a source sometimes depending on how you set it up
deleted by creator
Obviously not, because that’s not possible.
What fun would that be?
I’ll keep the hallucinations for myself, tyvm.
Per sci-hub.ru this has been available since March 6th.
"Hear the good news: recent advances in artificial intelligence enabled Sci-Hub to launch a robot that gives scientifically-grounded responses to questions. The robot starts with searching for relevant literature in Sci-Hub database, then turns to selecting and reading most recent studies, and composes the answer based on this information. The answer includes all the references, and each referenced article can be read on Sci-Hub with one click.
Unlike question-answering robots that were based upon the early generation of neural networks, Sci-Hub bot does not hallucinate and is not making up scientific facts and does not cite sources that do not exist. To support its statements, Sci-Bot uses articles from Sci-Hub database. Questions can be asked in any language, and answers can be saved on server and shared.
The alpha version only supports answerig one question, and a more advanced variation that supports conversation mode is coming soon. Right column displays example questions that has been answered by robot - push the question to see the generated answer."
Thanks for doing what I should have done, I actually red that and thought it sounded great. The claim of “no hallucination” should of course be taken with a grain of salt, as other comments have pointed out.
Sci-hub has been an invaluable resource. I posted a question yesterday at work. There was a queue, and it was time to leave, so I’ll see what the result was when I get over there. I’ve avoided using AI, but this was too tempting. My question was in a area where I have some knowledge, so I’m hoping I’ll be able to spot any problems in the reply.
I’d be interested in having your feedback !!
LOL, of course not.
Speaking of hallucinations, I think the best way to see them is to go to Google Gemini (Reddit is selling them Reddit posts) and start a conversation about Reddit account you have and act as you don’t know anything. It usually starts good but as it progresses you can see how it is making shit up. The more you ask the more insane it gets.
And this is supposedly having all the comments at its disposal.
I also tried Lemmy as I’m sure they are also indexing it. It is telling me that I’m actually admin who created Lemmy.dbzer0.com
From what I understand from the sales brochure, these types of “AI” that are modeled on highly curated data are far less prone to hallucinations.
I doubt it’s fine-tuned, it’s likely just one of the open-weight LLMs with RAG. I’ve done similar things, and they don’t really work as well as I’d like (the most relevant chunks of text aren’t always ranked the highest/have the least embedding distance, and the models still hallucinate sometimes).
Hallucination is Inevitable.