What can and can't language models do? Lessons learned from BIGBench
So what exactly can and can’t language models do? What is the least impressive thing GPT-4 won’t be able to do?
BIGBench is one way to figure this out. BIGBench, short for the “Beyond the Imitation Game Benchmark”, is an attempt to explore the capabilities of large language models across a wide variety of tasks. All the tasks are enumerated here.
I looked through every BIGBench task and kept the ones that compared both GPT-3 and PaLM against humans.
* Spreadsheet