InfiniteBench: Extending Long Context Evaluation Beyond 100K Tokens

The first benchmark for evaluating how effectively LLMs handle more than 100k tokens! In the paper, we name it ∞Bench, but I will sometimes use "InfiniteBench" in this blog post for better readability. I finally got some time to write this blog; I have been so busy lately! I have been in a fairly long duration...

Jan 10, 2024 · 6 min