Google’s new Gemini Pro is the best overall at law stuff
| scholarship | 05/21/25 | | Gay Factory | 05/21/25 | | scholarship | 05/21/25 | | PoastPapi | 05/21/25 | | scholarship | 05/21/25 | | ,.,.,,,,,,,..,.,.,.,.,,.,., | 05/21/25 | | scholarship | 05/21/25 | | ,.,.,,,,,,,..,.,.,.,.,,.,., | 05/21/25 | | scholarship | 05/21/25 | | Behar-Bechukotai | 05/21/25 | | Cowghost | 05/21/25 | | .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,. | 05/21/25 | | Cowghost | 05/21/25 | | .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,. | 05/21/25 | | Cowghost | 05/21/25 | | .,..,..,.,,..,..,,,,,,,,..,...,.,.,., | 05/21/25 | | scholarship | 05/21/25 | | .,..,..,.,,..,..,,,,,,,,..,...,.,.,., | 05/21/25 | | scholarship | 05/21/25 | | .,..,..,.,,..,..,,,,,,,,..,...,.,.,., | 05/21/25 | | .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,. | 05/21/25 | | scholarship | 05/21/25 | | ChadGPT-5 | 05/21/25 | | .;:..;:.;.:.;.,,,..,.:,.;....;,;;;..;,..,,.,,...., | 05/21/25 | | scholarship | 05/21/25 | | .;:..;:.;.:.;.,,,..,.:,.;....;,;;;..;,..,,.,,...., | 05/21/25 | | scholarship | 05/21/25 | | NHH | 05/21/25 | | luis with that nose | 05/21/25 | | cum boil | 05/21/25 | | .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,. | 05/21/25 | | cum boil | 05/21/25 | | ' | 05/21/25 | | elon musk | 05/21/25 | | The Sub-Saharan Hephaestus | 05/21/25 |
Poast new message in this thread
Date: May 21st, 2025 2:15 PM Author: scholarship
https://ai.google.dev/showcase/harvey
Harvey: Validating Gemini 2.5 Pro Preview’s Advanced Legal Reasoning with BigLaw Bench
The legal industry demands AI solutions that can navigate an immense volume of complex information with precision and nuance. Traditional AI benchmarks often fall short of capturing the real-world demands of legal practice, making it challenging to identify models truly capable of high-value legal work. Harvey, a dynamic startup dedicated to transforming legal workflows through AI, confronted this by developing BigLaw Bench, their comprehensive framework for assessing Large Language Model (LLM) performance on tasks mirroring actual legal work. In their recent rigorous evaluations, Gemini 2.5 Pro Preview emerged as a standout performer, demonstrating exceptional potential to improve efficiency in key legal domains.
Gemini 2.5 Pro Preview Leads on BigLaw Bench
Harvey’s recent evaluations leveraging the BigLaw Bench framework have clearly indicated that Gemini 2.5 Pro Preview demonstrates remarkable proficiency across core legal reasoning tasks and, in particular, tasks requiring reasoning over long-form legal inputs or outputs. As shown in Harvey’s publicly shared results, obtained by testing models including Gemini 2.5 Pro Preview via their respective APIs, Gemini 2.5 Pro Previewachieved the leading score of 85.02% on BigLaw Bench, outperforming other models evaluated in this comprehensive assessment.
This leading capability is crucial for a wide range of high-value legal activities. Key evaluation tasks within BigLaw Bench showcased Gemini 2.5 Pro Preview’s strengths:
Transactional due diligence: Gemini 2.5 Pro Preview showed a strong capacity to extract and summarize critical provisions (e.g., assignment, indemnification, termination clauses) from multiple lengthy service agreements. This suggests a significant potential to streamline the time-intensive process of manual document review.
Transaction structuring: The model adeptly generated comprehensive, well-structured comparative analyses of intricate financial options (e.g., PIPE, underwritten equity offerings, bond offerings). The model showed promise in presenting this information in a clear and accessible manner, even for those without deep financial expertise, and in suggesting potential immediate action items.
Litigation drafting: When assessed on tasks related to litigation, Gemini 2.5 Pro Preview exhibited a notable ability to generate detailed outlines for legal briefs based on substantial volumes of briefing documents. This capability points towards a future where AI can significantly aid in the initial stages of legal argument development and organization.
Document review & analysis: Evaluations involving the review of disparate trial documents (call logs, emails, memoranda) revealed Gemini 2.5 Pro Preview’s strength in creating coherent chronological summaries of events. Furthermore, the model showed potential in identifying critical inconsistencies and ambiguities within the record, a crucial aspect of thorough legal analysis.
Across these evaluations, Gemini 2.5 Pro Preview showcased strong reasoning across inputs consisting of hundreds of pages of materials, a common scenario in legal work. In addition, it was capable of using these materials to generate longer-form and comprehensive outputs, allowing for deeper insights and analyses. These core capabilities highlight the potential for leveraging Gemini 2.5 Pro Preview across complex legal work requiring reasoning over large sets of documents to support diligence, review, and drafting use cases.
A New Standard for Legal AI
"At Harvey, we’re committed to equipping legal professionals with the most advanced tools," states Niko Grupen, Head of Applied AI at Harvey. "Our evaluation of Gemini 2.5 Pro Preview through BigLaw Bench has revealed its remarkable ability to synthesize complex legal information. This insight fuels our vision for future product development, where we aim to leverage these strengths to unlock unprecedented efficiency and empower lawyers to focus on higher-level strategic work."
Unlocking the Future of Legal Work
Harvey’s commitment to rigorous evaluation and their insightful analysis of cutting-edge AI models like Gemini 2.5 Pro Preview are demonstrating the transformative potential of AI in the legal field. Their findings pave the way for future innovations that promise to reshape how legal professionals approach their most demanding tasks.
To explore how Gemini 2.5 Pro Preview’s advanced reasoning and synthesis capabilities can power your own applications, visit the Gemini API documentation or get started in Google AI Studio.
Harvey is a participant in Google’s AI Futures Fund that invests in and collaborates with ambitious startups building what’s next in AI.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951245) |
Date: May 21st, 2025 2:36 PM
Author: ,.,.,,,,,,,..,.,.,.,.,,.,.,
the openai model versions are confusing as fuck now. why did 4.1 come after 4.5? why is it seemingly better than 4.5? who is the retard who thought having o4 and 4o model names was a good idea? i really don't know where i should be using o3 vs o4 mini high since they are both reasoning models.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951318) |
 |
Date: May 21st, 2025 2:44 PM
Author: ,.,.,,,,,,,..,.,.,.,.,,.,.,
it seems like they enjoy fucking around with people. now they have 4.1 mini and 4.1 nano. what the fuck.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951354) |
 |
Date: May 21st, 2025 5:14 PM Author: Cowghost
I pay $200 / month and never heard of 4.1
Is 4.1 api only?
I'm still in the o3/4.5 frame and suddenly feel like I'm being ripped off
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951779) |
 |
Date: May 21st, 2025 5:17 PM
Author: .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,.
it was api only for a while but i think it appeared on chatgpt plus in the last few weeks. no clue what the limits are. hopefully it's better than 4.5, which has extremely low limits.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951789) |
 |
Date: May 21st, 2025 5:24 PM Author: Cowghost
I ask gpt every week which model is best for raw complex legal analysis and it's been saying o3 since late April update
4.1 is buried under "More models" with o1 Pro
Is openAI just fucking with me or what?
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951808) |
 |
Date: May 21st, 2025 5:50 PM
Author: .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,.
isn't o3 the best according to that chart?
the decision as to what model to use is incredibly confusing. they should train a model to select what model to use for a particular response.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951873) |
Date: May 21st, 2025 5:21 PM
Author: .,..,..,.,,..,..,,,,,,,,..,...,.,.,.,
deep research mode is shit compared to chatgpt for legal stuff.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951798) |
 |
Date: May 21st, 2025 6:23 PM
Author: .,..,..,.,,..,..,,,,,,,,..,...,.,.,.,
nah just the one on the $25 a month plan because i got a free trial. it wouldnt take any attachments and it kept telling me it couldnt write a brief and i should get a lawyer. i eventually tricked it into writing one (said it was for educational purposes) but it wasnt very good because it didn let me attach anything.
it also can't go into deep research mid-chat like chatgpt can which is another minus since on chatgpt it can easily use what ever has already been said
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951923) |
 |
Date: May 21st, 2025 7:32 PM
Author: .,..,..,.,,..,..,,,,,,,,..,...,.,.,.,
i gotta try it out but i just downgraded my chargpt pro to plus since i have to start paying loans again now lol
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48952093) |
 |
Date: May 21st, 2025 7:36 PM
Author: .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,.
Pro seems pointless. They don’t have o3 high available on it yet. The Deep Think model with Gemini seems like the best premium model available right now.
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48952099) |
Date: May 21st, 2025 6:01 PM
Author: .;:..;:.;.:.;.,,,..,.:,.;....;,;;;..;,..,,.,,....,
Would this be beneficial for tranny lolyers?
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48951887) |
 |
Date: May 21st, 2025 7:17 PM
Author: .;:..;:.;.:.;.,,,..,.:,.;....;,;;;..;,..,,.,,....,
Can it connect clause 9.2 with 7.6? I don't think so
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48952052) |
 |
Date: May 21st, 2025 7:57 PM
Author: .,.,,..,..,.,..:,,:,...,:::,.,.,:,.,.:.,:.,:.::,.
Estimate what your anxiety level will be when use becomes widespread and employers realize they don’t need as many people
(http://www.autoadmit.com/thread.php?thread_id=5728503&forum_id=2#48952168) |
|
|