Property:title
Jump to navigation
Jump to search
2
Knowledge Graph Embeddings in the Biomedical Domain: Are They Useful? A Look at Link Prediction, Rule Learning, and Downstream Polypharmacy Tasks +
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain +
L-Eval: Instituting Standardized Evaluation for Long Context Language Models +
LLM Instruction-Example Adaptive Prompting (LEAP) Framework for Clinical Relation Extraction +
Large Language Model Is Not a Good Few-shot Information Extractor, But a Good Reranker for Hard Samples! +
Large Language Models Are Legal But They Are Not: Making the Case for a Powerful LegalLLM +
Large Language Models As Optimizers +
Large Language Models in Law: A Survey +
LawBench: Benchmarking Legal Knowledge of Large Language Models +
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models +
Let's Verify Step by Step +
Levels of AGI: Operationalizing Progress on the Path to AGI +
Linear Classifier: An Often-Forgotten Baseline for Text Classification +
Llama 2: Open Foundation and Fine-Tuned Chat Models +
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models +
LongNet: Scaling Transformers to 1,000,000,000 Tokens +
MAUVE Scores for Generative Models: Theory and Practice +
Machine Learning for Synthetic Data Generation: A Review +
Mamba: Linear-Time Sequence Modeling with Selective State Spaces +
Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality +
Orca 2: Teaching Small Language Models How to Reason +
Orca 2: Teaching Small Language Models How to Reason +
PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback +
Power and Progress: Our Thousand-Year Struggle over Technology and Prosperity +
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing +
Prompt Engineering a Prompt Engineer +
ReST Meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent +
Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey +
Reflexion: An Autonomous Agent with Dynamic Memory and Self-reflection +
Reinforced Self-Training (ReST) for Language Modeling +
Scalable Extraction of Training Data from (Production) Language Models +
Scaling Deep Learning for Materials Discovery +
Scaling People: Tactics for Management and Company Building +
Scenario Planning for An AGI Future +
Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability +
Self-Instruct: Aligning Language Models with Self-Generated Instructions +
Simulation-Driven Automated End-to-End Test and Oracle Inference +
Sparks of Artificial General Intelligence: Early Experiments with GPT-4 +
Survey of Vector Database Management Systems +
Synthetically Generated Text for Supervised Text Analysis +
TaskWeaver: A Code-First Agent Framework +
Text Mining Legal Documents for Clause Extraction +
The Dawn of LMMs: Preliminary Explorations with GPT-4V (ision) +
The Future of Jobs Report 2023 +
The Next Generation of Evidence-based Medicine +
Toolformer: Language Models Can Teach Themselves to Use Tools +
Towards Expert-Level Medical Question Answering with Large Language Models +
Tuning Language Models As Training Data Generators for Augmentation-Enhanced Few-Shot Learning +
Unifier: A Unified Retriever for Large-scale Retrieval +
Unifying Large Language Models and Knowledge Graphs: A Roadmap +
Universal and Transferable Adversarial Attacks on Aligned Language Models +
Universal and Transferable Adversarial Attacks on Aligned Language Models +
Visual Instruction Tuning +
Voyager: An Open-Ended Embodied Agent with Large Language Models +
What to Read in a Contract? Party-Specific Summarization of Legal Obligations, Entitlements, and Prohibitions +
Who Needs External References?âText Summarization Evaluation Using Original Documents +
Zero-shot Conversational Summarization Evaluations with Small Large Language Models +
Zero-shot Information Extraction via Chatting with Chatgpt +
Zero-shot Information Extraction via Chatting with Chatgpt +
Zero-Shot Question Answering over Financial Documents Using Large Language Models +
12 Predictions for the Future of Technology +
2027 AGI, China/US Super-Intelligence Race, \& The Return of History +
6 Common Leadership Styles â and How to Decide Which to Use When +
AI: Unexplainable, Unpredictable, Uncontrollable +
A Right to Warn About Advanced Artificial Intelligence +
A Whole-slide Foundation Model for Digital Pathology from Real-world Data +
Addressing Annotated Data Scarcity in Legal Information Extraction +
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents +
An Interactive Agent Foundation Model +
Answering Questions in Stages: Prompt Chaining for Contract QA +
Are Emergent Abilities of Large Language Models a Mirage? +
ArtPrompt: ASCII Art-based Jailbreak Attacks Against Aligned LLMs +
Benchmarking Large Language Models for News Summarization +
Better Call GPT, Comparing Large Language Models Against Lawyers +
Better & Faster Large Language Models via Multi-token Prediction +
Blurring the Line Between Human and Machine Minds: Is U.S. Law Ready for Artificial Intelligence? +
By Default, Capital Will Matter More Than Ever After AGI +
Can AI Scaling Continue Through 2030? +
Capabilities of Gemini Models in Medicine +
Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering +
De Novo Design of High-affinity Protein Binders with AlphaProteo +
Diffusion Models Are Real-Time Game Engines +
DiscoLQA: Zero-shot Discourse-based Legal Question Answering on European Legislation +
Does ChatGPT Have a Mind? +
DrEureka: Language Model Guided Sim-To-Real Transfer +
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions +
Efficient Exploration for LLMs +
Evaluating Text Summaries Generated by Large Language Models Using OpenAI's GPT +
Extensible Prompts for Language Models on Zero-shot Language Style Customization +
False Positives in A/B Tests +
Frontier Models Are Capable of In-context Scheming +
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models +
Gemini 1.5: Unlocking Multimodal Understanding Across Millions of Tokens of Context +
Generative Agent Simulations of 1,000 People +
Genie: Generative Interactive Environments +
Grandmaster-Level Chess Without Search +
Grandmaster-Level Chess Without Search +
GraphRAG: Unlocking LLM Discovery on Narrative Private Data +
Hallucination Diversity-Aware Active Learning for Text Summarization +
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools +