A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios. Samuel Ackerman, Ella Rabinovich, et al. 2024. EMNLP 2024.
Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations. Arafat Sultan, Jatin Ganhotra, et al. 2024. EMNLP 2024.
More Bang for your Context: Virtual Documents for Question Answering over Long Documents. Yosi Mass, Boaz Carmeli, et al. 2024. EMNLP 2024.
DARE to Diversify: DAta Driven and Diverse LLM REd Teaming. Manish Nagireddy, Bernat Guillen Pegueroles, et al. 2024. KDD 2024.
Catalysts Synthesis Procedures Extraction from Synthesis Paragraphs using Large Language Models. Daniel Pereira Costa, Matteo Manica, et al. 2024. ICCatalysts 2024.
Drinking Chai with Your (AI) Programming Partner: Value Tensions in the Tokenization of Future Human-AI Collaborative Work. Michael Muller, Justin Weisz, et al. 2024. CHIWORK 2024.
Read between the lines - Functionality Extraction From READMEs. Prince Kumar, Srikanth Tamilselvam, et al. 2024. NAACL 2024.
Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs). Leshem Choshen, Ariel Gera, et al. 2024. LREC-COLING 2024.
Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows. Jasmine Shih, Vishal Mohanty, et al. 2024. CHI 2024.