A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios. Samuel Ackerman, Ella Rabinovich, et al. 2024. EMNLP 2024.
Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs). Leshem Choshen, Ariel Gera, et al. 2024. LREC-COLING 2024.
Deploying automated ticket router across the enterprise. Samuel Ackerman, Lincoln Alexander, et al. 2023. AI Magazine.
Workflow Provenance in the Lifecycle of Scientific Machine Learning. Renan Francisco Santos Souza, Leonardo Guerreiro Azevedo, et al. 2021. CCPE.