Skip to main content
Ctrl+K

CAPPr 0.9.6 documentation

  • About
  • User Guide
  • Research
  • API Reference
  • Local development
  • GitHub
  • About
  • User Guide
  • Research
  • API Reference
  • Local development
  • GitHub

Section Navigation

  • Related work
  • Computational performance
  • Statistical performance
  • Research
  • Related work

Related work#

The idea of aggregating token log-probabilities is well known. You’ll find it as a subroutine in papers from GPT-2[1] to Self-Consistency[2] to InPars[3] to hallucination detection[4] to SimPO[5]. The cappr implementation includes a few computational and statistical optimizations, while maintaining a simple interface.

Here are some papers which focus on the idea of aggregating token log-probabilities.

This paper[6] presents a transposed version of CAPPr. Its method was used in CAPPr’s demo for the Winograd Schema Challenge.

PET with multiple masks[7] also aggregates token log-probabilities to do prompt-completion classification. But these log-probabilities are assumed to come from masked language models like BERT.

References#

[1]

Radford, Alec, et al. “Language models are unsupervised multitask learners.” OpenAI blog 1.8 (2019): 9.

[2]

Wang, Xuezhi, et al. “Self-consistency improves chain of thought reasoning in language models.” arXiv preprint arXiv:2203.11171 (2022).

[3]

Bonifacio, Luiz, et al. “Inpars: Data augmentation for information retrieval using large language models.” arXiv preprint arXiv:2202.05144 (2022).

[4]

Guerreiro, Nuno M., Elena Voita, and André FT Martins. “Looking for a needle in a haystack: A comprehensive study of hallucinations in neural machine translation.” arXiv preprint arXiv:2208.05309 (2022).

[5]

Meng, Yu, Mengzhou Xia, and Danqi Chen. “SimPO: Simple Preference Optimization with a Reference-Free Reward.” arXiv preprint arXiv:2405.14734 (2024).

[6]

Trinh, Trieu H., and Quoc V. Le. “A simple method for commonsense reasoning.” arXiv preprint arXiv:1806.02847 (2018).

[7]

Schick, Timo, and Hinrich Schütze. “It’s not just size that matters: Small language models are also few-shot learners.” arXiv preprint arXiv:2009.07118 (2020).

previous

Research

next

Computational performance

On this page
  • References
Show Source

© Copyright 2023-2024, Kush Dubey.

Created using Sphinx 7.3.7.

Built with the PyData Sphinx Theme 0.16.0.