| Comparison | Percentage Agreement | Cohen’s Kappa |
| --- | --- | --- |
| Human vs Claude 2.1 Ratings | 79% | 0.41 |
| Human vs Titan Express Ratings | 78% | 0.35 |
| Human vs Sonnet 3.5 Ratings | 76% | 0.44 |
| Human vs Llama 3.3 70b Ratings | 79% | 0.39 |
| Human vs Nova Pro | 76% | 0.34 |
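The two metrics reported here can diverge because Cohen's kappa discounts the agreement two raters would reach by chance, which is why agreement near 79% can sit alongside a kappa near 0.4. The following is a minimal sketch of how both are typically computed from paired labels. The function names and the `human` / `model` rating lists are hypothetical illustrations, not the data or code behind the figures above.

```python
from collections import Counter


def percent_agreement(a, b):
    """Fraction of items on which the two raters gave the same label."""
    if len(a) != len(b) or not a:
        raise ValueError("rating lists must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohens_kappa(a, b):
    """Cohen's kappa: (p_o - p_e) / (1 - p_e) for two raters."""
    n = len(a)
    p_o = percent_agreement(a, b)  # observed agreement
    counts_a, counts_b = Counter(a), Counter(b)
    # Chance agreement: probability both raters pick the same label
    # if each labelled independently at their own observed rates.
    labels = set(counts_a) | set(counts_b)
    p_e = sum(counts_a[label] * counts_b[label] for label in labels) / (n * n)
    if p_e == 1.0:
        return float("nan")  # undefined when both raters use one identical label
    return (p_o - p_e) / (1 - p_e)


# Hypothetical ratings, made up purely for illustration.
human = ["good", "good", "bad", "good", "bad", "good", "good", "bad", "good", "good"]
model = ["good", "good", "good", "good", "bad", "good", "bad", "bad", "good", "good"]

print(f"agreement = {percent_agreement(human, model):.2f}")
print(f"kappa     = {cohens_kappa(human, model):.2f}")
```

In this made-up example the raters agree on 8 of 10 items, but because both label about 70% of items "good", chance alone would produce 58% agreement, so kappa comes out at roughly 0.52.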