Cool to see this work from @Jsjcl293905, @davidsimchilevi, and @WillWeiSun deriving efficiency bounds and new estimators for model rankings built on Arena's human preference data. There’s a lot of foundational work to be done to continue to improve the statistical foundations of…