1 Comment
User's avatar
User's avatar
Comment removed
Jan 22
Comment removed
Conor Griffin's avatar

Reading some of the case studies cited in the survey, it's a pity all those insights can't somehow feed back into more recurring, systematic 'real-world evals' of the systems' usefulness + risks. So much nuances and little benefits/risks that probably get overlooked in evals that are 1-2 steps removed from real-world use.