Greetings from a world where… I’m enjoying Wong Kar-wai’s foray into C-drama land with Blossoms …As always, the searchable archive of all past issues is here. Please please subscribe here to support ChinAI under a Guardian/Wikipedia-style tipping model (everyone gets the same content but those who can pay support access for all AND compensation for awesome ChinAI contributors).
Thats a cool paper from Princeton on LLMs against CTF challenges. Would you surmise that there already exist AI-agents/LLMs that are capable of achieving or surpassing these researchers’ tasks/benchmarks? Such as more capable or more tailored models than off-the-shelf (OTS) GPT4?
(In the case of CTF Challenges) Would the limitation be the capability of the model itself or is this a product of its generalist/OTS nature?
Hey Professor!
Thats a cool paper from Princeton on LLMs against CTF challenges. Would you surmise that there already exist AI-agents/LLMs that are capable of achieving or surpassing these researchers’ tasks/benchmarks? Such as more capable or more tailored models than off-the-shelf (OTS) GPT4?
(In the case of CTF Challenges) Would the limitation be the capability of the model itself or is this a product of its generalist/OTS nature?