Hard problems that reduce to document ranking
Language models can effectively perform listwise document ranking, particularly useful in identifying N-day vulnerabilities through patch diffing. The technique transforms complex security problems into document ranking tasks, demonstrated successfully in locating vulnerable functions among patch diffs using GPT-4 mini with minimal cost and time.