DiscoX Bench invited more than 130 vertical-domain experts from various fields, including professionals with more than 3 years of experience and master's/doctoral students from world-class universities, to construct the evaluation dataset. Each text in the dataset is required to exceed 1,500 words and is sourced from real-world industry and academic scenarios. The texts are logically coherent and of high quality, designed to challenge the upper limits of current large models' translation capabilities.
| Primary Category | Secondary Category | Count |
|---|---|---|
| Academic Papers | Social Science Papers | 38 |
| | Natural Science Papers | 35 |
| | Humanities Papers | 28 |
| | Applied Science Papers | 20 |
| Non-academic Tasks | News and Information | 37 |
| | Domain-Specific Scenarios | 28 |
| | Literature and Arts | 14 |
| Total | | 200 |
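
As a quick sanity check on the table above, the per-category counts can be tallied into subtotals for each primary category and compared against the stated total. This is only an illustrative sketch: the `counts` dictionary is a hand-copied transcription of the table, not a file or API shipped with the dataset.

```python
# Counts transcribed by hand from the table above (illustrative layout,
# not an official data file of the benchmark).
counts = {
    "Academic Papers": {
        "Social Science Papers": 38,
        "Natural Science Papers": 35,
        "Humanities Papers": 28,
        "Applied Science Papers": 20,
    },
    "Non-academic Tasks": {
        "News and Information": 37,
        "Domain-Specific Scenarios": 28,
        "Literature and Arts": 14,
    },
}

# Sum the secondary categories under each primary category.
subtotals = {primary: sum(sub.values()) for primary, sub in counts.items()}

# Sum the subtotals to get the overall number of texts.
total = sum(subtotals.values())

for primary, n in subtotals.items():
    print(f"{primary}: {n} ({n / total:.1%})")
print(f"Total: {total}")
```

Under this transcription, academic papers account for 121 of the 200 texts and non-academic tasks for the remaining 79.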