| LLM2024 vs Steve | 6–9 | 40.00% |
| LLM2024 vs Eddy | 3–6 | 33.33% |
| LLM2024 vs Jin | 3–5 | 37.50% |
| LLM2024 vs Azucena | 2–6 | 25.00% |
| LLM2024 vs Jun | 1–5 | 16.67% |
| LLM2024 vs Reina | 4–2 | 66.67% |
| LLM2024 vs Lidia | 4–2 | 66.67% |
| LLM2024 vs Clive | 2–4 | 33.33% |
| LLM2024 vs Hwoarang | 2–3 | 40.00% |
| LLM2024 vs Leo | 3–2 | 60.00% |
| LLM2024 vs Victor | 2–3 | 40.00% |
| LLM2024 vs King | 0–4 | 0.00% |
| LLM2024 vs Dragunov | 0–4 | 0.00% |
| LLM2024 vs Leroy | 4–0 | 100.00% |
| LLM2024 vs Miary Zo | 0–4 | 0.00% |
| LLM2024 vs Lili | 0–3 | 0.00% |
| LLM2024 vs Nina | 2–1 | 66.67% |
| LLM2024 vs Lee | 0–2 | 0.00% |
| LLM2024 vs Anna | 0–2 | 0.00% |
| LLM2024 vs Armor King | 0–2 | 0.00% |
| LLM2024 vs Law | 1–0 | 100.00% |
Limitations
This data is often requested to give insight into which characters you have more trouble with than others, but it is not particularly helpful for that. The main issue is that it is heavily skewed by how strong the opponents you play are.
For example, this data suggests my worst matchup is clearly vs Reina, but that's just because most of those games are vs Yagami.
There is a way to account for this being worked on. The central idea is to assign each matchup a rating vs you which adjusts based on the result, much like the regular rating but also based on the rating of each player. With this, it would give a better summary of how well you perform vs each character.
In the meantime, this page is here to present the data as requested.