GPQA Diamond
GPQA Diamond — Übersicht
GPQA Diamond ist ein Subset des GPQA-Benchmarks (Graduate-Level Google-Proof Questions and Answers) mit den 198 anspruchsvollsten Multiple-Choice-Fragen aus dem GPQA Datensatz. Der Benchmark umfasst Themen aus der Biologie, Physik und Chemie. Das Diamond-Subset enthält ausschließlich die Fragen, bei denen die beiden Experten. die den Benchmark validiert haben, korrekt geantwortet haben, während die Mehrheit der Nicht-Experten falsch lag. Die Fragen aus GPQA-Diamond sind auf Bachelor-Niveau konzipiert und "Google-proof". Das bedeutet, dass selbst hochqualifizierte Nicht-Experten trotz uneingeschränktem Internetzugang nur 22,1% Genauigkeit in GPQA-Diamond erreichen.
GPQA Diamond Leaderboard
Ranking aller getesteten Modelle im GPQA Diamond Benchmark, sortiert nach Score.
Beispielaufgaben aus dem GPQA Diamond Benchmark
Die folgenden Beispielaufgaben zeigen typische Fragestellungen, die im GPQA Diamond Benchmark vorkommen.
Two quantum states with energies E1 and E2 have a lifetime of 10^-9 sec and 10^-8 sec, respectively. We want to clearly distinguish these two energy levels. Which one of the following options could be their energy difference so that they can be clearly resolved?
(A) 10^-4 eV
(B) 10^-11 eV
(C) 10^-8 eV
(D) 10^-9 eV
(A) 10^-4 eV
1 out of every 400 people has a cancer caused by a completely recessive allele, b. Assuming the population is in Hardy-Weinberg equilibrium, which of the following is the expected proportion of individuals who carry the b allele but are not expected to develop the cancer?
(A) 1/400
(B) 19/400
(C) 20/400
(D) 38/400
(D) 38/400
A Fe pellet of 0.056g is first dissolved in 10mL of hydrobromic acid HBr (0.1M). The resulting solution is then titrated by KMnO4 (0.02M). How many equivalence points are there?
(A) Two points, 25ml and 35ml
(B) One point, 25mL
(C) One point, 10ml
(D) Two points, 25ml and 30ml
(A) Two points, 25ml and 35ml
Two chemists walk into a lab. One of them asks the other "So, how did it go?" The second chemist replies, "Not well - my compounds are on top of each other." What is the second chemist most likely referring to?
(A) The compounds they are working with have similar polarities.
(B) The compounds they are working with have similar boiling points.
(C) The compounds they are working with are bonding to each other through non-covalent/van der Waals interactions.
(D) The compounds they are working with have similar optical rotations.
(A)
Mutations of which of the mitochondrial proteins listed below are least likely to be genetically transmitted from a father to his children?
(A) Translocase of inner mitochondrial membrane 17B
(B) ATP binding cassette subfamily B member 8
(C) NADH dehydrogenase 2
(D) Tu translation elongation factor, mitochondrial
(C) NADH dehydrogenase 2