k-nucleotide benchmark | Python Interpreters Benchmarks Game

k-nucleotide benchmark N=10,000

Each chart bar shows how many times slower, one ↓ k-nucleotide program was, compared to the fastest program.

These are not the only programs that could be written. These are not the only compilers and interpreters. These are not the only programming languages.

Column × shows how many times more each program used compared to the benchmark program that used least.

×	Program Source Code	CPU secs	Elapsed secs	Memory KB	Code B	≈ CPU Load
		sort		sort	sort
1.0	Pyston #2	0.07	0.05	?	801	0% 20% 0% 100% 20% 0% 17% 20%
1.1	Python development version #2	0.07	0.05	?	801	0% 0% 80% 0% 20% 20% 33% 0%
1.2	Pyston #3	0.11	0.05	?	2011	40% 60% 0% 17% 33% 43% 0% 33%
1.2	Pyston #8	0.12	0.05	?	777	43% 17% 17% 33% 33% 33% 0% 60%
1.3	Python development version #3	0.12	0.06	?	2011	80% 17% 17% 29% 29% 40% 17% 33%
1.3	Python development version #8	0.12	0.06	?	777	67% 20% 20% 17% 29% 33% 20% 43%
1.3	Nuitka #2	0.08	0.06	?	801	14% 71% 14% 17% 0% 17% 17% 0%
1.3	Python development version	0.06	0.06	?	594	25% 14% 14% 14% 0% 100% 14% 0%
1.3	Python 3 #2	0.08	0.06	?	801	57% 50% 14% 0% 25% 17% 29% 17%
1.5	Nuitka #3	0.12	0.07	?	2011	29% 17% 0% 0% 86% 14% 14% 29%
1.5	Python 3 #8	0.14	0.07	?	777	14% 0% 0% 71% 29% 17% 17% 29%
1.5	Nuitka #8	0.13	0.07	?	777	33% 14% 14% 14% 86% 17% 17% 0%
1.5	Python 3 #3	0.13	0.07	?	2011	0% 0% 25% 14% 17% 29% 86% 17%
1.5	Cython	0.07	0.07	?	618	100% 22% 0% 0% 0% 0% 0% 0%
1.6	Python 2	0.07	0.07	?	593	100% 0% 0% 0% 0% 0% 0% 0%
1.7	Nuitka	0.08	0.08	?	594	0% 89% 0% 0% 0% 0% 0% 0%
1.7	Python 3	0.08	0.08	?	594	100% 0% 11% 20% 22% 0% 0% 12%
2.0	PyPy 2	0.09	0.09	?	593	22% 18% 0% 100% 0% 0% 0% 0%
2.2	PyPy 3	0.10	0.10	?	594	0% 100% 0% 0% 0% 9% 0% 0%
2.8	Python 2 #8	0.13	0.13	?	777	8% 21% 8% 0% 7% 21% 38% 8%
2.8	Python 2 #2	0.06	0.13	?	801	0% 23% 0% 13% 0% 8% 0% 0%
6.2	PyPy 2 #8	0.29	0.28	492,196	777	7% 13% 4% 10% 4% 10% 71% 11%
6.2	PyPy 2 #2	0.23	0.28	526,016	801	4% 0% 66% 4% 3% 3% 7% 3%
7.3	PyPy 3 #2	0.38	0.34	2,348	801	11% 3% 0% 97% 6% 6% 3% 9%
7.8	PyPy 3 #3	0.46	0.36	1,856	2011	3% 8% 91% 8% 8% 6% 8% 11%
8.0	PyPy 3 #8	0.45	0.37	1,016	777	0% 92% 5% 8% 3% 8% 8% 3%
14	RustPython	0.64	0.64	15,724	594	3% 2% 100% 0% 0% 0% 2% 2%
18	Graal	2.14	0.81	612,300	594	30% 4% 63% 58% 44% 52% 25% 6%
47	Jython	4.89	2.15	3,448	593	34% 52% 20% 30% 23% 31% 21% 16%
missing benchmark programs
	IronPython	No program
	Shedskin	No program
	Numba	No program
	MicroPython	No program
	Grumpy	No program

k-nucleotide benchmark : Hashtable update and k-nucleotide strings

diff program output for this 250KB input file (generated with the fasta program N = 25000) with this output file to check your program is correct before contributing.

We are trying to show the performance of various programming language implementations - so we ask that contributed programs not only give the correct result, but also use the same algorithm to calculate that result.

We use FASTA files generated by the fasta benchmark as input for this benchmark. Note: the file may include both lowercase and uppercase codes.

Each program should

read line-by-line a redirected FASTA format file from stdin
extract DNA sequence THREE
define a procedure/function to update a hashtable of k-nucleotide keys and count values, for a particular reading-frame — even though we'll combine k-nucleotide counts for all reading-frames (grow the hashtable from a small default size)
use that procedure/function and hashtable to
- count all the 1-nucleotide and 2-nucleotide sequences, and write the code and percentage frequency, sorted by descending frequency and then ascending k-nucleotide key
- count all the 3- 4- 6- 12- and 18-nucleotide sequences, and write the count and code for the specific sequences GGT GGTA GGTATT GGTATTTTAATT GGTATTTTAATTTATAGT

In practice, less brute-force would be used to calculate k-nucleotide frequencies, for example Virus Classification using k-nucleotide Frequencies and A Fast Algorithm for the Exhaustive Analysis of 12-Nucleotide-Long DNA Sequences. Applications to Human Genomics (105KB pdf).

Home Conclusions License Play

Python Interpreters Benchmarksx64 ArchLinux : AMD® Ryzen 7 4700U®

k-nucleotide benchmark N=10,000

k-nucleotide benchmark : Hashtable update and k-nucleotide strings

Python Interpreters Benchmarks
x64 ArchLinux : AMD® Ryzen 7 4700U®