Gene Information |
Locus tag | GM21_3528 |
Symbol | |
ID | 8138900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4071600 |
End bp | 4073054 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644871147 |
Product | phosphomethylpyrimidine kinase |
Protein accession | YP_003023307 |
Protein GI | 253702118 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase; [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00097] phosphomethylpyrimidine kinase; [TIGR00693] thiamine-phosphate pyrophosphorylase |
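
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the listed Gene Length is consistent with that reading) and that the CDS ends in a single stop codon that is not counted in the Protein Length. The function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
START_BP = 4071600
END_BP = 4073054
GENE_LENGTH_BP = 1455
PROTEIN_LENGTH_AA = 484


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1455
print(expected_protein_length(GENE_LENGTH_BP))  # 484
```

Both results agree with the Gene Length and Protein Length rows, so the record is internally consistent under these assumptions.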
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 96 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTAAGAC TCGTGGTGGA TCACAGCGGC AAGGAGCGGC GCATCGGGGG GCTCTACCTC ATCACGGACC AAGCGGAACG CCTGGTCCAC CGCGTGCGCG AGGCGCTCTC CTCCGGAGGG GTCGCCGTCC TGCAGTACCG GGACAAGGTC CGCGCCTACG AGGAACGCCT GGAACTGGGA CAGGAGCTGA AACACCTCTG CACGGAATTC CAGGTGGAAT TCATCGTCAA CGACGACGTC GAACTGGCGC TAGCCCTCGA CGCCGACGGC GTCCACCTGG GGCAGGACGA CGGCGATCCG GCCGCGGCGC GCGAGGCGCT CGGCCCGAAA AAGATGATCG GCATCTCGAC CCACTCGCTT ACCGAAGCGC TCGAGGCGCA GGAGGCCGGC GCCGACTATG TCGGCTTCGG AGCCCTCTAC CCCACCGACA GCAAGGAGGT CGAGCATATC CAGGGGCCGG AGAAGCTCGC GCTTTTGAAG GGGAAGCTGA GGATACCGGT GGTCGCCATC GGCGGCATCG CCAGGGACAA CGCCTGCGCG GTTATCGACG CCGGAGCCGA CGCCATCGCG GTTATCTCGG CGGTGCTCTC CGCCAGATCC CCCGGGCTCG CCGCGACCGA ACTGGCGCTC CTCTTCAACA GGAAGGCGAT GCAGCCGCGC GGCGGCGTGC TAACCGTGGC GGGGAGCGAC TCCGGAGGTG GCGCCGGCAT CCAGGCGGAC CTGAAGACGG TGACCCTTTT GGGAAGCTAC GGCGCCTCGG CCATCACCGC GCTTACCGCA CAGAACACCC GCGGCGTCAA CGCGATCCAC CCGGTCCCGC CCGCCTTCCT CGCGGAGCAG ATCGACGCTG TCCTCTCGGA CATCCCGATC GACGTGGTGA AGGTGGGGAT GCTCTCTTCC GCCGAGAACG CCGCCATCCT CGCCGACAGG CTCACCGCCC ACGGCATGAG GATGGTTGTG CTCGACCCGG TGATGAGCGC CAAGGGCGGC GTGGCGCTCC TGGAGGGCGA GGCGCTGGGC GTGCTGAAAC AGAGGCTTAT CCCGCTTTGC TACCTGCTCA CGCCGAACAT CCCCGAGGCC GAGGCCCTCA CCGGGCTCAC CATCACCGAT ACGGCGGGGA TGGAACTCGC CGCCCGGGCC CTGCACCTCA TGGGGGCGAA GCACGTGCTG GTAAAGGGGG GGCACCTGAC CGAGGGGGTG GTCACCGACA TCCTCTTCGA CGGCGCCGGC TTCACCCGCT TCACGGCTCC GCGCGTACTC ACCCGCAACA CCCACGGCAC CGGCTGCACG CTGGCTTCGG CCATCGCAAG CTACCTGGCC CAGGGGGAAC CGCTCCCCGG CGCGGTGCTC CGGGCGAAGC TCTTCGTCAC GCGCGCGATC AAGTACGCCC AGCCGCTGGG AAAGGGGCAC GGCCCAGTGA ACCATTTCCT CGCCGCCAAA GACCAGGCGG AATAA
|
Protein sequence | MLRLVVDHSG KERRIGGLYL ITDQAERLVH RVREALSSGG VAVLQYRDKV RAYEERLELG QELKHLCTEF QVEFIVNDDV ELALALDADG VHLGQDDGDP AAAREALGPK KMIGISTHSL TEALEAQEAG ADYVGFGALY PTDSKEVEHI QGPEKLALLK GKLRIPVVAI GGIARDNACA VIDAGADAIA VISAVLSARS PGLAATELAL LFNRKAMQPR GGVLTVAGSD SGGGAGIQAD LKTVTLLGSY GASAITALTA QNTRGVNAIH PVPPAFLAEQ IDAVLSDIPI DVVKVGMLSS AENAAILADR LTAHGMRMVV LDPVMSAKGG VALLEGEALG VLKQRLIPLC YLLTPNIPEA EALTGLTITD TAGMELAARA LHLMGAKHVL VKGGHLTEGV VTDILFDGAG FTRFTAPRVL TRNTHGTGCT LASAIASYLA QGEPLPGAVL RAKLFVTRAI KYAQPLGKGH GPVNHFLAAK DQAE
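
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11 (the value listed in the record), GTG is one of the permitted alternative initiation codons, and an initiation codon is read as methionine. The following is a minimal sketch of that rule using only the standard library. It is not the database's own pipeline; `translate_cds` and its `is_cds_start` parameter are hypothetical names introduced for illustration. Only the first 30 and last 15 nucleotides of the gene are used, to keep the example short.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, is_cds_start: bool = True) -> str:
    """Translate a DNA fragment, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of the record can be pasted
    directly. If is_cds_start is True and the first codon is a table 11
    start codon, it is translated as M.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and is_cds_start and codon in START_CODONS_TABLE_11:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


FIRST_30_NT = "GTGCTAAGAC TCGTGGTGGA TCACAGCGGC"
LAST_15_NT = "GACCAGGCGG AATAA"

print(translate_cds(FIRST_30_NT))                      # MLRLVVDHSG
print(translate_cds(LAST_15_NT, is_cds_start=False))   # DQAE
```

The two fragments reproduce the first ten residues (MLRLVVDHSG) and the last four residues (DQAE) of the listed protein sequence, with the final TAA codon acting as the stop.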