Gene GM21_3528 details

Gene Information

Locus tag: GM21_3528
Symbol:
ID: 8138900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4071600
End bp: 4073054
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 68%
IMG OID: 644871147
Product: phosphomethylpyrimidine kinase
Protein accession: YP_003023307
Protein GI: 253702118
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
        [COG0352] Thiamine monophosphate synthase
TIGRFAM ID: [TIGR00097] phosphomethylpyrimidine kinase
            [TIGR00693] thiamine-phosphate pyrophosphorylase
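
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4071600, 4073054)
print(length)                     # 1455
print(protein_length_aa(length))  # 484
```

Both results agree with the Gene Length (1455 bp) and Protein Length (484 aa) fields.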


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 96
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTAAGAC TCGTGGTGGA TCACAGCGGC AAGGAGCGGC GCATCGGGGG GCTCTACCTC 
ATCACGGACC AAGCGGAACG CCTGGTCCAC CGCGTGCGCG AGGCGCTCTC CTCCGGAGGG
GTCGCCGTCC TGCAGTACCG GGACAAGGTC CGCGCCTACG AGGAACGCCT GGAACTGGGA
CAGGAGCTGA AACACCTCTG CACGGAATTC CAGGTGGAAT TCATCGTCAA CGACGACGTC
GAACTGGCGC TAGCCCTCGA CGCCGACGGC GTCCACCTGG GGCAGGACGA CGGCGATCCG
GCCGCGGCGC GCGAGGCGCT CGGCCCGAAA AAGATGATCG GCATCTCGAC CCACTCGCTT
ACCGAAGCGC TCGAGGCGCA GGAGGCCGGC GCCGACTATG TCGGCTTCGG AGCCCTCTAC
CCCACCGACA GCAAGGAGGT CGAGCATATC CAGGGGCCGG AGAAGCTCGC GCTTTTGAAG
GGGAAGCTGA GGATACCGGT GGTCGCCATC GGCGGCATCG CCAGGGACAA CGCCTGCGCG
GTTATCGACG CCGGAGCCGA CGCCATCGCG GTTATCTCGG CGGTGCTCTC CGCCAGATCC
CCCGGGCTCG CCGCGACCGA ACTGGCGCTC CTCTTCAACA GGAAGGCGAT GCAGCCGCGC
GGCGGCGTGC TAACCGTGGC GGGGAGCGAC TCCGGAGGTG GCGCCGGCAT CCAGGCGGAC
CTGAAGACGG TGACCCTTTT GGGAAGCTAC GGCGCCTCGG CCATCACCGC GCTTACCGCA
CAGAACACCC GCGGCGTCAA CGCGATCCAC CCGGTCCCGC CCGCCTTCCT CGCGGAGCAG
ATCGACGCTG TCCTCTCGGA CATCCCGATC GACGTGGTGA AGGTGGGGAT GCTCTCTTCC
GCCGAGAACG CCGCCATCCT CGCCGACAGG CTCACCGCCC ACGGCATGAG GATGGTTGTG
CTCGACCCGG TGATGAGCGC CAAGGGCGGC GTGGCGCTCC TGGAGGGCGA GGCGCTGGGC
GTGCTGAAAC AGAGGCTTAT CCCGCTTTGC TACCTGCTCA CGCCGAACAT CCCCGAGGCC
GAGGCCCTCA CCGGGCTCAC CATCACCGAT ACGGCGGGGA TGGAACTCGC CGCCCGGGCC
CTGCACCTCA TGGGGGCGAA GCACGTGCTG GTAAAGGGGG GGCACCTGAC CGAGGGGGTG
GTCACCGACA TCCTCTTCGA CGGCGCCGGC TTCACCCGCT TCACGGCTCC GCGCGTACTC
ACCCGCAACA CCCACGGCAC CGGCTGCACG CTGGCTTCGG CCATCGCAAG CTACCTGGCC
CAGGGGGAAC CGCTCCCCGG CGCGGTGCTC CGGGCGAAGC TCTTCGTCAC GCGCGCGATC
AAGTACGCCC AGCCGCTGGG AAAGGGGCAC GGCCCAGTGA ACCATTTCCT CGCCGCCAAA
GACCAGGCGG AATAA
 
Protein sequence
MLRLVVDHSG KERRIGGLYL ITDQAERLVH RVREALSSGG VAVLQYRDKV RAYEERLELG 
QELKHLCTEF QVEFIVNDDV ELALALDADG VHLGQDDGDP AAAREALGPK KMIGISTHSL
TEALEAQEAG ADYVGFGALY PTDSKEVEHI QGPEKLALLK GKLRIPVVAI GGIARDNACA
VIDAGADAIA VISAVLSARS PGLAATELAL LFNRKAMQPR GGVLTVAGSD SGGGAGIQAD
LKTVTLLGSY GASAITALTA QNTRGVNAIH PVPPAFLAEQ IDAVLSDIPI DVVKVGMLSS
AENAAILADR LTAHGMRMVV LDPVMSAKGG VALLEGEALG VLKQRLIPLC YLLTPNIPEA
EALTGLTITD TAGMELAARA LHLMGAKHVL VKGGHLTEGV VTDILFDGAG FTRFTAPRVL
TRNTHGTGCT LASAIASYLA QGEPLPGAVL RAKLFVTRAI KYAQPLGKGH GPVNHFLAAK
DQAE
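
The sequences above are printed in space-separated blocks of 10, so they need to be joined before any analysis. The following is a minimal sketch, not a reference implementation: the helper names are hypothetical, and the start-codon set is the one listed for NCBI translation table 11 (the table named in the record). Under that table, an initiating GTG is read as methionine, which is consistent with the gene starting with GTG while the protein starts with M.

```python
# Start codons permitted by NCBI translation table 11, and the standard stop codons.
TABLE11_START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_valid_cds_ends(seq: str) -> bool:
    """True if seq is a whole number of codons, begins with a table 11 start codon
    and ends with a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in TABLE11_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First row of the gene sequence above: six blocks of ten bases.
first_row = "GTGCTAAGAC TCGTGGTGGA TCACAGCGGC AAGGAGCGGC GCATCGGGGG GCTCTACCTC"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared against the GC content field, and `has_valid_cds_ends` checks the GTG start and TAA stop.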