Gene GM21_2782 details

Gene Information

Locus tag: GM21_2782
Symbol:
ID: 8138125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3230721
End bp: 3231923
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 63%
IMG OID: 644870385
Product: GTP cyclohydrolase II
Protein accession: YP_003022574
Protein GI: 253701385
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
TIGRFAM ID: [TIGR00505] GTP cyclohydrolase II
    [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase
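
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The GC helper skips any character that is not A, C, G or T, so a sequence pasted with spaces and line breaks can be passed in directly.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


def gc_content(seq):
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(3230721, 3231923)
    print(length)                  # 1203, matching the Gene Length field
    print(protein_length(length))  # 400, matching the Protein Length field
```

Passing the full gene sequence from the Sequence section to `gc_content` gives a value that can be compared with the GC content field above.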


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 96
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTCG CCAGCATAGC CGAGGCAATC GAGGACATCC GGGCCGGGAA GATGGTCATT 
CTGGTCGATG ACGAGGATCG CGAGAACGAA GGCGATCTCA CCATCGCCGC TCAATGCGTA
ACCCCCGAGA TAATCAACTT CATGGCCATG CACGGGCGCG GGCTGATCTG CCTGACCATG
ACCGGAGAGC GCTGCGACAG GCTGGCGCTC CCCTCCATGG TGCGCGACAA CACCTCCTCC
TTCGGCACCG CCTTCACCGT CTCCATCGAG GCACGCCGCG GAGTCACCAC CGGCATCTCC
GCCGCCGATC GCGCCCACAC CATTCTCACC GCAGTAGCCA CCGATGCCTG CGCCGAGGAC
CTGGCGCGGC CGGGACACAT ATTCCCTCTG CGCGCAAGAG AAGGAGGCGT CCTGGTACGC
TCGGGGCAGA CCGAGGGATC CGTCGACCTG GCGCGCCTGG CAGGGTTGGA GCCCGCCGGC
GTCATCTGCG AGATCATGAA CGAAGACGGC ACCATGTCCC GCATGCCCGA CCTGAAGAAG
TTCGCCCTGC GCCACAGGAT CAAGATCTGC ACCATAGCCG ACCTCGTCGC TTACCGGATG
CAGCACGAGT CCTTGGTCCG CCGTTGCGTC GAGGTGAATC TCCCCACCCA GTTCGGCGAC
TTCCGCGCCG TCGGCTTCGA AAACGACGTC GACGGACTGG AGCACATTGC CCTGGTCAAA
GGCGACATAG GTGGAGACGA ACCGGTCCTG GTGCGGGTCC ACTCCGAGTG CCTGACCGGC
GACGTCTTCG GCAGTGTCCG GTGCGACTGT GCCGACCAAC TGCACCTAGC CATGCAGCAC
GTACAAGCCG AGGGGAGGGG GGTAATCCTC TACATGCGGC AGGAGGGGCG CGGCATCGGG
CTCACCAACA AATTGAAGGC GTATGCGCTG CAGGATCAGG GCAAGGACAC AGTCGAGGCG
AACGTCGCTC TCGGTTTCAA GGCCGACATG CGGGATTACG GCATAGGCGC CCAGATCCTC
TCCAATCTAG GAATAAAGAA GATCCGGCTC ATGACCAACA ACCCGAAGAA GCTGGTGGGG
CTGGCAGGAT ACGGGATCGG CATCGAAGAA CGGGTGCCGT TGGAGCTACC CCCCGGTAAT
GCCAACGCGG GGTACCTCAA GACCAAGCGG GAAAAGATGG GGCATTTGTT GAACTTCATC
TGA
 
Protein sequence
MSVASIAEAI EDIRAGKMVI LVDDEDRENE GDLTIAAQCV TPEIINFMAM HGRGLICLTM 
TGERCDRLAL PSMVRDNTSS FGTAFTVSIE ARRGVTTGIS AADRAHTILT AVATDACAED
LARPGHIFPL RAREGGVLVR SGQTEGSVDL ARLAGLEPAG VICEIMNEDG TMSRMPDLKK
FALRHRIKIC TIADLVAYRM QHESLVRRCV EVNLPTQFGD FRAVGFENDV DGLEHIALVK
GDIGGDEPVL VRVHSECLTG DVFGSVRCDC ADQLHLAMQH VQAEGRGVIL YMRQEGRGIG
LTNKLKAYAL QDQGKDTVEA NVALGFKADM RDYGIGAQIL SNLGIKKIRL MTNNPKKLVG
LAGYGIGIEE RVPLELPPGN ANAGYLKTKR EKMGHLLNFI