Gene GM21_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1228 
Symbol 
ID8136553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1434561 
End bp1435763 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content63% 
IMG OID644868842 
ProductGTP cyclohydrolase II 
Protein accessionYP_003021047 
Protein GI253699858 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.00154476 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGTTG CAAGCATAGA GGAAGCGATT GAGGAGATCA GGGCCGGCAG GATGGTCATC 
CTCGCGGACG ACGAGGATCG CGAGAACGAA GGCGACCTAA CCATGGCGGC ACAGTGCGTA
ACGCCGGAGG CCATAAATTT CATGGCGAAA TACGGCCGCG GCTTGATCTG CCTCACCATG
ACCTCGGAGC GTTGCGACCG CCTCGACCTG CAGCCGATGG TGCAGACCAA CACCTCTTCC
TTCGGCACCG CCTTCACCGT TTCCATCGAG GCGAAGAAGG GCGTTACCAC CGGCATCTCG
GCGGCGGACC GCGCCCATAC CATACTGACG GCCGTGGCAC CCGATGCCAC CGCGGCGGAC
CTGGCGCGGC CGGGCCACAT CTTCCCGCTC AGGGCCCGCA ACGGCGGCGT CCTGGTGCGC
TCCGGGCAGA CCGAAGGCTC GGTCGATCTG GCGCGTCTCG CCGGGTTGGA GCCTGCCGGC
GTCATCTGCG AGATCATGAA CGACGACGGC ACCATGTCGC GCATGCCCGA GTTGAAGAAG
TTCGCCAAGG AGCACGGCAT CAAGGTCTGC ACCGTCGCCG ACCTGGTCGC CTACCGCCTG
AAGCACGAAT CGCTGGTGCG CCGCTCGGTC GACGTGGCGC TCCCCAGCCA GTATGGCAGC
TTCCGCGCGG TAGCCTTCGA GAACGACATC GACAAGTTGG AGCATCTCGC GCTGGTCAAA
GGGGACATCA AGGGTGACGA GCCAGTACTG GTGCGTGTCC ATTCCGAGTG CCTCACCGGG
GATGTCTTCG GCAGCGTCAG GTGCGACTGC GCCGATCAAT TGCACAGCGC CATGGAGCGG
ATCGAGAAGG AAGGGACGGG AGTCATCCTC TACATGCGCC AGGAAGGGCG CGGCATCGGG
CTCACCAACA AGCTGAAGGC GTACGCGCTG CAGGACCAGG GGCACGACAC GGTTGAGGCG
AACCTTGCCT TGGGCTTCAA GGCCGACCTG AGGGACTACG GCATCGGCGC GCAGATCCTG
GTGAACCTGG GTATTAAGAA GATCCGGCTC ATGACCAACA ACCCGAAGAA ACTGGTAGGT
CTCCAGGGGT ATGGCATCAA CATCGTCGAG CGCGTACCCA TCGAGATCGC CGCTTCCAAG
AGCAACGAGA AGTACCTGAA GACCAAGCGC GAGAAGATGG GGCACCTGCT GGAAAACATA
TAA
 
Protein sequence
MSVASIEEAI EEIRAGRMVI LADDEDRENE GDLTMAAQCV TPEAINFMAK YGRGLICLTM 
TSERCDRLDL QPMVQTNTSS FGTAFTVSIE AKKGVTTGIS AADRAHTILT AVAPDATAAD
LARPGHIFPL RARNGGVLVR SGQTEGSVDL ARLAGLEPAG VICEIMNDDG TMSRMPELKK
FAKEHGIKVC TVADLVAYRL KHESLVRRSV DVALPSQYGS FRAVAFENDI DKLEHLALVK
GDIKGDEPVL VRVHSECLTG DVFGSVRCDC ADQLHSAMER IEKEGTGVIL YMRQEGRGIG
LTNKLKAYAL QDQGHDTVEA NLALGFKADL RDYGIGAQIL VNLGIKKIRL MTNNPKKLVG
LQGYGINIVE RVPIEIAASK SNEKYLKTKR EKMGHLLENI