Gene Information |
Locus tag | GM21_2782 |
Symbol | |
ID | 8138125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3230721 |
End bp | 3231923 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870385 |
Product | GTP cyclohydrolase II |
Protein accession | YP_003022574 |
Protein GI | 253701385 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II; [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
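The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 3230721
end_bp = 3231923

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1203 400
```

Both values match the Gene Length (1203 bp) and Protein Length (400 aa) fields.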
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 96 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTCG CCAGCATAGC CGAGGCAATC GAGGACATCC GGGCCGGGAA GATGGTCATT CTGGTCGATG ACGAGGATCG CGAGAACGAA GGCGATCTCA CCATCGCCGC TCAATGCGTA ACCCCCGAGA TAATCAACTT CATGGCCATG CACGGGCGCG GGCTGATCTG CCTGACCATG ACCGGAGAGC GCTGCGACAG GCTGGCGCTC CCCTCCATGG TGCGCGACAA CACCTCCTCC TTCGGCACCG CCTTCACCGT CTCCATCGAG GCACGCCGCG GAGTCACCAC CGGCATCTCC GCCGCCGATC GCGCCCACAC CATTCTCACC GCAGTAGCCA CCGATGCCTG CGCCGAGGAC CTGGCGCGGC CGGGACACAT ATTCCCTCTG CGCGCAAGAG AAGGAGGCGT CCTGGTACGC TCGGGGCAGA CCGAGGGATC CGTCGACCTG GCGCGCCTGG CAGGGTTGGA GCCCGCCGGC GTCATCTGCG AGATCATGAA CGAAGACGGC ACCATGTCCC GCATGCCCGA CCTGAAGAAG TTCGCCCTGC GCCACAGGAT CAAGATCTGC ACCATAGCCG ACCTCGTCGC TTACCGGATG CAGCACGAGT CCTTGGTCCG CCGTTGCGTC GAGGTGAATC TCCCCACCCA GTTCGGCGAC TTCCGCGCCG TCGGCTTCGA AAACGACGTC GACGGACTGG AGCACATTGC CCTGGTCAAA GGCGACATAG GTGGAGACGA ACCGGTCCTG GTGCGGGTCC ACTCCGAGTG CCTGACCGGC GACGTCTTCG GCAGTGTCCG GTGCGACTGT GCCGACCAAC TGCACCTAGC CATGCAGCAC GTACAAGCCG AGGGGAGGGG GGTAATCCTC TACATGCGGC AGGAGGGGCG CGGCATCGGG CTCACCAACA AATTGAAGGC GTATGCGCTG CAGGATCAGG GCAAGGACAC AGTCGAGGCG AACGTCGCTC TCGGTTTCAA GGCCGACATG CGGGATTACG GCATAGGCGC CCAGATCCTC TCCAATCTAG GAATAAAGAA GATCCGGCTC ATGACCAACA ACCCGAAGAA GCTGGTGGGG CTGGCAGGAT ACGGGATCGG CATCGAAGAA CGGGTGCCGT TGGAGCTACC CCCCGGTAAT GCCAACGCGG GGTACCTCAA GACCAAGCGG GAAAAGATGG GGCATTTGTT GAACTTCATC TGA
|
Protein sequence | MSVASIAEAI EDIRAGKMVI LVDDEDRENE GDLTIAAQCV TPEIINFMAM HGRGLICLTM TGERCDRLAL PSMVRDNTSS FGTAFTVSIE ARRGVTTGIS AADRAHTILT AVATDACAED LARPGHIFPL RAREGGVLVR SGQTEGSVDL ARLAGLEPAG VICEIMNEDG TMSRMPDLKK FALRHRIKIC TIADLVAYRM QHESLVRRCV EVNLPTQFGD FRAVGFENDV DGLEHIALVK GDIGGDEPVL VRVHSECLTG DVFGSVRCDC ADQLHLAMQH VQAEGRGVIL YMRQEGRGIG LTNKLKAYAL QDQGKDTVEA NVALGFKADM RDYGIGAQIL SNLGIKKIRL MTNNPKKLVG LAGYGIGIEE RVPLELPPGN ANAGYLKTKR EKMGHLLNFI
|
| |
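The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that strips the spaces separating the 10-base blocks and counts G and C. How the record rounds its 63% figure is not stated, so no exact value is claimed here.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()   # drop the spaces between 10-base blocks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# Usage: paste the full "Gene sequence" field in place of this short excerpt.
excerpt = "ATGTCAGTCG CCAGCATAGC CGAGGCAATC"
print(round(gc_percent(excerpt), 1))
```

Because the listed sequence begins with ATG and ends with TGA, it appears to be given in the coding orientation already, so no reverse complement should be needed despite the minus strand annotation.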