Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0106 |
Symbol | |
ID | 4808730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 137902 |
End bp | 139143 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105515 |
Product | GTP cyclohydrolase II |
Protein accession | YP_001036540 |
Protein GI | 125972630 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase [COG0807] GTP cyclohydrolase II |
TIGRFAM ID | [TIGR00505] GTP cyclohydrolase II [TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTG ATACAATTGA GTCCGCGATA GAGGACATCA GGCAGGGAAA AATGATAGTT GTGGTTGACG ATGAAGACAG GGAAAACGAA GGCGACCTGC TGATGGCGGC GGAGCTGGTT ACACCGGAGC ATATCAATTT CATGGCAACA TACGGAAGAG GTATGATTTG CGCCCCGATA ACCGAGGCCA GGGCAAAGGA ATTGGGACTG GAATTAATGG TAACGAAAAA CAGTGAACAC ATGAGAACCG CCTTTACAGT GACGGTGGAC CATAAAAGCA CGACTACCGG TATTTCGGCT TTTGAAAGAG CTAAAACAAT AAAGGAACTT TCAAATCCTG ATGCCAAACC GGATGATTTT GTAAGGCCTG GACATGTTTT TCCATTAATT GCTAAAGAAG GCGGAGTTCT AAAACGTGCA GGTCATACTG AAGCCGCAGT AGACTTTGCA AAACTGGCAG GACTTTACCC TGCCGGGGTA ATATGCGAAA TTATGAATGA TGACGGAACC ATGGCCAGAG TTCCGCAGCT TATGGAATTT GTAAAAAAAC ACGGACTTAA ACTTGTTACC ATAGCAGATC TTATAAAATA CAGAAGAAAT AATGAAAAAT TGATAAGAAG GGCGGCGGAG GCAAAACTTC CCACTGAATA CGGGGATTTT AAAATAGTTG CTTATGAAAA CGTTATAAAC GGAGAACATC ATGTAGCATT GGTAAAGGGA GATGTGGCAA ACTCCGACGA GCCTGTAATG GTGAGGGTGC ATTCGGAGTG CCTTACCGGG GATGCGTTCC ATTCCTTAAG GTGCGACTGC GGTGAGCAGC TGCACAAAGC CATGGAAATG ATAGGAAAAG AGGACAAGGG AGTACTTTTG TACATGAGAC AGGAAGGCCG TGGAATTGGC CTTGTAAACA AAATCCGGGC GTATGAGCTT CAGGATCAGG GAAAAGATAC CGTTGAGGCC AATGTGCTTT TGGGATTTCC CCCCGATCTT AGGGAATATG GCATAGGAGC CCAGATATTA TACGACCTCG GTGTAAGGAA AATAAGGCTT CTTACCAACA ATCCCAAGAA GCTGATAGGA CTTGGCGGTC ACGGACTGGA AATTGTTGAA AGGGTGCCCA TTGAGATAAA GGGAAACGAA ATCAACAGTT TCTATTTGAA AACCAAAAAG GAAAAAATGG GGCATTTATT AACCAGTATA AACACAACGG AACATCAGGA GGGAGAATCA AATGGCAATT AA
|
Protein sequence | MNFDTIESAI EDIRQGKMIV VVDDEDRENE GDLLMAAELV TPEHINFMAT YGRGMICAPI TEARAKELGL ELMVTKNSEH MRTAFTVTVD HKSTTTGISA FERAKTIKEL SNPDAKPDDF VRPGHVFPLI AKEGGVLKRA GHTEAAVDFA KLAGLYPAGV ICEIMNDDGT MARVPQLMEF VKKHGLKLVT IADLIKYRRN NEKLIRRAAE AKLPTEYGDF KIVAYENVIN GEHHVALVKG DVANSDEPVM VRVHSECLTG DAFHSLRCDC GEQLHKAMEM IGKEDKGVLL YMRQEGRGIG LVNKIRAYEL QDQGKDTVEA NVLLGFPPDL REYGIGAQIL YDLGVRKIRL LTNNPKKLIG LGGHGLEIVE RVPIEIKGNE INSFYLKTKK EKMGHLLTSI NTTEHQEGES NGN
|
| |