Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0602 |
Symbol | |
ID | 4808204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 737976 |
End bp | 739274 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106016 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001037030 |
Protein GI | 125973120 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATACAA CTCAGATGGA TGCAGCCAAA AAAGGAATAA TAACCGATGA AATGAGAATA GTGGCCGAAA AAGAGGGTGT GCATGTTGAG AAGTTGCGAG AGCTTGTTGC ATCAGGCAAA GTGGTCATAC CGGCCAATAA AAACCACAAA AAACTTGAGC CTCAAGGAAT AGGAGAAGGT TTAAGAACAA AAATCAATGT TAATATCGGA ATATCGAAGG ACTGCTGCAA TTTTGAGATG GAGTTGGAAA AGGCAAAAAA GGCAATTGAG CTTAAAGCCG AGGCGATAAT GGATTTAAGT TCCTACGGAA AGACAAGGGA GTTTAGGCGA AAGCTTGTGG AAATGTCTCC TGTAATGATA GGTACTGTGC CGGTATATGA CGCGGTGGGG TTTTATGAAA AAGACCTTAA AGATATTAGC GCTGAAGAAT TTTTCGAAGT GGTGGAAAAG CATGCCGAAG ACGGTGTGGA CTTTATGACA ATTCATGCCG GCATAAACCG GGAGACCGCA AAGAGATTTA AGGAAAACGG CAGACTTACC AATATCGTAT CCAGAGGGGG TTCTTTGATA TTTGCATGGA TGGAGCTTAC GGGAAATGAA AATCCCTTCT ATGAGCAGTA TGACAGGCTG CTTCAAATAT TTGAAAAGTA TGATGTCACC ATAAGTCTTG GGGATGCATT AAGGCCGGGA AGCATAAATG ATTCCACCGA TGCGTCGCAA ATACAGGAAC TTATTGTTTT GGGAGAGCTT ACCAAAAGAG CATGGGAGAA GAACGTACAG GTTATGATTG AAGGACCGGG GCATATGGCC ATAAATGAAA TTGCCCCAAA CATGGTTTTG GAAAAAAAGC TTTGTCACGG TGCACCTTTC TATGTTTTAG GACCGATTGT CACGGACATT GCACCGGGAT ACGACCACAT AACCAGTGCT ATTGGAGGAG CCATTGCGGC TGCAAACGGT GCGGATTTTC TGTGTTATGT CACTCCCGCG GAGCATTTGA GGCTTCCTGA CATAGATGAC ATGAAAGAAG GAATTATAGC AGCCAGAATT GCGGCCCATG CCGCAGATAT AGCGAAAGGA ATCAAAGGAG CAAGGGAATG GGACTACCAA ATGAGCGAGG CAAGGAGAAA CCTTGACTGG AACAGGATGT TTGAGCTTGC AATAGACAGG GAAAAAGCGG AAAGATACCG CAAAAGCTCG ATGCCTGAAG ATGAAGACAC CTGTACCATG TGCGGCAGAA TGTGCGCAGT CAAAAACACA AACAAAGCCC TTAAGGGTGA AAAAATAAAT ATTCTTTAA
|
Protein sequence | MYTTQMDAAK KGIITDEMRI VAEKEGVHVE KLRELVASGK VVIPANKNHK KLEPQGIGEG LRTKINVNIG ISKDCCNFEM ELEKAKKAIE LKAEAIMDLS SYGKTREFRR KLVEMSPVMI GTVPVYDAVG FYEKDLKDIS AEEFFEVVEK HAEDGVDFMT IHAGINRETA KRFKENGRLT NIVSRGGSLI FAWMELTGNE NPFYEQYDRL LQIFEKYDVT ISLGDALRPG SINDSTDASQ IQELIVLGEL TKRAWEKNVQ VMIEGPGHMA INEIAPNMVL EKKLCHGAPF YVLGPIVTDI APGYDHITSA IGGAIAAANG ADFLCYVTPA EHLRLPDIDD MKEGIIAARI AAHAADIAKG IKGAREWDYQ MSEARRNLDW NRMFELAIDR EKAERYRKSS MPEDEDTCTM CGRMCAVKNT NKALKGEKIN IL
|
| |