Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1063 |
Symbol | |
ID | 4811361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1269628 |
End bp | 1270806 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106485 |
Product | thiamine biosynthesis/tRNA modification protein ThiI |
Protein accession | YP_001037488 |
Protein GI | 125973578 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGG TAATTTTGGT AAGATATGGT GAAATACTTT TAAAGGGATT GAACAGACCT ATTTTTGAGG ACAAGCTTAT GAGCAACATA AAAAGGGCCA TTCACAAGCT GGGTAAGGTG CGCATTACAA AGTCCCAGGC GAGAATATAC ATTGAGCCCT TGGAAGAAAA CTATGATTTT GATGAGGCTT TAAAACTTTT GTCAAAGGTT TTCGGAATTG TTTCAGTAAG TCCGGTGTGG AAGATAGATT CGGATTTTGA GTGCATAAAA GAAAACTCGG TAAAAATGGT AAAGGACCTC ATAAATCGGG AAGGGTACAA GACTTTCAAG GTTGAGACCA AGAGGGGAAA CAAGCGTTTT CCCATGGATT CACCGGAGAT AAGCAGGCAG CTGGGAGGAT ATATTTTAAG AAATGTGCCT GAGCTTAGCG TTGATGTAAA AAACCCTGAT TTCATTTTAT ATGTGGAAGT AAGAGAGTTT ACATACATTT ACTCGGAGAT AATACAGGCA GTTTGCGGAA TGCCCCTTGG CAGCAACGGA AAGGCTGTGC TTTTGCTGTC GGGAGGTATT GACAGCCCGG TAGCCGGTTG GATGATAGCA AAAAGAGGTG TGGAAATAGA GGCGGTTCAT TTTTACAGTT ATCCTTACAC CAGTGAGAGG GCAAAGGAGA AGGTTATTGA ACTTACAAAA ATTCTTGCCA CATACTGCCA AAAAATTAAC CTTCATATTG TTCCCTTTAC CGAGATTCAG CTGGAGATAA ACGAAAAATG TCCTCATGAA GAATTGACAA TAATCATGCG AAGAGCAATG ATGAGAATAG CAGAAATAAT TGCTAATAAA ACCGGAGCTC TGGCATTGGT GACGGGAGAG AGTGTCGGAC AGGTTGCAAG CCAGACAATA CAAAGCCTTG TGGTTACAAA TGCCGTGGTA AGCCTTCCGG TTTTCCGTCC TTTGATAGGT ATGGATAAAA ACGAGGTTGT GGATATTGCC AAAAAAATCG GTACTTTTGA AACATCGATT CTTCCTTATG AGGATTGCTG CACGGTTTTT GTCGCAAAAC ATCCCACCAC CAAGCCGAAA CTGGAAAGAA TACAGCTTTC GGAAAGCAGG CTGAACATGG AAGAATTGAT AAACAAGGCA GTTGAAAATA CCGAGGTTTT GACGATAACG AGGGATTAA
|
Protein sequence | MKKVILVRYG EILLKGLNRP IFEDKLMSNI KRAIHKLGKV RITKSQARIY IEPLEENYDF DEALKLLSKV FGIVSVSPVW KIDSDFECIK ENSVKMVKDL INREGYKTFK VETKRGNKRF PMDSPEISRQ LGGYILRNVP ELSVDVKNPD FILYVEVREF TYIYSEIIQA VCGMPLGSNG KAVLLLSGGI DSPVAGWMIA KRGVEIEAVH FYSYPYTSER AKEKVIELTK ILATYCQKIN LHIVPFTEIQ LEINEKCPHE ELTIIMRRAM MRIAEIIANK TGALALVTGE SVGQVASQTI QSLVVTNAVV SLPVFRPLIG MDKNEVVDIA KKIGTFETSI LPYEDCCTVF VAKHPTTKPK LERIQLSESR LNMEELINKA VENTEVLTIT RD
|
| |