Gene Cthe_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1063 
Symbol 
ID4811361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1269628 
End bp1270806 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content41% 
IMG OID640106485 
Productthiamine biosynthesis/tRNA modification protein ThiI 
Protein accessionYP_001037488 
Protein GI125973578 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0301] Thiamine biosynthesis ATP pyrophosphatase 
TIGRFAM ID[TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGG TAATTTTGGT AAGATATGGT GAAATACTTT TAAAGGGATT GAACAGACCT 
ATTTTTGAGG ACAAGCTTAT GAGCAACATA AAAAGGGCCA TTCACAAGCT GGGTAAGGTG
CGCATTACAA AGTCCCAGGC GAGAATATAC ATTGAGCCCT TGGAAGAAAA CTATGATTTT
GATGAGGCTT TAAAACTTTT GTCAAAGGTT TTCGGAATTG TTTCAGTAAG TCCGGTGTGG
AAGATAGATT CGGATTTTGA GTGCATAAAA GAAAACTCGG TAAAAATGGT AAAGGACCTC
ATAAATCGGG AAGGGTACAA GACTTTCAAG GTTGAGACCA AGAGGGGAAA CAAGCGTTTT
CCCATGGATT CACCGGAGAT AAGCAGGCAG CTGGGAGGAT ATATTTTAAG AAATGTGCCT
GAGCTTAGCG TTGATGTAAA AAACCCTGAT TTCATTTTAT ATGTGGAAGT AAGAGAGTTT
ACATACATTT ACTCGGAGAT AATACAGGCA GTTTGCGGAA TGCCCCTTGG CAGCAACGGA
AAGGCTGTGC TTTTGCTGTC GGGAGGTATT GACAGCCCGG TAGCCGGTTG GATGATAGCA
AAAAGAGGTG TGGAAATAGA GGCGGTTCAT TTTTACAGTT ATCCTTACAC CAGTGAGAGG
GCAAAGGAGA AGGTTATTGA ACTTACAAAA ATTCTTGCCA CATACTGCCA AAAAATTAAC
CTTCATATTG TTCCCTTTAC CGAGATTCAG CTGGAGATAA ACGAAAAATG TCCTCATGAA
GAATTGACAA TAATCATGCG AAGAGCAATG ATGAGAATAG CAGAAATAAT TGCTAATAAA
ACCGGAGCTC TGGCATTGGT GACGGGAGAG AGTGTCGGAC AGGTTGCAAG CCAGACAATA
CAAAGCCTTG TGGTTACAAA TGCCGTGGTA AGCCTTCCGG TTTTCCGTCC TTTGATAGGT
ATGGATAAAA ACGAGGTTGT GGATATTGCC AAAAAAATCG GTACTTTTGA AACATCGATT
CTTCCTTATG AGGATTGCTG CACGGTTTTT GTCGCAAAAC ATCCCACCAC CAAGCCGAAA
CTGGAAAGAA TACAGCTTTC GGAAAGCAGG CTGAACATGG AAGAATTGAT AAACAAGGCA
GTTGAAAATA CCGAGGTTTT GACGATAACG AGGGATTAA
 
Protein sequence
MKKVILVRYG EILLKGLNRP IFEDKLMSNI KRAIHKLGKV RITKSQARIY IEPLEENYDF 
DEALKLLSKV FGIVSVSPVW KIDSDFECIK ENSVKMVKDL INREGYKTFK VETKRGNKRF
PMDSPEISRQ LGGYILRNVP ELSVDVKNPD FILYVEVREF TYIYSEIIQA VCGMPLGSNG
KAVLLLSGGI DSPVAGWMIA KRGVEIEAVH FYSYPYTSER AKEKVIELTK ILATYCQKIN
LHIVPFTEIQ LEINEKCPHE ELTIIMRRAM MRIAEIIANK TGALALVTGE SVGQVASQTI
QSLVVTNAVV SLPVFRPLIG MDKNEVVDIA KKIGTFETSI LPYEDCCTVF VAKHPTTKPK
LERIQLSESR LNMEELINKA VENTEVLTIT RD