Gene Cthe_0602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0602 
Symbol 
ID4808204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp737976 
End bp739274 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content44% 
IMG OID640106016 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001037030 
Protein GI125973120 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATACAA CTCAGATGGA TGCAGCCAAA AAAGGAATAA TAACCGATGA AATGAGAATA 
GTGGCCGAAA AAGAGGGTGT GCATGTTGAG AAGTTGCGAG AGCTTGTTGC ATCAGGCAAA
GTGGTCATAC CGGCCAATAA AAACCACAAA AAACTTGAGC CTCAAGGAAT AGGAGAAGGT
TTAAGAACAA AAATCAATGT TAATATCGGA ATATCGAAGG ACTGCTGCAA TTTTGAGATG
GAGTTGGAAA AGGCAAAAAA GGCAATTGAG CTTAAAGCCG AGGCGATAAT GGATTTAAGT
TCCTACGGAA AGACAAGGGA GTTTAGGCGA AAGCTTGTGG AAATGTCTCC TGTAATGATA
GGTACTGTGC CGGTATATGA CGCGGTGGGG TTTTATGAAA AAGACCTTAA AGATATTAGC
GCTGAAGAAT TTTTCGAAGT GGTGGAAAAG CATGCCGAAG ACGGTGTGGA CTTTATGACA
ATTCATGCCG GCATAAACCG GGAGACCGCA AAGAGATTTA AGGAAAACGG CAGACTTACC
AATATCGTAT CCAGAGGGGG TTCTTTGATA TTTGCATGGA TGGAGCTTAC GGGAAATGAA
AATCCCTTCT ATGAGCAGTA TGACAGGCTG CTTCAAATAT TTGAAAAGTA TGATGTCACC
ATAAGTCTTG GGGATGCATT AAGGCCGGGA AGCATAAATG ATTCCACCGA TGCGTCGCAA
ATACAGGAAC TTATTGTTTT GGGAGAGCTT ACCAAAAGAG CATGGGAGAA GAACGTACAG
GTTATGATTG AAGGACCGGG GCATATGGCC ATAAATGAAA TTGCCCCAAA CATGGTTTTG
GAAAAAAAGC TTTGTCACGG TGCACCTTTC TATGTTTTAG GACCGATTGT CACGGACATT
GCACCGGGAT ACGACCACAT AACCAGTGCT ATTGGAGGAG CCATTGCGGC TGCAAACGGT
GCGGATTTTC TGTGTTATGT CACTCCCGCG GAGCATTTGA GGCTTCCTGA CATAGATGAC
ATGAAAGAAG GAATTATAGC AGCCAGAATT GCGGCCCATG CCGCAGATAT AGCGAAAGGA
ATCAAAGGAG CAAGGGAATG GGACTACCAA ATGAGCGAGG CAAGGAGAAA CCTTGACTGG
AACAGGATGT TTGAGCTTGC AATAGACAGG GAAAAAGCGG AAAGATACCG CAAAAGCTCG
ATGCCTGAAG ATGAAGACAC CTGTACCATG TGCGGCAGAA TGTGCGCAGT CAAAAACACA
AACAAAGCCC TTAAGGGTGA AAAAATAAAT ATTCTTTAA
 
Protein sequence
MYTTQMDAAK KGIITDEMRI VAEKEGVHVE KLRELVASGK VVIPANKNHK KLEPQGIGEG 
LRTKINVNIG ISKDCCNFEM ELEKAKKAIE LKAEAIMDLS SYGKTREFRR KLVEMSPVMI
GTVPVYDAVG FYEKDLKDIS AEEFFEVVEK HAEDGVDFMT IHAGINRETA KRFKENGRLT
NIVSRGGSLI FAWMELTGNE NPFYEQYDRL LQIFEKYDVT ISLGDALRPG SINDSTDASQ
IQELIVLGEL TKRAWEKNVQ VMIEGPGHMA INEIAPNMVL EKKLCHGAPF YVLGPIVTDI
APGYDHITSA IGGAIAAANG ADFLCYVTPA EHLRLPDIDD MKEGIIAARI AAHAADIAKG
IKGAREWDYQ MSEARRNLDW NRMFELAIDR EKAERYRKSS MPEDEDTCTM CGRMCAVKNT
NKALKGEKIN IL