Gene Cthe_2170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2170 
Symbol 
ID4810883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2581757 
End bp2582866 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content42% 
IMG OID640107573 
Producthypothetical protein 
Protein accessionYP_001038565 
Protein GI125974655 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGCGT TCATAACCAT AAAAAAGTCC ATACTTATAG GTACCGTTTT TGCAATTACA 
ATTTGCGCCT TCAGTATAGC CCTGTATAAT GCCTCTCCGG CTCTGCCAAG CGTAGCCATA
AGCGAAGAGC ATATAATTAT GCTCGATGAA ATGTTTAAAA TCAGAAGCAA AGCCTTCCTG
GATAACGACT TGAAACTGAT AGAAACACTC TACAACAAAA AAACAAAGTT TGGGATATGG
GCACTTGAAC ATGAGAAAAA AAAGATGAAA TACCTCCATG ACTGGGCTGA CAAACAAGGC
ATAAAATTTA CCGATATAAA ACCCAAAGTT AAAATCTACC AGGTAAAGGA AAGGGGCAGC
GGCTATTTAA TTAACTTCAT AGCCTCTACA GAGTATCATT ACCACTACGA AAATCAGCCC
GAAATCACCA ATTTTTTCAG AATCGGAACC TACCATTCCA TGAATCTGGA TAATGTTGAC
GGTCAATGGC TTATTTCAAA AGAATGGTAT ACTGATCCCT TTGCCGACTC ATTAAACACC
GACAATATAA AAGTGGACGA ATTTAAACAG TATTTATTGA ATTCTTCTCC CAGAGATTTC
TCCAAGCTAA ACAAAAGGCG CATAAGTGCG GTGGAATATG CCGACCGCTA CTGTGGGGCC
GCCGCCGATG AGCAATACGG CTATTCATAC AATAAAAAAT ACAAAAATTA CAACCCTTTA
GGCGGAGACT GCGCAAACTT TGCCTCCCAA ATTCTCTATG AAGGAGGCAA ATTCAAGCAG
ACCGGAGCAT GGAGGTATGA AAAAGACGGA AGCAAAGCAT GGGTCAACGC CCATGCTTTC
AACAGCTACA TGCTCTACAG CGGCAGAGGT TCATTAATCG CAAGAGGTAC CTACAATCAA
GTTTTCAAAG CTTCATTCAA GCTTCTGCCG GGGGACTACA TAGCCTACGA AAAGAAAGGA
AAAGTAGTCC ACATATCCGT TGTCACCGGC GCCGATTCAA AGGGCTATAC CCTGGTCAAC
TGCCACAATA CAGACAGATA CAGAGTCCCG TGGGATTTAG GATGGAGTGA CAAAGGTATT
AAATTCTGGC TGGTTCGTGT AAACTACTAA
 
Protein sequence
MYAFITIKKS ILIGTVFAIT ICAFSIALYN ASPALPSVAI SEEHIIMLDE MFKIRSKAFL 
DNDLKLIETL YNKKTKFGIW ALEHEKKKMK YLHDWADKQG IKFTDIKPKV KIYQVKERGS
GYLINFIAST EYHYHYENQP EITNFFRIGT YHSMNLDNVD GQWLISKEWY TDPFADSLNT
DNIKVDEFKQ YLLNSSPRDF SKLNKRRISA VEYADRYCGA AADEQYGYSY NKKYKNYNPL
GGDCANFASQ ILYEGGKFKQ TGAWRYEKDG SKAWVNAHAF NSYMLYSGRG SLIARGTYNQ
VFKASFKLLP GDYIAYEKKG KVVHISVVTG ADSKGYTLVN CHNTDRYRVP WDLGWSDKGI
KFWLVRVNY