Gene Cthe_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3209 
Symbol 
ID4809511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3801316 
End bp3802515 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content33% 
IMG OID640108643 
Producthypothetical protein 
Protein accessionYP_001039597 
Protein GI125975687 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCAATTT TTTATTTCAT TTTACTGTTG CATTTAATTT TACAACGAAC CAAAACTAAA 
GAATTATATG AAAAACTTGA AAAGAAAAAA GCCAAAAGAG ATATGTATTC TTATATAGTT
GACTTTGGAG TATCGAAAAA ATGCAGTTCC ACAAGATTAG CTGCCGTTTA CAAAGAAGAA
TTAAGCGGAA ACTATTATTC CGGCGAGGCA AAATCCAAAA TTGATATGTT CAACCAAAGG
GAAAAAAGTA ACAATCAATT TGCTTTAAAT ATTAAGGATT TGGGTGTAAG TAAAAATGAA
AAAAGCTATA TAGCCATTAC ACATATTGAC GGAAACAGGA TGGGAAAGCG GATAAGCAAC
TTGAGAGAAA GTTTTAGAAG CAAGTATAAT TCTGAAAATA TAAAGGAAAT AAATGAGCAA
TACATAAATG CATTGAATCA GTTTTCAATA GATATTGACA AGGCTTTTAA AACAGCTTTT
AACAAAATGG TTGAGACAGT AGAAAAGAAT CGTGAAAACT TGGAAAAAGA AGGAATGGAG
CTTAAGAGTT CAGTTATACC GATTCGAAAA GTTGTGCTGG CAGGCGATGA TGTTTGTTAT
ATCACTGATG CGAGGATTGC TTTGGAATGT GCTTATATTT TCTTGCGAGA GTTGGAAAAG
CATAGCGTTA TGGGAGAAAA GATAACGGCC TGCGCCGGTA TTGCAATTGT GAAAGAAAAA
TATCCGTTTT TTAAAACATA CGAATTATCC GAAGAACTTT GTAAAAATGC AAAATCAAGT
ATTGAAGAAG GCAAAATTGA ATCGAGGATT GACTGGCATA TAGTTCAAGG GGAATACAAT
AACAATTTGG ATGAGATTAG AAACACGGTT TATAAGACAC TTGACGGTAA AGACCTTTCC
ATGAGACCGT TGGTGGTTTC AAAAGAATCA GATTCACCAA ATCATTATTC TCTTTTTAGG
AAGGATATAG AGGTAATAAG ATCGAGAAAA TTACCCAGGG GAAAAATAAA AGGAATGTTA
AAAGAAATGA AAAAAGGAGA GGCTTATTTA GATACCTATA TAGAAATTAA TCAGATTTAT
AATGTACTTG GTGCACACCG GCTTAATGCA AAAAGCGGAT TCCTAAACGG AAAATGTGTT
TTGTTTGATG CAATAGAAGC ATTGGATTAT TTTATACCGT TTTGTGATGA GGAGGTGTAA
 
Protein sequence
MSIFYFILLL HLILQRTKTK ELYEKLEKKK AKRDMYSYIV DFGVSKKCSS TRLAAVYKEE 
LSGNYYSGEA KSKIDMFNQR EKSNNQFALN IKDLGVSKNE KSYIAITHID GNRMGKRISN
LRESFRSKYN SENIKEINEQ YINALNQFSI DIDKAFKTAF NKMVETVEKN RENLEKEGME
LKSSVIPIRK VVLAGDDVCY ITDARIALEC AYIFLRELEK HSVMGEKITA CAGIAIVKEK
YPFFKTYELS EELCKNAKSS IEEGKIESRI DWHIVQGEYN NNLDEIRNTV YKTLDGKDLS
MRPLVVSKES DSPNHYSLFR KDIEVIRSRK LPRGKIKGML KEMKKGEAYL DTYIEINQIY
NVLGAHRLNA KSGFLNGKCV LFDAIEALDY FIPFCDEEV