Gene Cthe_0266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0266 
Symbol 
ID4808549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp326614 
End bp328101 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content37% 
IMG OID640105678 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001036698 
Protein GI125972788 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAAA TTACTCTGGA AAGTCATGCC AGAAGAGTGA ACAAAGTTCT CCTATTTATT 
TTCTGGGCAT ATTTTTTTGT GTGTATAGTT TTAGGAATGA CTATCACTGC ACCTACTTCC
CTGATCAGTA TTGAAAGTAT TCCGCTGATA ATTTTTTTCG CAGGCATGAT TACTTCAACA
ATTTACTTTG TCAAAAGGAA ATATAACGAC ATTACCGGTC TGATTTTATG TCTGTCAGTT
ATGGTTTCGC TTGTAGCCAC CTATTTTAAT CTGAATTCTG TGGTCGAGGA AACTATGATT
CTTGTATTGG TAATACCTGC ATGTGCCATG GCTCTGTATT TAAACAAGAG AAATTTTGCG
ATTTATGCAG TTTGTTTCAA TTTGCTCTGT GTAATTTTTG AGGCAGTATA TAAAACAATG
GGTATGCAGA AATTTATTTC AGATTTGGTT AAAATTGATT TTATTTTGTT GTTTTTATAT
TTTTCCGCCA AATGGGGAAG TGAAATAATC CATCAAATGA TAGAAAAAGA GAAGAATTCA
AATAATATAC TGAATAAACT GGAAGATACC ATGGGATTTC TGAGAAAAAA TACTGAAATT
TTAAACCGTG ACATAATGAA TTGCAATATG ACTTTAAAAA TGATTAAGGA ATCGGGCGAC
AGTGTTGCAC TGGCGGTGGA AGAGGTTGCG AAAGGTATTA GTGAGGAAGC TGAAAGCGTC
AGTAATATTA ATGTTATGAT GTCGGAAGCA GATAAATTAG TGGCTGATAC GGCCGCAATT
TCAAGGGAAA TGGCTGAAGT TTCCGTTTCA ACCGTTCAGA TTGTAAATGA AGGTGTTAAA
AATATAAATG AAATGAATAA ACAAATGGAC ATTATTAACA CTGCGGTAAA TGAATCTTAC
TCAACGGTAC TAAAGCTTCA GGAGAGCATG GACAGGGTAA ATGAATTTCT GCAGGGAATT
ACGGAGATAG CCGAGCAAAC CAATATGTTG GCTTTAAATG CGTCAATTGA GGCGGCAAGA
GCAGGAGAAT CGGGAAAGGG ATTTGCCGTT GTCGCAGATG AAGTCAGAAG GCTTGCGGAA
CAGAGTACTC AAACAGTGGG ACTTATCCAT CAGGTTATAA CGAATATAAA GGACGAAGCA
GATGCGATAT TGGACAAGGT ATATAGCGGA ACCGAGGCAA CAAAAGCAGG AGAAGCCATT
GTGAAAAAAG TAAGTGAGAG TTATGACCGG ATGAACCAGT CTTTTAAGGA TATTGACAAT
TATATTGACA GTGAGTTGAA AATGATTGAA AACACTACTC TTCTCTTTTC TCAAATACGT
AAGGAAATGG AAAGCATAGC CGGAATTTCG CAGGAGCATG CGGCAGCATC CGAAGAAATG
GCTGCTTCGA TGCAGGACCA GAAAGATAAA ATTGAGAGTA TCTTTAATTC TATGAAAGAA
ATTCAGAAAT CAAGTGAAGA GCTTGAAATG ATTGTAAAAG ACAAGTGA
 
Protein sequence
MKEITLESHA RRVNKVLLFI FWAYFFVCIV LGMTITAPTS LISIESIPLI IFFAGMITST 
IYFVKRKYND ITGLILCLSV MVSLVATYFN LNSVVEETMI LVLVIPACAM ALYLNKRNFA
IYAVCFNLLC VIFEAVYKTM GMQKFISDLV KIDFILLFLY FSAKWGSEII HQMIEKEKNS
NNILNKLEDT MGFLRKNTEI LNRDIMNCNM TLKMIKESGD SVALAVEEVA KGISEEAESV
SNINVMMSEA DKLVADTAAI SREMAEVSVS TVQIVNEGVK NINEMNKQMD IINTAVNESY
STVLKLQESM DRVNEFLQGI TEIAEQTNML ALNASIEAAR AGESGKGFAV VADEVRRLAE
QSTQTVGLIH QVITNIKDEA DAILDKVYSG TEATKAGEAI VKKVSESYDR MNQSFKDIDN
YIDSELKMIE NTTLLFSQIR KEMESIAGIS QEHAAASEEM AASMQDQKDK IESIFNSMKE
IQKSSEELEM IVKDK