Gene Cthe_1266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1266 
Symbol 
ID4809771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1538134 
End bp1540602 
Gene Length2469 bp 
Protein Length822 aa 
Translation table11 
GC content37% 
IMG OID640106689 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001037691 
Protein GI125973781 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTG GTCCTGTTAA CGGAATAAAA GTTGTAATTG CCAGGGCAAC TTCTAAAGTC 
AGAGAATTAA CCAAATCCAA ATTAAAAAGA AAATCAATGA AAACTTTGCA GAATGATACA
AAGAAAAAGG AAAGAAACGA TACCTATAAA GTTTATGATA AAAACCGGTT TGAAAAAATA
ATCCGGCTGT CCAGAGTAAA TAAAATAAAA ATAGGAATAA AAATAAATCT TTCCTTTGTT
ATAACAATTC TTCTTCTGTC AGTGGTTTTG GGATATTCTC TTCTTACTCT GTCAAATACC
ATGATACAAC AGGCAAAAGA AAGTACTTTG GGACTTATGG AGCAAACGGG GAACAAAATA
AAAATTGTTC TTGAGGAAGT TGACAATTTG GCAATGACTA TCACCAGGGA TATTACAATT
GCACCGGCTT TGGATGAAAT CAATCGTGCG GAAAGTGAGT ACATGCGTGC GCGATGGGCC
GGGATTATAA AGCCCTATTT GAATGCTTAC CAAAGTTACA GAGTTGACAC AATTTCAAAT
CTCACTTTGG CTTCCAACAA TGGTTATGTA ATTCTTGGAG GAGAAGGAAC ATTTGAAGAC
GTGAAAAAAG ATTACTATGA TACCGTGGCT GCCCGCGAAT TTGTGGAAAG CGGCGCAAAA
TCGTTGTGGA TTGATACACA TATAGCCGAT TTATTCTACC TGAGAAGAAA GGGCGGCAAC
ACTACGATTG CCCTTATGAA AGCGGTTTAC AAATCAACAA GCCTGAAAAG TGTCGGTGTT
CTTCAAGTAA ACCTCAGGGA GGATTACCTG ACTCGTATGC TCGAAGACAT ACATATTCCT
CATAACGGAT TCTTTTTTAT AGTGGGAAGC AAAGACAATA TGATATTCAA TCCCCAGGAT
AAAAGAGACA ACGGGCTTTT GATTGAGGAT CTGTGTTATG TAAACAGCAA GGGACAGACC
AAAGCGGAAT TGCTGAAGAA AACGGAATTA TTGGATCTAA AAGATTCGGA AGTAAAAGAG
TTGTTGTATA AAGCTGATAC AGGAGTGGAA CTTGATGAAA AAACGCTCAA AGACTTGAGG
GAAAGGGACA ACTATATAAA TCTTCGTATT CTTGAAAAGG TAAGGAAAAG TATAGATGAG
ATTCAAACGA AAAACAGACA GGCCACCGCT TTTGGGGGTA TAATTGAAGA TGATGTTATT
ATAAACGGCA AAAAAATGAT GGTTACTTTT TACACTATAA GAGAAATTAA AGGCACCCCT
TTGGAGTGGA CTTTGATATC TATAACGCCT TTGGAAAACA TAACCCGGGA TGTAAATTTG
GTAGCCGGGT TTATAGTACT TATAGGCGTT GTATGCATTA TAATAGGAAT AATGCTTTCA
GTGCTGATTA CCGGCGATAT TTCTTCAGGT ATCGGGAAAC TGGTAAAATT AATGAATAAA
ATAAAAGAGG GAGATCTTGA AGTAGAATGT GATACCACCA GAAAAGATGA GATAGGAAAA
TTGGGAATAA ATTTCGTGGA TATGGTGGAA AACTTGAAAA AGCTCATAGG AAGCATAAAA
AATGCTTCCA ATATAGCCGT TGAATCATCG CAAACGGTAT CTGCGACCTG TCAGGAGAGC
TATGCATCGA TCCAGGAATT TTCAGCTATG TTGGATGAAA TGAAAAATGA GATTAATCCC
CAGACCCAGG AAATAATGAA CAATGATAAT GTAGTAAATG AATTGTCGGA GCAAATTCAA
GTGATTATCG ATGATTTCAA AAATGTCAGC AATATGGTTA CCGGAGCCAA AAAGTTGAGC
GAAGCTGGAA AAGAAACTGT CAATATGCTG AAAAACAATG CCGATGAGGT AAAAAAGACC
ATAGAAGAAT TCTCAGAACT AATTGGAAGT CTCAAAAGAG AGTCTGCGGA AATTTCAAAA
ATAACTTCCA CCATCAAAGG CATTTCCAGC CAGACCAATC TTTTGGCGCT TAATGCAACC
ATTGAGGCTG CAAGAGCGGG AGAAGCGGGA AAAAGCTTTA GTATAGTTGC GTCGGAAATC
AAGAAACTTG CCGACCAGTC GAAAGTGTCT GCAAATTATA TTGAATCCAA ACTTAAAAAT
ATAGGCAAGA CCATTGAAAA AACCAATGAG GCTGTCAAGT CTTCCGGCGA GGTAATAAGC
GGGCATGACC TTGCCGTTGT TGAAACAATA AACAAGTTCG ACAATATTGT AGGATTTATG
GATAACGTAT TTAGTGCGAT CACAAGTATT ACGGATTATG TGCAGCATAT AGAAGAAGCC
CGCTGCAATA TTATCAAGTC AATGGAAAGA CTGAATGAAA GTACGAAAAA CAATATCAGG
GATATACAGA ATATTTCTGC CGCAATGGAT GAACAGGTGG ATTTGATAAA ACATCTTCTC
TCCCTCTCCG AAAATTTGAG TGAATTGTCT GTAAAACTTG AACAGACCAT TAACATATTT
AAAATATAA
 
Protein sequence
MSFGPVNGIK VVIARATSKV RELTKSKLKR KSMKTLQNDT KKKERNDTYK VYDKNRFEKI 
IRLSRVNKIK IGIKINLSFV ITILLLSVVL GYSLLTLSNT MIQQAKESTL GLMEQTGNKI
KIVLEEVDNL AMTITRDITI APALDEINRA ESEYMRARWA GIIKPYLNAY QSYRVDTISN
LTLASNNGYV ILGGEGTFED VKKDYYDTVA AREFVESGAK SLWIDTHIAD LFYLRRKGGN
TTIALMKAVY KSTSLKSVGV LQVNLREDYL TRMLEDIHIP HNGFFFIVGS KDNMIFNPQD
KRDNGLLIED LCYVNSKGQT KAELLKKTEL LDLKDSEVKE LLYKADTGVE LDEKTLKDLR
ERDNYINLRI LEKVRKSIDE IQTKNRQATA FGGIIEDDVI INGKKMMVTF YTIREIKGTP
LEWTLISITP LENITRDVNL VAGFIVLIGV VCIIIGIMLS VLITGDISSG IGKLVKLMNK
IKEGDLEVEC DTTRKDEIGK LGINFVDMVE NLKKLIGSIK NASNIAVESS QTVSATCQES
YASIQEFSAM LDEMKNEINP QTQEIMNNDN VVNELSEQIQ VIIDDFKNVS NMVTGAKKLS
EAGKETVNML KNNADEVKKT IEEFSELIGS LKRESAEISK ITSTIKGISS QTNLLALNAT
IEAARAGEAG KSFSIVASEI KKLADQSKVS ANYIESKLKN IGKTIEKTNE AVKSSGEVIS
GHDLAVVETI NKFDNIVGFM DNVFSAITSI TDYVQHIEEA RCNIIKSMER LNESTKNNIR
DIQNISAAMD EQVDLIKHLL SLSENLSELS VKLEQTINIF KI