Gene Cthe_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0401 
Symbol 
ID4808404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp498519 
End bp500786 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content35% 
IMG OID640105815 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001036832 
Protein GI125972922 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00237245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACA GGTTTAATAT AAGAGAGGTT TTCAGTAAAA TCAAAACTTC TGGCTTTAAA 
AAAATAAAAA GATTAAAACA GGCGATTGCA TCGAAAGTGC GCTTTTTAAA AAGTTTGAAC
GAAAAAAAAT CAAGTGAAAA AAATATTTTT ATGCCTTTCT TCAGGCTTAA AGATGTAAGT
ATAGGAATTA AACTCAATGT AATTGTATCT ATTGTTCTTG TAGTATCTTT GTCTGTAGTA
ATTTTGTACT CTTTTGGCAT TGTGAGGAAT ATCCTTGTGG AGCAGGCAAA GGACGGAACG
CTGCAGGTTT CAAAGCAAAC CAATTCGAAT ATGCGAATGC TCCTTGAATC AATGGATCAG
GAGGCAACGG CTTTAAGCAG AAATGAACAA ATAGCGGATA TTATTTCGAG GTTGAACAGT
ACTGATGATG CTACTTTACA AAGCAGATAC TCGAGTCAGT TAAGAACGAT GCTGACTAAT
TATGTGGACG AAAAAAGTAA CTATTTTTAC ACTATTCTGG CGGTTTCCAA TAACTACCTA
TATTCGATTT CCGGACAGCA AATAATAAGT CATCAGATAA ATATAAAAGA AGTTGACAGT
ATAAAATCTT TTGTTGAAAG TACGAAATAT TCAAAATGGT ATGATCCTTA CGTGCAAGAT
GTTTTAATTC ATCAGGACAC AACCGGTTTG GGCGGAAAAG TAATTACTTT GGCTAAAAAA
GTTTATTCAA GGACAAAGCT AAAAAGTCCG GGGCAGCTGT TCTTCTATAT CAACTATGAA
AATTTGGGCA GAATATTTAA TGATTTGCAT TTGCCTTATG ACGGGGTTAT GTATGTTGTC
GGAAACGACT ATAACATTGT AATGAATCCT TCAAAAAAAG AACACATGTC GCTGGCTATA
AATGATGTTA GCCAGGAAGA AAAAGAACGG AATTTTTATA TTGACGAAGA GATATTCAAG
AAAATTAAAT CAGAACCGAG CGGTGCTTTT ACAACAAAGT TGTATGGGAA AGATGTTTTG
ATAACTTTTT CAACAATAGA TAAAGTCAAT GTTACGGATT TGGGATGGAC TTTTGTAACC
GTTACGGAAG TAGATAAGAT TACGGAAAGT GTCAACAGAG TTTCGGCACA GGTAATAATT
ATCGGTTTAA TTTGTTTGGC GCTGGGGATA CTGTTATCTC AATTAATAAA TAAAGACATT
ACGAAGAATA TAAAGAAACT GGTAAAAACA ATGGAGAAAG TAGGCAGCGG TAATTTATCA
ATAGAGTTTA AAGTTGAGGA GAGAAAGGAT GAGATAGGCA AGCTTAGTAA CAGTTTTGCA
AAGATGATAA GCAACTTGAA CGGATTGATT ATGAGTGTTA AACATGCTTC GGATGTCACA
ATTGACGCAT CTTCCAATGT ATCCGCAAAA ATTCAGGAAA CGTATGCTTC TATTCAACAA
ACAAATGCTA TTTTGGACGT TATAAAGGAA AAGAGTTTTC AACAGGGTAA GATTGTAAAA
GAAGGTGAGC GACAGGTTGC CTTGACAAAA GATGAAGTCA ATCAGGCAAA AGACACGATG
AAAGATGTAG ATGCGATACT TTCCAAGTCA AAAGAAATCA GCGAAGATAA TCGAATGTCA
GTAAGCTTGT TGCATGAAAT GTCTCAAAAT ATCAGAACAG CTATGAATGA AATAGGTGCG
GAATCTAAAG AGCTTATTGC AACTTCAAAG GAAATAACAA AATTTACAAG ACAGATCAAA
GAGATTTCAG AACAAACAAA ATTGATTGCT CTGAATTCAG CTATTGAGTC AGCCCGATTG
GGAGCCCAGG GAAAGACATT CCATTTGTTA TCTGAAGAAA CCAGAAACTT GGCTTCAAAA
ACCAAAGATT TGTCGGGCAG CATAGACCAG ATAATTCAAA ATCTTATTGA AAAGATTAAC
AATACCAACA AGGTTGTATT AAAGTTGGAC AAAGTGGCCG AGAATACCGA AAATTCGGTA
AAGGACGTAA CAGAAAGTCT GGATAAAAAT ATTGAATTTT TAAATGAGAT AACATCAAAT
GTCTCTCGGA TAAAACAGGT ATTTACGCAC ATAGACGACT TTGTCAATCA GATAGTATCT
ACGATAGAAT ATATAAGCGC AAGTGCCGAG GCGAATATTC AAGACATTTC TGATGTAAGC
AAAGCGATGA ACGAGCAGAT AAAGTGTCAG GAATCATTAT TGGAGCAGAC CACCAATTTG
CTTAACTTGT CACAGGAGCT CAAGAAAAAA GCTGAAGAGA TATCCTGA
 
Protein sequence
MKNRFNIREV FSKIKTSGFK KIKRLKQAIA SKVRFLKSLN EKKSSEKNIF MPFFRLKDVS 
IGIKLNVIVS IVLVVSLSVV ILYSFGIVRN ILVEQAKDGT LQVSKQTNSN MRMLLESMDQ
EATALSRNEQ IADIISRLNS TDDATLQSRY SSQLRTMLTN YVDEKSNYFY TILAVSNNYL
YSISGQQIIS HQINIKEVDS IKSFVESTKY SKWYDPYVQD VLIHQDTTGL GGKVITLAKK
VYSRTKLKSP GQLFFYINYE NLGRIFNDLH LPYDGVMYVV GNDYNIVMNP SKKEHMSLAI
NDVSQEEKER NFYIDEEIFK KIKSEPSGAF TTKLYGKDVL ITFSTIDKVN VTDLGWTFVT
VTEVDKITES VNRVSAQVII IGLICLALGI LLSQLINKDI TKNIKKLVKT MEKVGSGNLS
IEFKVEERKD EIGKLSNSFA KMISNLNGLI MSVKHASDVT IDASSNVSAK IQETYASIQQ
TNAILDVIKE KSFQQGKIVK EGERQVALTK DEVNQAKDTM KDVDAILSKS KEISEDNRMS
VSLLHEMSQN IRTAMNEIGA ESKELIATSK EITKFTRQIK EISEQTKLIA LNSAIESARL
GAQGKTFHLL SEETRNLASK TKDLSGSIDQ IIQNLIEKIN NTNKVVLKLD KVAENTENSV
KDVTESLDKN IEFLNEITSN VSRIKQVFTH IDDFVNQIVS TIEYISASAE ANIQDISDVS
KAMNEQIKCQ ESLLEQTTNL LNLSQELKKK AEEIS