Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0401 |
Symbol | |
ID | 4808404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 498519 |
End bp | 500786 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640105815 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_001036832 |
Protein GI | 125972922 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00237245 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA GGTTTAATAT AAGAGAGGTT TTCAGTAAAA TCAAAACTTC TGGCTTTAAA AAAATAAAAA GATTAAAACA GGCGATTGCA TCGAAAGTGC GCTTTTTAAA AAGTTTGAAC GAAAAAAAAT CAAGTGAAAA AAATATTTTT ATGCCTTTCT TCAGGCTTAA AGATGTAAGT ATAGGAATTA AACTCAATGT AATTGTATCT ATTGTTCTTG TAGTATCTTT GTCTGTAGTA ATTTTGTACT CTTTTGGCAT TGTGAGGAAT ATCCTTGTGG AGCAGGCAAA GGACGGAACG CTGCAGGTTT CAAAGCAAAC CAATTCGAAT ATGCGAATGC TCCTTGAATC AATGGATCAG GAGGCAACGG CTTTAAGCAG AAATGAACAA ATAGCGGATA TTATTTCGAG GTTGAACAGT ACTGATGATG CTACTTTACA AAGCAGATAC TCGAGTCAGT TAAGAACGAT GCTGACTAAT TATGTGGACG AAAAAAGTAA CTATTTTTAC ACTATTCTGG CGGTTTCCAA TAACTACCTA TATTCGATTT CCGGACAGCA AATAATAAGT CATCAGATAA ATATAAAAGA AGTTGACAGT ATAAAATCTT TTGTTGAAAG TACGAAATAT TCAAAATGGT ATGATCCTTA CGTGCAAGAT GTTTTAATTC ATCAGGACAC AACCGGTTTG GGCGGAAAAG TAATTACTTT GGCTAAAAAA GTTTATTCAA GGACAAAGCT AAAAAGTCCG GGGCAGCTGT TCTTCTATAT CAACTATGAA AATTTGGGCA GAATATTTAA TGATTTGCAT TTGCCTTATG ACGGGGTTAT GTATGTTGTC GGAAACGACT ATAACATTGT AATGAATCCT TCAAAAAAAG AACACATGTC GCTGGCTATA AATGATGTTA GCCAGGAAGA AAAAGAACGG AATTTTTATA TTGACGAAGA GATATTCAAG AAAATTAAAT CAGAACCGAG CGGTGCTTTT ACAACAAAGT TGTATGGGAA AGATGTTTTG ATAACTTTTT CAACAATAGA TAAAGTCAAT GTTACGGATT TGGGATGGAC TTTTGTAACC GTTACGGAAG TAGATAAGAT TACGGAAAGT GTCAACAGAG TTTCGGCACA GGTAATAATT ATCGGTTTAA TTTGTTTGGC GCTGGGGATA CTGTTATCTC AATTAATAAA TAAAGACATT ACGAAGAATA TAAAGAAACT GGTAAAAACA ATGGAGAAAG TAGGCAGCGG TAATTTATCA ATAGAGTTTA AAGTTGAGGA GAGAAAGGAT GAGATAGGCA AGCTTAGTAA CAGTTTTGCA AAGATGATAA GCAACTTGAA CGGATTGATT ATGAGTGTTA AACATGCTTC GGATGTCACA ATTGACGCAT CTTCCAATGT ATCCGCAAAA ATTCAGGAAA CGTATGCTTC TATTCAACAA ACAAATGCTA TTTTGGACGT TATAAAGGAA AAGAGTTTTC AACAGGGTAA GATTGTAAAA GAAGGTGAGC GACAGGTTGC CTTGACAAAA GATGAAGTCA ATCAGGCAAA AGACACGATG AAAGATGTAG ATGCGATACT TTCCAAGTCA AAAGAAATCA GCGAAGATAA TCGAATGTCA GTAAGCTTGT TGCATGAAAT GTCTCAAAAT ATCAGAACAG CTATGAATGA AATAGGTGCG GAATCTAAAG AGCTTATTGC AACTTCAAAG GAAATAACAA AATTTACAAG ACAGATCAAA GAGATTTCAG AACAAACAAA ATTGATTGCT CTGAATTCAG CTATTGAGTC AGCCCGATTG GGAGCCCAGG GAAAGACATT CCATTTGTTA TCTGAAGAAA CCAGAAACTT GGCTTCAAAA ACCAAAGATT TGTCGGGCAG CATAGACCAG ATAATTCAAA ATCTTATTGA AAAGATTAAC AATACCAACA AGGTTGTATT AAAGTTGGAC AAAGTGGCCG AGAATACCGA AAATTCGGTA AAGGACGTAA CAGAAAGTCT GGATAAAAAT ATTGAATTTT TAAATGAGAT AACATCAAAT GTCTCTCGGA TAAAACAGGT ATTTACGCAC ATAGACGACT TTGTCAATCA GATAGTATCT ACGATAGAAT ATATAAGCGC AAGTGCCGAG GCGAATATTC AAGACATTTC TGATGTAAGC AAAGCGATGA ACGAGCAGAT AAAGTGTCAG GAATCATTAT TGGAGCAGAC CACCAATTTG CTTAACTTGT CACAGGAGCT CAAGAAAAAA GCTGAAGAGA TATCCTGA
|
Protein sequence | MKNRFNIREV FSKIKTSGFK KIKRLKQAIA SKVRFLKSLN EKKSSEKNIF MPFFRLKDVS IGIKLNVIVS IVLVVSLSVV ILYSFGIVRN ILVEQAKDGT LQVSKQTNSN MRMLLESMDQ EATALSRNEQ IADIISRLNS TDDATLQSRY SSQLRTMLTN YVDEKSNYFY TILAVSNNYL YSISGQQIIS HQINIKEVDS IKSFVESTKY SKWYDPYVQD VLIHQDTTGL GGKVITLAKK VYSRTKLKSP GQLFFYINYE NLGRIFNDLH LPYDGVMYVV GNDYNIVMNP SKKEHMSLAI NDVSQEEKER NFYIDEEIFK KIKSEPSGAF TTKLYGKDVL ITFSTIDKVN VTDLGWTFVT VTEVDKITES VNRVSAQVII IGLICLALGI LLSQLINKDI TKNIKKLVKT MEKVGSGNLS IEFKVEERKD EIGKLSNSFA KMISNLNGLI MSVKHASDVT IDASSNVSAK IQETYASIQQ TNAILDVIKE KSFQQGKIVK EGERQVALTK DEVNQAKDTM KDVDAILSKS KEISEDNRMS VSLLHEMSQN IRTAMNEIGA ESKELIATSK EITKFTRQIK EISEQTKLIA LNSAIESARL GAQGKTFHLL SEETRNLASK TKDLSGSIDQ IIQNLIEKIN NTNKVVLKLD KVAENTENSV KDVTESLDKN IEFLNEITSN VSRIKQVFTH IDDFVNQIVS TIEYISASAE ANIQDISDVS KAMNEQIKCQ ESLLEQTTNL LNLSQELKKK AEEIS
|
| |