Gene Information |
Locus tag | EcSMS35_4900 |
Symbol | tsr |
ID | 6147169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5019981 |
End bp | 5021645 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619703 |
Product | methyl-accepting chemotaxis protein I |
Protein accession | YP_001746810 |
Protein GI | 170682951 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
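The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5019981
END_BP = 5021645


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, protein_length(length_bp))  # 1665 554
```

With the values in this record the arithmetic gives 1665 bp and 554 aa, which agrees with the Gene Length and Protein Length fields.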
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.979291 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAAAC GTATCAAGAT CGTGACCAGC TTACTGCTGG TTTTGGCCGT TTTTGGCCTT TTACAACTGA CATCAGGCGG TCTGTTCTTT AATGCCTTAA AGAATGACAA AGAAAATTTC ACTGTTTTAC AAACCATTCG CCAGCAGCAA TCCACGCTGA ATGGCAGCTG GGTCGCGTTG TTGCAGACGC GTAACACCCT CAACCGCGCG GGTATCCGCT ACATGATGGA TCAGAATAAT ATTGGTAGCG GTTCAACCGT TGCTGAGCTG ATGCAGAGTG CCAGTATTTC GCTGAAACAG GCGGAAAAAA ACTGGGCGGA TTACGAAGCG TTGCCGCGTG ACCCGCGTCA GAGCACCGCC GCAGCGGCAG AGATCAAACG TAATTACGAT ATTTATCACA ATGCGCTGGC GGAGCTGATC CAACTGTTAG GTGCAGGCAA AATCAACGAG TTCTTTGATC AGCCAACCCA GGGATATCAG GACGGTTTCG AGAAGCAGTA TGTGGCTTAC ATGGAGCAAA ACGATCGGCT CTATGATATC GCCGTCAGCG ATAATAATGC CTCCTACAGC CAGGCGATGT GGATTCTGGT GGGCGTGATG ATCGTCGTAC TGGCGGTCAT CTTCGCCGTC TGGTTCGGTA TTAAAGCCTC GTTGGTAGCG CCAATGAATC GCCTGATTGA CAGCATTCGT CATATTGCAG GCGGCGATCT GGTGAAACCG ATTGAGGTGG ATGGCTCTAA TGAGATGGGG CAACTGGCTG AGAGTTTGCG CCATATGCAG GGAGAGCTGA TGCGTACCGT CGGTGATGTG CGCAACGGGG CCAATGCCAT CTATAGCGGT GCCAGCGAAA TCGCTACCGG CAATAACGAT CTCTCTTCGC GCACCGAGCA ACAGGCCGCT TCGCTGGAAG AGACGGCAGC CAGCATGGAG CAACTGACCG CAACGGTGAA GCAGAACGCC GAAAATGCGC GCCAGGCCAG CCACCTGGCG TTAAGTGCTT CTGAAACGGC GCAACGCGGC GGCAAAGTGG TGGATAACGT GGTGCAGACC ATGCGCGATA TCTCCACCAG TTCGCAGAAA ATCGCCGATA TTATCAGCGT AATTGACGGC ATTGCCTTCC AGACCAATAT TCTCGCTTTG AACGCGGCGG TTGAAGCAGC GCGCGCGGGT GAGCAAGGGC GCGGTTTTGC GGTGGTTGCG GGAGAAGTGC GTAATCTGGC CCAGCGTAGC GCTCAGGCGG CTCGCGAAAT TAAAAGCCTG ATTGAAGACT CGGTGGGCAA AGTGGATGTT GGCTCTACGC TGGTCGAAAG CGCCGGGGAA ACCATGGCAG AGATTGTCAG CGCTGTGACC CGCGTGACGG ACATTATGGG CGAAATAGCT TCTGCTTCTG ATGAGCAGAG CCGTGGTATC GATCAGGTTG GATTAGCGGT TGCTGAGATG GACCGGGTAA CTCAACAGAA CGCTGCGCTG GTGGAAGAAT CTGCCGCTGC CGCCGCCGCG CTGGAAGAGC AGGCCAGTCG CCTGACCGAA GCTGTGGCAG TGTTCCGGAT TCAGCAACAG CAGCAACATC AGCGTGAAAC ATCGGCTGTG GTAAAAACCG TGACGCCAGC TGCGCCGCGT AAAATGGCCG TGGCAGATAG CGGGGAGAAC TGGGAAACGT TTTAA
|
Protein sequence | MLKRIKIVTS LLLVLAVFGL LQLTSGGLFF NALKNDKENF TVLQTIRQQQ STLNGSWVAL LQTRNTLNRA GIRYMMDQNN IGSGSTVAEL MQSASISLKQ AEKNWADYEA LPRDPRQSTA AAAEIKRNYD IYHNALAELI QLLGAGKINE FFDQPTQGYQ DGFEKQYVAY MEQNDRLYDI AVSDNNASYS QAMWILVGVM IVVLAVIFAV WFGIKASLVA PMNRLIDSIR HIAGGDLVKP IEVDGSNEMG QLAESLRHMQ GELMRTVGDV RNGANAIYSG ASEIATGNND LSSRTEQQAA SLEETAASME QLTATVKQNA ENARQASHLA LSASETAQRG GKVVDNVVQT MRDISTSSQK IADIISVIDG IAFQTNILAL NAAVEAARAG EQGRGFAVVA GEVRNLAQRS AQAAREIKSL IEDSVGKVDV GSTLVESAGE TMAEIVSAVT RVTDIMGEIA SASDEQSRGI DQVGLAVAEM DRVTQQNAAL VEESAAAAAA LEEQASRLTE AVAVFRIQQQ QQHQRETSAV VKTVTPAAPR KMAVADSGEN WETF
|
| |
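The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (blocks allowed)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence, for illustration only.
    prefix = "ATGTTAAAAC GTATCAAGAT CGTGACCAGC"
    seq = clean_sequence(prefix)
    print(len(seq), seq.startswith("ATG"), gc_fraction(prefix))
```

Passing the complete Gene sequence value to `gc_fraction` would give a figure to compare with the 54% reported in the record; the prefix used here is too short to be representative.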