Gene EcSMS35_4900 details

Gene Information

Locus tag: EcSMS35_4900
Symbol: tsr
ID: 6147169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 5019981
End bp: 5021645
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 54%
IMG OID: 641619703
Product: methyl-accepting chemotaxis protein I
Protein accession: YP_001746810
Protein GI: 170682951
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
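
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below is one way to cross-check them, assuming the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Values copied from the record; the helper names below are hypothetical.
START_BP = 5019981
END_BP = 5021645


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1665 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) listed in the record.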


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.979291
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAAC GTATCAAGAT CGTGACCAGC TTACTGCTGG TTTTGGCCGT TTTTGGCCTT 
TTACAACTGA CATCAGGCGG TCTGTTCTTT AATGCCTTAA AGAATGACAA AGAAAATTTC
ACTGTTTTAC AAACCATTCG CCAGCAGCAA TCCACGCTGA ATGGCAGCTG GGTCGCGTTG
TTGCAGACGC GTAACACCCT CAACCGCGCG GGTATCCGCT ACATGATGGA TCAGAATAAT
ATTGGTAGCG GTTCAACCGT TGCTGAGCTG ATGCAGAGTG CCAGTATTTC GCTGAAACAG
GCGGAAAAAA ACTGGGCGGA TTACGAAGCG TTGCCGCGTG ACCCGCGTCA GAGCACCGCC
GCAGCGGCAG AGATCAAACG TAATTACGAT ATTTATCACA ATGCGCTGGC GGAGCTGATC
CAACTGTTAG GTGCAGGCAA AATCAACGAG TTCTTTGATC AGCCAACCCA GGGATATCAG
GACGGTTTCG AGAAGCAGTA TGTGGCTTAC ATGGAGCAAA ACGATCGGCT CTATGATATC
GCCGTCAGCG ATAATAATGC CTCCTACAGC CAGGCGATGT GGATTCTGGT GGGCGTGATG
ATCGTCGTAC TGGCGGTCAT CTTCGCCGTC TGGTTCGGTA TTAAAGCCTC GTTGGTAGCG
CCAATGAATC GCCTGATTGA CAGCATTCGT CATATTGCAG GCGGCGATCT GGTGAAACCG
ATTGAGGTGG ATGGCTCTAA TGAGATGGGG CAACTGGCTG AGAGTTTGCG CCATATGCAG
GGAGAGCTGA TGCGTACCGT CGGTGATGTG CGCAACGGGG CCAATGCCAT CTATAGCGGT
GCCAGCGAAA TCGCTACCGG CAATAACGAT CTCTCTTCGC GCACCGAGCA ACAGGCCGCT
TCGCTGGAAG AGACGGCAGC CAGCATGGAG CAACTGACCG CAACGGTGAA GCAGAACGCC
GAAAATGCGC GCCAGGCCAG CCACCTGGCG TTAAGTGCTT CTGAAACGGC GCAACGCGGC
GGCAAAGTGG TGGATAACGT GGTGCAGACC ATGCGCGATA TCTCCACCAG TTCGCAGAAA
ATCGCCGATA TTATCAGCGT AATTGACGGC ATTGCCTTCC AGACCAATAT TCTCGCTTTG
AACGCGGCGG TTGAAGCAGC GCGCGCGGGT GAGCAAGGGC GCGGTTTTGC GGTGGTTGCG
GGAGAAGTGC GTAATCTGGC CCAGCGTAGC GCTCAGGCGG CTCGCGAAAT TAAAAGCCTG
ATTGAAGACT CGGTGGGCAA AGTGGATGTT GGCTCTACGC TGGTCGAAAG CGCCGGGGAA
ACCATGGCAG AGATTGTCAG CGCTGTGACC CGCGTGACGG ACATTATGGG CGAAATAGCT
TCTGCTTCTG ATGAGCAGAG CCGTGGTATC GATCAGGTTG GATTAGCGGT TGCTGAGATG
GACCGGGTAA CTCAACAGAA CGCTGCGCTG GTGGAAGAAT CTGCCGCTGC CGCCGCCGCG
CTGGAAGAGC AGGCCAGTCG CCTGACCGAA GCTGTGGCAG TGTTCCGGAT TCAGCAACAG
CAGCAACATC AGCGTGAAAC ATCGGCTGTG GTAAAAACCG TGACGCCAGC TGCGCCGCGT
AAAATGGCCG TGGCAGATAG CGGGGAGAAC TGGGAAACGT TTTAA
 
Protein sequence
MLKRIKIVTS LLLVLAVFGL LQLTSGGLFF NALKNDKENF TVLQTIRQQQ STLNGSWVAL 
LQTRNTLNRA GIRYMMDQNN IGSGSTVAEL MQSASISLKQ AEKNWADYEA LPRDPRQSTA
AAAEIKRNYD IYHNALAELI QLLGAGKINE FFDQPTQGYQ DGFEKQYVAY MEQNDRLYDI
AVSDNNASYS QAMWILVGVM IVVLAVIFAV WFGIKASLVA PMNRLIDSIR HIAGGDLVKP
IEVDGSNEMG QLAESLRHMQ GELMRTVGDV RNGANAIYSG ASEIATGNND LSSRTEQQAA
SLEETAASME QLTATVKQNA ENARQASHLA LSASETAQRG GKVVDNVVQT MRDISTSSQK
IADIISVIDG IAFQTNILAL NAAVEAARAG EQGRGFAVVA GEVRNLAQRS AQAAREIKSL
IEDSVGKVDV GSTLVESAGE TMAEIVSAVT RVTDIMGEIA SASDEQSRGI DQVGLAVAEM
DRVTQQNAAL VEESAAAAAA LEEQASRLTE AVAVFRIQQQ QQHQRETSAV VKTVTPAAPR
KMAVADSGEN WETF
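
The sequences are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full gene length. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and only the first row of the gene sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTTAAAAC GTATCAAGAT CGTGACCAGC TTACTGCTGG TTTTGGCCGT TTTTGGCCTT"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the complete gene sequence through the same two helpers would give a value that can be compared with the GC content field of the record.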