Gene EcSMS35_1753 details


Gene Information

Locus tag: EcSMS35_1753
Symbol: trg
ID: 6142692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1756817
End bp: 1758457
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 51%
IMG OID: 641616629
Product: methyl-accepting chemotaxis protein III
Protein accession: YP_001743807
Protein GI: 170682410
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
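
The start and end coordinates, the gene length, and the protein length in this record agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption; the record does not state it. A minimal Python sketch of the check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1756817, 1758457)
print(length_bp)                  # 1641
print(protein_length(length_bp))  # 546
```

Both values match the Gene Length and Protein Length fields of the record.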


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 2.79367e-17
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATACAA CTCCCTCACA GCGATTAGGT TTTTTGCATC ACATCAGGCT GGTTCCGTTA 
TTTGCCTGCA TTCTTGGCGG TATCTTAGTT CTATTCGCAT TAAGTTCTGC TCTGGCTGGC
TATTTCCTCT GGCAGGCCGA TCGCGATCAG CGTGATGTTA CTGCGGAGAT TGAGATTCGG
ACCGGGTTAG CGAACAGTTC AGATTTTTTG CGTTCAGCCC GGATCAATAT GATTCAGGCC
GGGGCAGCGA GTCGTATTGC GGAAATGGAA GCAATGAAGC GAAATATTGC GCAAGCCGAA
TCGGAGATTA AACAGTCGCA GCAAGGTTAT CGTGCTTATC AGAATCGACC GGTGAAAACA
CCTGCTGATG AAGCCCTCGA CACTGAATTA AATCAACGCT TCCAGGCTTA TATCACGGGT
ATGCAACCGA TGTTGAAATA TGCCAAAAAT GGCATGTTTG AAGCGATTAT CAATCATGAA
AGTGAGCAGA TCCGACCGCT GGATAATGCT TATACCGATA TTTTGAACAA AGCCGTTAAG
ATACGTAGCA CCAGAGCCAA CCAACTGGCG GAACTGGCCC ATCAGCGCAC CCGCCTGGGT
GGGATGTTCA TGATTGGCGC GTTTGTGCTT GCCCTGGTGA TGACGCTGAT AACATTTATG
GTGCTACGTC GGATCGTCAT TCGTCCACTG CAAAATGCCG CACAACGGAT TGAAAAAATC
GCCAGTGGCG ATCTGACGAT GAATGATGAA CCGGCGGGTC GTAATGAAAT CGGTCGCTTA
AGTCGTCATT TACAGCAAAT GCAGCATTCA CTGGGGATGA CTGTAGGGAC CGTTCGACAG
GGCGCGGAAG AGATTTATCG TGGCACCAGC GAAATTTCAG CTGGCAATGC GGACCTGTCA
TCTCGCACCG AAGAACAAGC GGCGGCTATC GAACAAACTG CCGCCAGCAT GGAGCAACTC
ACTGCGACGG TGAAACAGAA TGCGGATAAC GCGCATCATG CCAGCAAACT GGCGCAGGAG
GCTTCTATTA AAGCCAGCGA TGGCGGGCAG ATGGTTTCCG GTGTAGTAAA AACGATGGGC
GCTATCTCCA CGAGTTCGAA GAAAATTTCT GAGATCACCG CCGTCATCAA CAGTATTGCT
TTCCAGACGA ATATTCTGGC ACTGAATGCT GCCGTTGAAG CCGCGCGAGC GGGTGAGCAA
GGACGTGGAT TTGCCGTTGT CGCCAGCGAA GTACGGACAC TCGCAAGCCG CAGCGCTCAG
GCGGCGAAAG AGATTGAAGG CTTGATCAGT GAATCAGTCA GGTTAATTGA CCTGGGGTCG
GATGAGGTGG CAACGGCAGG GAAAACCATG AGCACTATTG TTGATGCCGT CGCGAGTGTC
ACACATATCA TGCAGGAAAT CGCCGCCGCC TCGGATGAAC AAAGTAGAGG CATAACGCAG
GTTAGCCAGG CGATTTCTGA AATGGATAAG GTGACGCAAC AGAATGCTTC TCTGGTAGAA
GAGGCCTCAG CGGCGGCGGT GTCCCTTGAA GAACAGGCGG CACGATTAAC TGAGGCGGTG
GACGTATTCC GTCTGAACAA ACATTCTGTG TCGGCAGAAC CTCGCGGAGC GGGTGAACCA
GTTAGTTTCG CTACGGTGTG A
 
Protein sequence
MNTTPSQRLG FLHHIRLVPL FACILGGILV LFALSSALAG YFLWQADRDQ RDVTAEIEIR 
TGLANSSDFL RSARINMIQA GAASRIAEME AMKRNIAQAE SEIKQSQQGY RAYQNRPVKT
PADEALDTEL NQRFQAYITG MQPMLKYAKN GMFEAIINHE SEQIRPLDNA YTDILNKAVK
IRSTRANQLA ELAHQRTRLG GMFMIGAFVL ALVMTLITFM VLRRIVIRPL QNAAQRIEKI
ASGDLTMNDE PAGRNEIGRL SRHLQQMQHS LGMTVGTVRQ GAEEIYRGTS EISAGNADLS
SRTEEQAAAI EQTAASMEQL TATVKQNADN AHHASKLAQE ASIKASDGGQ MVSGVVKTMG
AISTSSKKIS EITAVINSIA FQTNILALNA AVEAARAGEQ GRGFAVVASE VRTLASRSAQ
AAKEIEGLIS ESVRLIDLGS DEVATAGKTM STIVDAVASV THIMQEIAAA SDEQSRGITQ
VSQAISEMDK VTQQNASLVE EASAAAVSLE EQAARLTEAV DVFRLNKHSV SAEPRGAGEP
VSFATV
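
The protein sequence can be recovered from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon.

    Whitespace is ignored, so the 10-base blocks used above can be pasted as-is.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGAATACAA CTCCCTCACA GCGATTAGGT"))  # MNTTPSQRLG
# Last 21 bases of the gene sequence, ending in the TGA stop codon
print(translate("GTTAGTTTCG CTACGGTGTG A"))           # VSFATV
```

The two outputs match the first ten and last six residues of the protein sequence listed above.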