Gene ECH74115_4385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4385 
Symbolaer 
ID6971686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4060170 
End bp4061690 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID643388107 
Productaerotaxis receptor 
Protein accessionYP_002272544 
Protein GI209395772 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.472426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCTC ATCCGTATGT CACCCAGCAA AATACCCCGC TGGCGGACGA TACCACTCTG 
ATGTCCACTA CCGATCTGCA AAGCTATATC ACTCATGCTA ATGACACTTT TGTGCAGGTG
AGCGGCTTTA CCTTGCAAGA GTTACAAGGG CAGCCGCACA ACATGGTGCG TCACCCGGAT
ATGCCAAAAG CGGCGTTTGC GGATATGTGG TTCACCCTGA AAAAAGGGGA GCCCTGGAGC
GGCATCGTGA AAAATCGCCG CAAAAATGGT GACCATTATT GGGTGCGGGC CAATGCGGTA
CCGATGGTGC GCGAGGGAAA AATCAGTGGC TATATGTCGA TTCGTACCCG GGCGACGGAT
GAAGAGATCG CGGCGGTGGA GCCGCTGTAC AAAGCGCTGA ACGCCGGACG TACCGGTAAG
CGTATTCATA AAGGCCTGGT GGTGCGTAAA GGCTGGCTGG GTAAACTGCC TTCATTACCG
CTTCGCTGGC GGGCGCGTGG AGTGATGACC CTGATGTTTA TCTTGCTGGC GGCCATGCTT
TGGTTTGTTG CTGCCCCGGT GGTGACGTAT ATCCTCTGTG CGTTAGTGGT ATTGTTGGCA
AGCGCCTGTT TTGAATGGCA GATTGTCCGC CCGATAGAAA ATGTCGCCCG TCAGGCACTG
AAGGTGGCGA CCGGAGAGCG TAATAGTGTT GAGCATCTGA ATCGCAGCGA TGAGCTGGGG
CTGACATTAC GCGCGGTAGG GCAGCTTGGC CTGATGTGCC GTTGGTTAAT TAACGATGTC
TCAAGCCAGG TGTCCAGTGT CAGAAACGGC AGTGAGACGC TGGCGAAAGG CACCGATGAA
CTGAACGAAC ATACCCAGCA GACAGTTGAT AACGTTCAGC AAACGGTGGC GACCATGAAC
CAAATGGCGG CGTCGGTGAA ACAGAACTCT GCCACGGCGT CGGCTGCCGA TAAACTGTCA
ATCACCGCCA GTAATGCGGC AGTGCAGGGC GGGGAGGCGA TGACCACGGT GATCAAGACA
ATGGACGATA TCGCCGACAG TACCCAGCGC ATTGGCACCA TTACTTCGCT GATTAACGAT
ATTGCGTTTC AGACCAATAT TCTGGCCCTG AATGCGGCGG TGGAAGCGGC GCGTGCCGGC
GAACAGGGCA AAGGTTTTGC GGTGGTGGCG GGGGAAGTGC GTCATTTAGC CAGCCGCAGC
GCCAATGCTG CCAACGATAT TCGCAAGCTG ATTGATGCCA GTGCTGATAA GGTGCAATCC
GGTTCGCAGC AGGTACACGC CGCCGGACGT ACGATGGAAG ATATTGTGGC ACAGGTGAAA
AACGTCACCC AGTTGATTGC CCAGATTAGC CATTCAACGC TGGAACAGGC CGATGGTCTT
TCCAGCCTGA CCCGTGCAGT GGATGAGCTT AACCTCATCA CCCAGAAAAA TGCCGAGCTG
GTGGAAGAGA GTGCGCAGGT GTCGGCGATG GTGAAACACC GCGCCAGCCG ACTGGAAGAC
GCGGTGACGG TGCTGCATTA A
 
Protein sequence
MSSHPYVTQQ NTPLADDTTL MSTTDLQSYI THANDTFVQV SGFTLQELQG QPHNMVRHPD 
MPKAAFADMW FTLKKGEPWS GIVKNRRKNG DHYWVRANAV PMVREGKISG YMSIRTRATD
EEIAAVEPLY KALNAGRTGK RIHKGLVVRK GWLGKLPSLP LRWRARGVMT LMFILLAAML
WFVAAPVVTY ILCALVVLLA SACFEWQIVR PIENVARQAL KVATGERNSV EHLNRSDELG
LTLRAVGQLG LMCRWLINDV SSQVSSVRNG SETLAKGTDE LNEHTQQTVD NVQQTVATMN
QMAASVKQNS ATASAADKLS ITASNAAVQG GEAMTTVIKT MDDIADSTQR IGTITSLIND
IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASADKVQS
GSQQVHAAGR TMEDIVAQVK NVTQLIAQIS HSTLEQADGL SSLTRAVDEL NLITQKNAEL
VEESAQVSAM VKHRASRLED AVTVLH