Gene Information |
Locus tag | ECH74115_4385 |
Symbol | aer |
ID | 6971686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4060170 |
End bp | 4061690 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388107 |
Product | aerotaxis receptor |
Protein accession | YP_002272544 |
Protein GI | 209395772 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein [COG2202] FOG: PAS/PAC domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
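The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, though both agree with the listed values. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
gene_len = span_length(4060170, 4061690)
print(gene_len, expected_protein_length(gene_len))  # 1521 506
```

Both results match the "Gene Length" and "Protein Length" fields.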
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.472426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCTC ATCCGTATGT CACCCAGCAA AATACCCCGC TGGCGGACGA TACCACTCTG ATGTCCACTA CCGATCTGCA AAGCTATATC ACTCATGCTA ATGACACTTT TGTGCAGGTG AGCGGCTTTA CCTTGCAAGA GTTACAAGGG CAGCCGCACA ACATGGTGCG TCACCCGGAT ATGCCAAAAG CGGCGTTTGC GGATATGTGG TTCACCCTGA AAAAAGGGGA GCCCTGGAGC GGCATCGTGA AAAATCGCCG CAAAAATGGT GACCATTATT GGGTGCGGGC CAATGCGGTA CCGATGGTGC GCGAGGGAAA AATCAGTGGC TATATGTCGA TTCGTACCCG GGCGACGGAT GAAGAGATCG CGGCGGTGGA GCCGCTGTAC AAAGCGCTGA ACGCCGGACG TACCGGTAAG CGTATTCATA AAGGCCTGGT GGTGCGTAAA GGCTGGCTGG GTAAACTGCC TTCATTACCG CTTCGCTGGC GGGCGCGTGG AGTGATGACC CTGATGTTTA TCTTGCTGGC GGCCATGCTT TGGTTTGTTG CTGCCCCGGT GGTGACGTAT ATCCTCTGTG CGTTAGTGGT ATTGTTGGCA AGCGCCTGTT TTGAATGGCA GATTGTCCGC CCGATAGAAA ATGTCGCCCG TCAGGCACTG AAGGTGGCGA CCGGAGAGCG TAATAGTGTT GAGCATCTGA ATCGCAGCGA TGAGCTGGGG CTGACATTAC GCGCGGTAGG GCAGCTTGGC CTGATGTGCC GTTGGTTAAT TAACGATGTC TCAAGCCAGG TGTCCAGTGT CAGAAACGGC AGTGAGACGC TGGCGAAAGG CACCGATGAA CTGAACGAAC ATACCCAGCA GACAGTTGAT AACGTTCAGC AAACGGTGGC GACCATGAAC CAAATGGCGG CGTCGGTGAA ACAGAACTCT GCCACGGCGT CGGCTGCCGA TAAACTGTCA ATCACCGCCA GTAATGCGGC AGTGCAGGGC GGGGAGGCGA TGACCACGGT GATCAAGACA ATGGACGATA TCGCCGACAG TACCCAGCGC ATTGGCACCA TTACTTCGCT GATTAACGAT ATTGCGTTTC AGACCAATAT TCTGGCCCTG AATGCGGCGG TGGAAGCGGC GCGTGCCGGC GAACAGGGCA AAGGTTTTGC GGTGGTGGCG GGGGAAGTGC GTCATTTAGC CAGCCGCAGC GCCAATGCTG CCAACGATAT TCGCAAGCTG ATTGATGCCA GTGCTGATAA GGTGCAATCC GGTTCGCAGC AGGTACACGC CGCCGGACGT ACGATGGAAG ATATTGTGGC ACAGGTGAAA AACGTCACCC AGTTGATTGC CCAGATTAGC CATTCAACGC TGGAACAGGC CGATGGTCTT TCCAGCCTGA CCCGTGCAGT GGATGAGCTT AACCTCATCA CCCAGAAAAA TGCCGAGCTG GTGGAAGAGA GTGCGCAGGT GTCGGCGATG GTGAAACACC GCGCCAGCCG ACTGGAAGAC GCGGTGACGG TGCTGCATTA A
|
Protein sequence | MSSHPYVTQQ NTPLADDTTL MSTTDLQSYI THANDTFVQV SGFTLQELQG QPHNMVRHPD MPKAAFADMW FTLKKGEPWS GIVKNRRKNG DHYWVRANAV PMVREGKISG YMSIRTRATD EEIAAVEPLY KALNAGRTGK RIHKGLVVRK GWLGKLPSLP LRWRARGVMT LMFILLAAML WFVAAPVVTY ILCALVVLLA SACFEWQIVR PIENVARQAL KVATGERNSV EHLNRSDELG LTLRAVGQLG LMCRWLINDV SSQVSSVRNG SETLAKGTDE LNEHTQQTVD NVQQTVATMN QMAASVKQNS ATASAADKLS ITASNAAVQG GEAMTTVIKT MDDIADSTQR IGTITSLIND IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASADKVQS GSQQVHAAGR TMEDIVAQVK NVTQLIAQIS HSTLEQADGL SSLTRAVDEL NLITQKNAEL VEESAQVSAM VKHRASRLED AVTVLH
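The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how one might strip that grouping, compute GC content, and translate the coding sequence. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs mainly in its set of permitted start codons. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = ungroup(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGTCTTCTC ATCCGTATGT CACCCAGCAA"
print(translate(prefix))  # MSSHPYVTQQ
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the "GC content" and "Protein sequence" fields to be checked in the same way.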