Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2623 |
Symbol | tap |
ID | 6971441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2477607 |
End bp | 2479208 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386487 |
Product | methyl-accepting protein IV |
Protein accession | YP_002270969 |
Protein GI | 209396965 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000000448067 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTAATC GTATTCGAAT TTCGACCACG CTGTTTTTAA TTTTGATTCT CTGCGGGATC TTGCAGATTG GCAGTAACGG CATGTCTTTT TGGGCATTTC GCGACGATTT GCAACGACTG AATCAGGTCG AGCAGAGCAA TCAGCAACGT GCGGCATTAG CGCAAACTCG GGCGGTAATG TTACAGGCCA GTACCACGCT GAACAAAGCA GGCACTCTGA CAGCGCTTAG CTATCCGGCG GATGACATTA AAACGTTGAT GACGACGGCG CGTGCCAGTC TGACGCAATC CACCACGCTG TTTAAAAGTT TTATGGCGAT GACTGCGGGC AACGAGCACG TCAGGGCATT GCAAAAAGAG ACGGAGAAAA GTTTTGCCCG CTGGCACAAC GATCTCGAAC ATCAGGCGAC CTGGCTTGAA AGTAATCAAC TTTCGGATTT CCTCACTGCG CCGGTGCAGG GATCACAGAA TGCGTTTGAC GTTAACTTTG AGGCCTGGCA GCTGGAGATC ATCCATGTGC TGGAAGCCGC CAGTGCGCAA AGCCAGCGTA ACTATCAGAT TTCGGCGCTG GTGTTTATCA GCATGATTAT TGTTGCGGCG ATCTACATCA GCAGTGCGCT GTGGTGGACG CGCAAGATGA TTGTTCAACC ACTGGCCATT ATCGGTAGCC ATTTTGACAG CATTGCTGCG GGTAATCTGG CGCGTCCGAT TGCGGTATAT GGTCGTAATG AGATCACCGC CATTTTTGCC AGCCTGAAGA CCATGCAGCA GGCTTTGCGT GGGACGGTAA GTGATGTGCG TAAGGGAAGC CAGGAGATGC ACATTGGTAT CGCGGAGATT GTCGCAGGCA ATAACGATCT CTCAAGTCGT ACCGAACAGC AGGCGGCATC GCTGGCACAA ACGGCCGCCA GTATGGAGCA ATTAACCGCC ACGGTAGGGC AAAACGCCGA TAACGCACGA CAGGCGTCGG AACTTGCAAA AAATGCCGCG ACAACGGCGC AGGCGGGCGG TGTTCAGGTC AGTACCATGA CTCACACCAT GCAGGAGATC GCCACCAGCT CGCAAAAAAT TGGCGACATT ATCAGCGTTA TCGACGGAAT TGCTTTCCAG ACCAATATTC TGGCCCTGAA TGCGGCAGTA GAAGCGGCTC GCGCCGGAGA GCAGGGGCGT GGTTTTGCGG TAGTGGCAGG AGAAGTGCGC AATCTTGCCA GCCGAAGCGC CCAGGCGGCA AAAGAGATCA AAGGGCTGAT CGAAGAGTCA GTCAATCGTG TCCAGCAGGG TTCGAAACTG GTGAATAACG CCGCCGCGAC CATGACCGAT ATTGTCAGTT CGGTGACCCG CGTGAACGAC ATTATGGGAG AAATTGCTTC GGCCTCGGAA GAACAACGGC GGGGGATTGA GCAGGTTGCG CAGGCTGTCA GCCAGATGGA TCAGGTGACA CAGCAGAACG CCTCGCTGGT AGAAGAAGCG GCGGTGGCAA CGGAACAACT GGCGAATCAG GCCGACCATC TTTCGTCGCG TGTGGCGGTA TTTACCCTTG AAGAACATGA AGTAGCACGA CATGAGTCGG CGCAGTTACA AATTGCGCCA GTGGTATCCT GA
|
Protein sequence | MFNRIRISTT LFLILILCGI LQIGSNGMSF WAFRDDLQRL NQVEQSNQQR AALAQTRAVM LQASTTLNKA GTLTALSYPA DDIKTLMTTA RASLTQSTTL FKSFMAMTAG NEHVRALQKE TEKSFARWHN DLEHQATWLE SNQLSDFLTA PVQGSQNAFD VNFEAWQLEI IHVLEAASAQ SQRNYQISAL VFISMIIVAA IYISSALWWT RKMIVQPLAI IGSHFDSIAA GNLARPIAVY GRNEITAIFA SLKTMQQALR GTVSDVRKGS QEMHIGIAEI VAGNNDLSSR TEQQAASLAQ TAASMEQLTA TVGQNADNAR QASELAKNAA TTAQAGGVQV STMTHTMQEI ATSSQKIGDI ISVIDGIAFQ TNILALNAAV EAARAGEQGR GFAVVAGEVR NLASRSAQAA KEIKGLIEES VNRVQQGSKL VNNAAATMTD IVSSVTRVND IMGEIASASE EQRRGIEQVA QAVSQMDQVT QQNASLVEEA AVATEQLANQ ADHLSSRVAV FTLEEHEVAR HESAQLQIAP VVS
|
| |