Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1299 |
Symbol | tap |
ID | 6144587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1287756 |
End bp | 1289357 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616177 |
Product | methyl-accepting protein IV |
Protein accession | YP_001743357 |
Protein GI | 170679775 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.00139959 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTAATC GTATTCGAAT TTCGACCACG CTGTTTTTAA TTTTGATTCT CTGCGGGATC TTGCAGATTG GCAGTAACGG CATGTCTTTT TGGGCATTTC GCGACGATGT GCAACGACTG AATCAGGTCG AGCAGAGCAA TCAGCAACGT GCGGCTTTAG CGCAAACTCG GGCGGTAATG TTACAGGCCA GTACCGCGCT GAACAAAGCG GGCACTCTGA CGGCGCTTAG CTATCCGGCG GATGACATTA AAACGTTGAT GACGACGGCG CGCGCCAGTC TGACGCAATC CACCACGCTG TTTAAAAGTT TTATGGCGAT GACTGCGGGC AACGAGCACG TCAGGGCGTT GCAAAAAGAG ACGGAGAAAA GTTTTGCCCG CTGGCACAAC GATCTCGAAC ATCAGGCGAC CTGGCTTGAA AGTAATCAAC TTTCGGATTT CCTCTCAGCG CCGGTGCAGG AATCACAAAA TGCGTTTGAC GTTAACTTTG AGGTCTGGCA GCAGGAGATC AACCATGTGC TGGAAGGTGC CAGTGCGCAA AGCCAGCGTA ACTATCACAT TTCGGCGCTG GTGTTTATCA GCATGATTAT TGTTGCAGCG CTCTACATCA GCAGTGCGCT GTGGTGGACG CGCAAGATGA TTGTTCAACC ACTGGCCATC ATCGGTAGCC ATTTTGACAG CATTGCTGCG GGTAATCTGG CGCGTCCGAT TGCGGTATAT GGTCGCAATG AGATCACCGC CATTTTTGCC AGTCTGAAGA CCATGCAGCA GGCTTTGCGT GGGACGGTAA GTGATGTGCG TAAGGGAAGC CAGGAGATGC ACATTGGTAT CGCGGAGATT GTCGCAGGCA ATAACGATCT CTCAAGTCGC ACCGAACAGC AGGCGGCATC GCTGGCACAA ACGGCCGCCA GTATGGAGCA ATTAACCGCC ACGGTAGGGC AAAACGCCGA TAACGCACGA CAGGCGTCGG AACTGGCAAA AAATGCCGCG ACAACGGCGC AGGCGGGCGG TGTTCAGGTC AGTACCATGA CTCACACCAT GCAGGAGATC GCCACCAGTT CGCAAAAAAT TGGCGACATT ATCAGCGTTA TCGACGGCAT TGCTTTCCAG ACCAATATTC TGGCCCTGAA TGCGGCAGTG GAAGCGGCTC GCGCCGGAGA GCAGGGGCGT GGTTTTGCGG TCGTGGCAGG TGAAGTGCGC AATCTTGCCA GCCGTAGCGC CCAGGCAGCA AAAGAGATCA AAGGGCTGAT CGAAGAGTCA GTCAATCGTG TCCAGCAGGG TTCGAAACTG GTGAATAACG CCGCCGGGAC CATGACCGAT ATTGTCAGTT CGGTGACCCG CGTGAACGAC ATTATGGGAG AAATTGCTTC GGCCTCGGAA GAACAACGGC GGGGGATTGA GCAGGTTGCA CAGGCTGTCA GCCAGATGGA TCAGGTGACA CAGCAGAACG CCTCCCTGGT AGAAGAAGCG GCGGTGGCAA CGGAACAACT GGCGAATCAG GCCGACCATC TTTCGTCGCG CGTGGCGGTA TTTACCCTTG AAGAGCATGA AGTAGCACGA CATGAGTCGG CGCAGTTACA AATTGCGCCA GTGGTATCCT GA
|
Protein sequence | MFNRIRISTT LFLILILCGI LQIGSNGMSF WAFRDDVQRL NQVEQSNQQR AALAQTRAVM LQASTALNKA GTLTALSYPA DDIKTLMTTA RASLTQSTTL FKSFMAMTAG NEHVRALQKE TEKSFARWHN DLEHQATWLE SNQLSDFLSA PVQESQNAFD VNFEVWQQEI NHVLEGASAQ SQRNYHISAL VFISMIIVAA LYISSALWWT RKMIVQPLAI IGSHFDSIAA GNLARPIAVY GRNEITAIFA SLKTMQQALR GTVSDVRKGS QEMHIGIAEI VAGNNDLSSR TEQQAASLAQ TAASMEQLTA TVGQNADNAR QASELAKNAA TTAQAGGVQV STMTHTMQEI ATSSQKIGDI ISVIDGIAFQ TNILALNAAV EAARAGEQGR GFAVVAGEVR NLASRSAQAA KEIKGLIEES VNRVQQGSKL VNNAAGTMTD IVSSVTRVND IMGEIASASE EQRRGIEQVA QAVSQMDQVT QQNASLVEEA AVATEQLANQ ADHLSSRVAV FTLEEHEVAR HESAQLQIAP VVS
|
| |