Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2178 |
Symbol | tap |
ID | 6270956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1980993 |
End bp | 1982594 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726203 |
Product | methyl-accepting protein IV |
Protein accession | YP_001880692 |
Protein GI | 187731667 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAATC GTATTCGAAT TTCGACCACG CTGTTTTTAA TTTTGATTCT CTGCGGGATC TTGCAGATTG GCAGTAACGG CATGTCTTTT TGGGCATTTC GCGACGATTT GCAACGACTG AATCAGGTCG AGCAGAGCAA TCAGCAACGT GCGGCATTAG CGCAAACTCG GGCGATAATG TTACAGGCCA GTACCGCGCT GAACAAAGCG GGCACTCTGA CGGCACTTAG CTATCCGGCG GATGACATTA AAACGTTGAT GACGACGGCG CGCGCCAGTC TGACGCAATC CACCACGCTG TTTAAAAGTT TTATGGCGAT GACTGCGGGC AACGAGCACG TCAGGGCATT GCAAAAAGAG ACGGAGAAAA GTTTTGCCCG CTGGCACAAC GATCTCGAAC ATCAGGCGAC CTGGCTTGAA AGTAATCAAC TTTCGGATTT CCTCACTGCG CCGGTGCAGG GATCACAGAA TGCGTTTGAC GTTAACTTTG AGGCCTGGCA GCTGGAGATC AACCATGTGC TGGAAGCCGC CAGTGCGCAA AGTCAGCGTA ACTATCAGAT TTCGGCGCTG GTGTTTATCA GCATGATTAT TGTTGCGGCA ATCTACATCA GCAGTGCGCT GTGGTGGACG CGCAAGATGA TTGTTCAACC ACTGGCCATT ATTGGTAGCC ATTTTGACAG CATTGCTGCG GGTAATCTGG CGCGTCCGAT TGCGGTATAT GGTCGTAATG AGATCACCGC CATTTTTGCC AGCCTGAAGA CCATGCAGCA GGCTTTGCGT GGGACGGTGA GTGATGTGCG TAAGGGAAGC CATGAGATGC ACATTGGGAT CGCGGAGATT GTCGCAGGCA ATAACGATCT CTCAAGTCGT ACCGAACAGC AGGCGGCATC GCTGGCACAA ACGGCCGCCA GTATGGAGCA ATTAACCGCC ACGGTAGGGC AAAACGCCGA TAACGCACGA CAGGCGTCGG AACTGGCAAA AAATGCCGCG ACAACGGCGC AGGCGGGCGG TGTTCAGGTC AGTACCATGA CTCACACCAT GCAGGAGATC GCCACCAGCT CGCAAAAAAT TGGCGACATT ATCAGCGTTA TCGACGGAAT TGCTTTCCAG ACCAATATTC TGGCCCTGAA TGCGGCAGTG GAAGCGGCTC GCGCCGGAGA GCAGGGGCGT GGTTTTGCGG TAGTGGCAGG TGAAGTGCGC AATCTTGCCA GCCGTAGCGC GCAGGCAGCA AAAGAGATCA AAGGGCTGAT CGAAGAGTCA GTCAATCGTG TCCAGCAGGG TTCGAAACTG GTGAATAACG CCGCCGCGAC CATGACCGAT ATTGTCAGTT CGGTGACCCG CGTGAACGAC ATTATGGGAG AAATTGCTTC GGCGTCGGAA GAACAACGGC GGGGGATTGA GCAGGTTGCA CAGGCTGTCA GCCAGATGGA TCAGGTGACT CAGCAGAACG CCTCGCTGGT AGAAGAAGCG GCGGTGGCAA CGGAACAACT GGCGAATCAG GCCGACCATC TTTCGTCGCG TGTGGCGGTA TTTACCCTTG AAGAACATGA AGTAGCACGA CATGAGTCGG CGCAGTTACA AATTGCGCCA GTGGTATCCT GA
|
Protein sequence | MFNRIRISTT LFLILILCGI LQIGSNGMSF WAFRDDLQRL NQVEQSNQQR AALAQTRAIM LQASTALNKA GTLTALSYPA DDIKTLMTTA RASLTQSTTL FKSFMAMTAG NEHVRALQKE TEKSFARWHN DLEHQATWLE SNQLSDFLTA PVQGSQNAFD VNFEAWQLEI NHVLEAASAQ SQRNYQISAL VFISMIIVAA IYISSALWWT RKMIVQPLAI IGSHFDSIAA GNLARPIAVY GRNEITAIFA SLKTMQQALR GTVSDVRKGS HEMHIGIAEI VAGNNDLSSR TEQQAASLAQ TAASMEQLTA TVGQNADNAR QASELAKNAA TTAQAGGVQV STMTHTMQEI ATSSQKIGDI ISVIDGIAFQ TNILALNAAV EAARAGEQGR GFAVVAGEVR NLASRSAQAA KEIKGLIEES VNRVQQGSKL VNNAAATMTD IVSSVTRVND IMGEIASASE EQRRGIEQVA QAVSQMDQVT QQNASLVEEA AVATEQLANQ ADHLSSRVAV FTLEEHEVAR HESAQLQIAP VVS
|
| |