Gene Information |
Locus tag | ECH74115_2142 |
Symbol | sotB |
ID | 6971914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2056040 |
End bp | 2057230 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643386037 |
Product | sugar efflux transporter |
Protein accession | YP_002270526 |
Protein GI | 209400312 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
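The coordinate and length fields above are related by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Consistency check for the coordinate and length fields of a CDS record.
# Assumptions (not stated in the record itself): coordinates are 1-based
# and inclusive, and the CDS ends with exactly one untranslated stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the Gene Information table.
    length = gene_length(2056040, 2057230)
    print(length)                  # 1191, matching "Gene Length"
    print(protein_length(length))  # 396, matching "Protein Length"
```

Both results agree with the Gene Length (1191 bp) and Protein Length (396 aa) fields listed for ECH74115_2142.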
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCTGTTG GCCTGCTCTC TGACATTGCG CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATCATGT TGACCATTTA CGCATGGGTA GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGCC AGGTTGAACG GCGCAAATTA CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT ATCGTGGAGC TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG ATTACGGCGT CCCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGAGCACA GGCATTGAGT TTAATTGCCA CAGGTACAGC ACTGGCAATG GTATTAGGAT TACCTCTCGG GCGCATTGTG GGGCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATCGGGGC GCTTATTACC CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCATTGAAA AGCCTCCCGC TATTATTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTG GTTGTCACCG CCCATTACAC GGCATACAGC TATATTGAGC CTTTTGTGCA AAACATTGCG GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAATATGCGT CTGCGTTGGT AAGTACGGCG ATTGCGCTGT TGCTGGTGTG CCTGGCACTG CTGCTACCTG CGGCGAACAG TGAAATACAC CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGA TCATCGGGCT TGGTATGCAG GTTAAAGTGC TGGCGCTGGC ACCAGATGCC ACCGACGTCG CGATGGCGCT ATTCTCCGGG ATATTTAATA TTGGAATCGG GGCGGGTGCG TTGGTAGGTA ATCAGGTGAG TCTGCACTTG TCAATGTCGA TGATTGGTTA TGTGGGCACG GTGCCTGCTT TTGCCGCGTT AATATGGTCA ATCATTATAT TTCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
|
Protein sequence | MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASALVSTA IALLLVCLAL LLPAANSEIH LGVLSIFWGI AMMIIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHL SMSMIGYVGT VPAFAALIWS IIIFRRWPVT LEEQTQ
|
| |
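The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be joined and sanity-checked follows; the helper names are hypothetical, the start-codon set is only the most common bacterial subset, and the example input is a short toy string, not the sotB gene.

```python
# Helpers for sequence fields printed in space-separated blocks.
# All names here are illustrative; they do not come from any database API.

# Common bacterial start codons (a subset of those allowed by translation
# table 11) and the three standard stop codons.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(field: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 with a start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    toy = join_blocks("ATGACAACAA ACTAG")  # toy input, not the real gene
    print(len(toy), round(gc_fraction(toy), 3), looks_like_complete_cds(toy))
```

Applied to the full Gene sequence field, the joined length can be compared against the listed Gene Length, and the GC fraction against the listed GC content.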