Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1610 |
Symbol | sotB |
ID | 5592684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1633850 |
End bp | 1635040 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640920762 |
Product | sugar efflux transporter |
Protein accession | YP_001458318 |
Protein GI | 157161000 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCTGTTG GCCTGCTCTC TGACATTGCG CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATCATGT TGACCATTTA CGCATGGGTA GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGTC AGGTTGAACG GCGCAAATTA CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT GTCGTGGAGC TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG ATTACGGCGT CTCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGAGCACA GGCATTGAGT TTAATTGCCA CCGGTACAGC ACTGGCGATG GTCTTAGGTT TACCTCTCGG GCGCATTGTG GGCCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATTGGGGC GCTTATCACC CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCACTGAAA AGCCTCCCGC TATTGTTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTG GTTGTCACCG CCCATTACAC GGCATACAGC TATATCGAGC CTTTTGTGCA AAACATTGCG GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAGTATGCGT CTGCGTTGGT GAGTACGGCA ATTGCGCTGT TGCTGGTGTG TCTGGCACTG CTGCTACCTG CGGCGAACAG TGAAATACAC CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGA TCATCGGGCT TGGTATGCAG GTTAAAGTGC TGGCGCTGGC ACCAGATGCT ACCGACGTCG CGATGGCGCT ATTCTCCGGC ATATTTAATA TTGGAATCGG GGCGGGTGCG TTGGTAGGTA ATCAGGTGAG TTTGCACTGG TCAATGTCGA TGATTGGTTA TGTGGGCGCG GTGCCTGCTT TTGCCGCGTT AATATGGTCA ATCATTATAT TTCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
|
Protein sequence | MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASALVSTA IALLLVCLAL LLPAANSEIH LGVLSIFWGI AMMIIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHW SMSMIGYVGA VPAFAALIWS IIIFRRWPVT LEEQTQ
|
| |