Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1642 |
Symbol | sotB |
ID | 6144457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1630915 |
End bp | 1632105 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616518 |
Product | sugar efflux transporter |
Protein accession | YP_001743696 |
Protein GI | 170680058 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCAGTTG GCCTGCTCTC TGACATTGCG CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATTATGT TGACCATTTA CGCATGGGTA GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGCC AGGTTGAACG GCGCAAATTA CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT ATCGTGGAGT TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG ATTACGGCGT CTCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGAGCACA GGCATTGAGT TTAATTGCCA CCGGTACTGC ACTGGCGATG GTCTTAGGTT TACCTCTCGG GCGCATTGTG GGGCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATCGGGGC GCTTATCACC CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCGCTGAAA AGCCTCCCGC TATTATTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTA GTTGTCACCG CCCATTACAC GGCATACAGC TATATCGAGC CTTTTGTGCA AAACATTGCG GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAGTATGCGT CTACGCTGGT AAGTACGGCG ATTGCGCTGT TGCTGGTGTG CCTGGCACTG CTGCTACCTG CGGCGAACAG TGAAATACAC CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGA TCATCGGGCT TGGTATGCAG GTTAAAGTGC TGGCGCTGGC ACCAGATGCT ACCGACGTCG CGATGGCGCT ATTCTCCGGG ATATTTAATA TTGGAATCGG CGCGGGTGCG TTGGTAGGTA ATCAGGTCAG TCTGCACTGG TCAATGTCGA TGATTGGTTA TGTGGGCGCG GTGCCTGCTT TTGCCGCGTT AATATGGTCA ATCATCATTT TCCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
|
Protein sequence | MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASTLVSTA IALLLVCLAL LLPAANSEIH LGVLSIFWGI AMMIIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHW SMSMIGYVGA VPAFAALIWS IIIFRRWPVT LEEQTQ
|
| |