Gene ECH74115_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2142 
SymbolsotB 
ID6971914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2056040 
End bp2057230 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content51% 
IMG OID643386037 
Productsugar efflux transporter 
Protein accessionYP_002270526 
Protein GI209400312 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC 
GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCTGTTG GCCTGCTCTC TGACATTGCG
CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATCATGT TGACCATTTA CGCATGGGTA
GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGCC AGGTTGAACG GCGCAAATTA
CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT ATCGTGGAGC
TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG
ATTACGGCGT CCCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGAGCACA GGCATTGAGT
TTAATTGCCA CAGGTACAGC ACTGGCAATG GTATTAGGAT TACCTCTCGG GCGCATTGTG
GGGCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATCGGGGC GCTTATTACC
CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCATTGAAA
AGCCTCCCGC TATTATTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTG
GTTGTCACCG CCCATTACAC GGCATACAGC TATATTGAGC CTTTTGTGCA AAACATTGCG
GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT
GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAATATGCGT CTGCGTTGGT AAGTACGGCG
ATTGCGCTGT TGCTGGTGTG CCTGGCACTG CTGCTACCTG CGGCGAACAG TGAAATACAC
CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGA TCATCGGGCT TGGTATGCAG
GTTAAAGTGC TGGCGCTGGC ACCAGATGCC ACCGACGTCG CGATGGCGCT ATTCTCCGGG
ATATTTAATA TTGGAATCGG GGCGGGTGCG TTGGTAGGTA ATCAGGTGAG TCTGCACTTG
TCAATGTCGA TGATTGGTTA TGTGGGCACG GTGCCTGCTT TTGCCGCGTT AATATGGTCA
ATCATTATAT TTCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
 
Protein sequence
MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV 
VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS
ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT
LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA
GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASALVSTA IALLLVCLAL LLPAANSEIH
LGVLSIFWGI AMMIIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHL
SMSMIGYVGT VPAFAALIWS IIIFRRWPVT LEEQTQ