Gene ECH74115_5189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5189 
Symbol 
ID6971367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4835724 
End bp4837151 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content49% 
IMG OID643388856 
Productdrug resistance MFS transporter, drug:H+ antiporter-1 (DHA2) family 
Protein accessionYP_002273282 
Protein GI209397220 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000921639 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA AAAAGAAGCG CAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC 
TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC
CTTAATCGTT CTCCTCTCGC GATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG
ATGCTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATTTTTACC
CTTGCCGTGA GTCTGTTCAC ATTGGGTTCT CTGGCCTGCG CACTTTCTAA TTCGCTACCA
CAGCTGGTTG TCTTCCGGGT TATTCAGGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT
CGGCTGGCCT TACTGCGCGC TTATCCTCGT AATGAACTTC TTCCAGTATT GAATTTTGTC
GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCATTC TTGGCGGCGT GCTGGTCACC
TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC GGGCCTTCTT
TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC
TTTTTGCTGT TTGGCCTCAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA
AAGATTGTCG CCAGCTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT
CTCTATATTC TCCATGCACG ACGCACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA
ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGCTGT
GTACCGTTCC TTATGCCATT AATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCT
GGCTGTATGA TGGCACCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA
GTCTTACGTC GTCTGGGCTA TCGCCATACA TTAGTGGGGA TCACGGTGAT TATTGGGCTA
ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCAA TATGGATGCT GATCTTGCCG
TTGTTTATAT TAGGGATGGC TATGTCGACG CAGTTTACCG CGATGAATAC CATCACACTT
GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA
CTGTCTATCA GTTTAGGCGT AGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TGAAGGAATG
GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACAATGGG CATTATTACT
GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA
AGACAGCGTA AATCTAAGCC GAACCACGTT CCATCAGAAT CGGAGTAA
 
Protein sequence
MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA 
MLIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA
RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPILGGVLVT WATWHWIFLI NIPIGIAGLL
YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL
LYILHARRTP NPLISLDLFK TRTFSIGIVG NIATRLGTGC VPFLMPLMLQ VGFGYQAFIA
GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP
LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYEGM
EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNHV PSESE