Gene EcSMS35_4121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4121 
Symbol 
ID6144620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4218984 
End bp4220411 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content49% 
IMG OID641618946 
ProductDHA2 family drug:H+ antiporter-1 
Protein accessionYP_001746084 
Protein GI170679741 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00185105 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.108904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA AAAAGAAGCG CAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC 
TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC
CTTAATCGTT CTCCTCTCGC GATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG
ATGCTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATATTTACC
CTTGCCGTGA GTCTGTTCAC GTTGGGTTCT CTGGCCTGCG CACTTTCTAA CTCGCTACCA
CAGCTGGTTG TCTTCCGGGT TATTCAAGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT
CGGCTGGCCT TACTGCGCGC TTATCCTCGT AATGAGCTTC TTCCTGTATT GAATTTTGTC
GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC
TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC GGGCCTTCTT
TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC
TTTTTGCTGT TTGGCCTTAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA
AAGATTGTCG CCAGTTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT
CTCTATATTC TCCATGCACG ACGCACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA
ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGTTGT
GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC
GGCTGTATGA TGGCACCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA
GTCTTACGTC GTCTGGGCTA TCGCCATACG TTAGTGGGGA TCACGGTGAT TATTGGGCTA
ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG
TTGTTTATAT TAGGGATGGC TATGTCGACG CAATTTACCG CGATGAATAC CATCACACTT
GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA
CTGTCTATCA GTTTAGGCGT TGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TGAAGGGATG
GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACGATGGG CATTATTACT
GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA
AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
 
Protein sequence
MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA 
MLIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA
RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL
YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL
LYILHARRTP NPLISLDLFK TRTFSIGIVG NIATRLGTGC VPFLMPLMLQ VGFGYQAFIA
GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP
LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYEGM
EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE