Gene EcE24377A_2653 details


Gene Information

Locus tag: EcE24377A_2653
Symbol: emrY
ID: 5590262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2644147
End bp: 2645685
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 42%
IMG OID: 640926309
Product: DHA2 family drug:H+ antiporter-1
Protein accession: YP_001463702
Protein GI: 157157326
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
            [TIGR01168] Gram-positive signal peptide, YSIRK family
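
The length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2644147, 2645685)
print(length)                  # 1539
print(protein_length(length))  # 512
```

Both values match the Gene Length and Protein Length fields listed in the record.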


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATCA CTAAATCGAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTTACT 
ATTGCATTGT CATTAGCGAC GTTTATGCAA ATGTTGGATT CCACTATTTC TAACGTCGCA
ATACCGACAA TATCTGGCTT TCTGGGAGCA TCAACAGACG AAGGCACCTG GGTTATCACC
TCGTTTGGCG TAGCAAATGC CATTGCGATC CCTGTTACCG GCAGGCTGGC ACAAAGAATA
GGCGAATTAA GGTTATTTTT ACTTTCAGTC ACTTTTTTTT CGCTGTCTTC CTTAATGTGT
AGTTTATCGA CCAATCTTGA TGTGCTGATA TTTTTTAGAG TCGTTCAGGG ATTAATGGCG
GGGCCGTTAA TTCCACTGTC ACAGAGTTTA TTATTAAGGA ATTATCCACC AGAAAAAAGA
ACATTTGCTC TGGCATTATG GTCAATGACC GTGATTATCG CTCCGATATG TGGGCCGATA
TTGGGCGGTT ATATTTGCGA TAACTTTAGC TGGGGTTGGA TATTTTTAAT CAATGTCCCT
ATGGGGATTA TCGTCCTGAC ATTATGCTTA ACCTTACTTA AAGGAAGAGA AACTGAAACT
TCACCGGTCA AAATGAATCT ACCAGGACTG ACCCTGTTAG TGCTCGGTGT TGGTGGCTTG
CAAATTATGC TTGATAAAGG GCGCGATCTG GATTGGTTCA ACTCGAGTAC AATAATAATA
TTAACAGTAG TATCAGTTAT TTCTCTGATC TCTTTAGTCA TTTGGGAGTC GACCTCAGAG
AACCCGATTC TTGATCTCAG TTTGTTTAAG TCCCGTAACT TCACCATTGG TATTGTGAGT
ATCACATGCG CGTATTTATT TTACTCTGGA GCGATCGTCC TTATGCCGCA GTTACTCCAG
GAAACGATGG GGTATAATGC GATATGGGCC GGACTTGCTT ATGCGCCCAT CGGCATCATG
CCACTATTAA TTTCACCTTT GATAGGACGT TATGGCAACA AAATAGACAT GCGGTTGTTA
GTGACATTTA GTTTTTTGAT GTATGCGGTT TGCTATTACT GGCGTTCTGT GACATTTATG
CCAACGATTG ATTTTACAGG CATCATTTTG CCACAGTTTT TCCAGGGATT CGCCGTTGCC
TGTTTCTTTT TACCCTTAAC AACGATTTCG TTTTCAGGCT TGCCAGATAA TAAATTTGCC
AATGCCTCGA GTATGAGTAA TTTTTTTCGT ACCTTGTCAG GATCAGTTGG TACGTCGTTG
ACAATGACGC TGTGGGGACG ACGCGAATCG TTACACCATA GTCAGTTGAC AGCAACCATC
GATCAATTTA ACCCCGTGTT TAATTCATCG TCACAAATTA TGGATAAATA CTATGGTTCG
CTTTCAGGAG TTCTTAATGA AATTAATAAT GAAATAACCC AGCAGTCACT TTCTATTTCT
GCAAATGAGA TTTTCCGTAT GGCGGCTATT GCTTTTATCT TACTTACGGT TTTGGTTTGG
TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA
 
Protein sequence
MAITKSTPAP LTGGTLWCVT IALSLATFMQ MLDSTISNVA IPTISGFLGA STDEGTWVIT 
SFGVANAIAI PVTGRLAQRI GELRLFLLSV TFFSLSSLMC SLSTNLDVLI FFRVVQGLMA
GPLIPLSQSL LLRNYPPEKR TFALALWSMT VIIAPICGPI LGGYICDNFS WGWIFLINVP
MGIIVLTLCL TLLKGRETET SPVKMNLPGL TLLVLGVGGL QIMLDKGRDL DWFNSSTIII
LTVVSVISLI SLVIWESTSE NPILDLSLFK SRNFTIGIVS ITCAYLFYSG AIVLMPQLLQ
ETMGYNAIWA GLAYAPIGIM PLLISPLIGR YGNKIDMRLL VTFSFLMYAV CYYWRSVTFM
PTIDFTGIIL PQFFQGFAVA CFFLPLTTIS FSGLPDNKFA NASSMSNFFR TLSGSVGTSL
TMTLWGRRES LHHSQLTATI DQFNPVFNSS SQIMDKYYGS LSGVLNEINN EITQQSLSIS
ANEIFRMAAI AFILLTVLVW FAKPPFTAKG VG
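
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted as plain text with the spaces and line breaks shown above, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGGCAATCA CTAAATCGAC TCCGGCACCA"))  # MAITKSTPAP
```

Running `translate` on the full gene sequence should yield the protein sequence listed above, and `gc_fraction` on the same input gives the value that the GC content field reports as a rounded percentage.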