Gene EcE24377A_4269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4269 
Symbol 
ID5587387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4261082 
End bp4262485 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content50% 
IMG OID640927885 
ProductDHA2 family drug:H+ antiporter-1 
Protein accessionYP_001465244 
Protein GI157157369 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000116309 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGGTT TGCCGTGGAT CGCGGCGATG GCCTTCTTCA TGCAGGCACT TGATGCCACT 
ATTCTGAATA CCGCCTTACC CGCAATCGCT CATAGCCTTA ATCGTTCTCC TCTCGCGATG
CAATCAGCCA TCATCAGTTA TACGCTGACG GTGGCGATGC TTATTCCGGT AAGCGGATGG
CTAGCCGATC GCTTCGGTAC GCGTCGCATT TTTACCCTTG CCGTGAGTCT GTTCACATTG
GGTTCTCTGG CCTGCGCACT TTCTAATTCG CTACCACAGC TGGTTGTCTT CCGGGTTATT
CAGGGGATAG GCGGCGCAAT GATGATGCCT GTTGCTCGGC TGGCCTTACT GCGCGCTTAT
CCTCGTAATG AACTTCTTCC AGTATTGAAT TTTGTCGCCA TGCCGGGTCT GGTGGGGCCA
ATTTTAGGCC CCGTTCTTGG CGGCGTGCTG GTCACCTGGG CAACCTGGCA CTGGATATTT
TTAATCAATA TCCCCATAGG TATTGCGGGC CTTCTTTACG CGCGCAAACA TATGCCCAAT
TTCACCACCG CACGACGCAG ATTCGATATC ACTGGCTTTT TGCTGTTTGG CCTCAGCCTT
GTTCTCTTCT CAAGCGGAAT AGAGCTATTC GGGGAAAAGA TTGTCGCCAG CTGGATTGCC
TTGACGGTAA TTGTCACCAG CATCGGGTTA CTGCTTCTCT ATATTCTCCA TGCGCGACAC
ACGCCAAACC CATTAATTTC ATTAGATTTA TTTAAAACCC GCACTTTCTC GATCGGTATC
GTAGGCAATA TTGCAACCCG TCTGGGGACC GGTTGTGTAC CGTTCCTTAT GCCATTGATG
TTACAGGTAG GATTTGGTTA TCAGGCGTTT ATTGCCGGCT GTATGATGGC ACCGACAGCG
TTAGGTTCCA TTATTGCAAA ATCGATGGTT ACCCAAGTCT TACGTCGTCT GGGCTATCGC
CATACGTTAG TGGGGATCAC GGTGATTATT GGGCTAATGA TCGCTCAGTT CTCTTTGCAA
TCACCGGCAA TGGCGATATG GATGCTGATC TTGCCGTTGT TTATATTAGG GATGGCTATG
TCGACGCAAT TTACCGCGAT GAATACCATC ACACTTGCCG ATCTGACCGA TGACAACGCC
AGCAGCGGTA ACAGTGTTCT GGCGGTCACG CAGCAACTGT CGATTAGTTT AGGCGTTGCT
GTAAGTGCGG CCGTCCTTCG CGTTTATGAA GGGATGGAAG GCACAACGAC TGTCGAACAA
TTCCACTATA CGTTTATCAC GATGGGCATT ATTACTGTTG CTTCAGCAGC AATGTTCATG
CTTCTGAAAA CAACCGATGG TAATAATTTG ATCAAAAGAC AGCGTAAATC TAAGCCGAAC
CGCGTTCCAT CAGAATCGGA GTAA
 
Protein sequence
MAGLPWIAAM AFFMQALDAT ILNTALPAIA HSLNRSPLAM QSAIISYTLT VAMLIPVSGW 
LADRFGTRRI FTLAVSLFTL GSLACALSNS LPQLVVFRVI QGIGGAMMMP VARLALLRAY
PRNELLPVLN FVAMPGLVGP ILGPVLGGVL VTWATWHWIF LINIPIGIAG LLYARKHMPN
FTTARRRFDI TGFLLFGLSL VLFSSGIELF GEKIVASWIA LTVIVTSIGL LLLYILHARH
TPNPLISLDL FKTRTFSIGI VGNIATRLGT GCVPFLMPLM LQVGFGYQAF IAGCMMAPTA
LGSIIAKSMV TQVLRRLGYR HTLVGITVII GLMIAQFSLQ SPAMAIWMLI LPLFILGMAM
STQFTAMNTI TLADLTDDNA SSGNSVLAVT QQLSISLGVA VSAAVLRVYE GMEGTTTVEQ
FHYTFITMGI ITVASAAMFM LLKTTDGNNL IKRQRKSKPN RVPSESE