Gene EcHS_A1919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1919 
Symbol 
ID5595128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1929253 
End bp1930626 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content52% 
IMG OID640921062 
Productmajor facilitator transporter 
Protein accessionYP_001458613 
Protein GI157161295 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000000591664 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC 
ATTGTGATTG GTATTTCGAT GGCTGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA
ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT
CAAATCGCCA TTGTCATCTC CCTGCTCTCG TTTTCGTTTC TGGGCGATAT GTTTGGCTAT
CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT
TCTGATTCGC TGCAAATGCT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG
TTGATGAGCG TTAATACCGC ACTTATCCGC CTGATCTATC CACAACGTTT TCTGGGGAGA
GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT
GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTAGGT
ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCCA ATGGTTCTCG CGCCAGTAAA
CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATCACT
GCGTTGAGTG GTTTCGCTCA GGGGCAATCG CTAACGTTAA TTGCTGCGGA ACTGGTGGTA
ATGGTTGTTG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCCGT ACCGCTGCTA
CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT
TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTGCCCTTTT ACCTGCAAAC CGTGCTCGGG
CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC GATGGTGATG
GCTCCGCTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG TGCTTTAGGG
TTGTTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCCGCGTC ACCTGCGGAT
ATCAATATTA TCTGGCCGAT GATCTTATGT GGCGCTGGAT TTGGCTTATT CCAGTCACCC
AATAACCACA CCATTATTAC CTCCGCTCCG CGCGAACGTA GCGGTGGGGC CAGTGGCATG
TTAGGAACGG CTCGTCTACT GGGTCAGAGT AGTGGCGCGG CGCTGGTGGC GCTGATGCTA
AATCAGTTCG GTGATAATGG TACGCACGTT TCGCTGATGG CTGCGGCTAT TCTGGCAGTG
ATTGCTGCCT GTGTGAGTGG TTTACGTATC ACTCAGCCAC GATCCAGGGC ATAA
 
Protein sequence
MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY 
QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQMLTLA RVIQGFGGAA
LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG
IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQGQS LTLIAAELVV
MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG
RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD
INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML
NQFGDNGTHV SLMAAAILAV IAACVSGLRI TQPRSRA