Gene EcolC_1804 details

Gene Information

Locus tag: EcolC_1804
Symbol:
ID: 6065836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2004265
End bp: 2005638
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 52%
IMG OID: 641601219
Product: major facilitator transporter
Protein accession: YP_001724781
Protein GI: 170019827
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
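
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 2004265
END_BP = 2005638
GENE_LENGTH_BP = 1374
PROTEIN_LENGTH_AA = 457


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, ASSUMING its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2005638 - 2004265 + 1 gives 1374 bp, and 1374 / 3 - 1 gives 457 aa, matching the listed Gene Length and Protein Length.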


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.673075
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0488273
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC 
ATTGTGATTG GTATTTCGAT GGCTGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA
ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT
CAAATCGCCA TTGTCATCTC CCTGCTCTCG TTTTCGTTTC TGGGCGATAT GTTTGGCTAT
CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT
TCTGATTCGC TGCAAATGCT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG
TTGATGAGCG TTAATACCGC ACTTATCCGC CTGATCTATC CACAACGTTT TCTGGGGAGA
GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT
GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTAGGT
ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCCA ATGGTTCTCG CGCCAGTAAA
CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATCACT
GCGTTGAGTG GTTTCGCTCA GGGGCAATCG CTAACGTTAA TTGCTGCGGA ACTGGTGGTA
ATGGTTGTTG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCCGT ACCGCTGCTA
CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT
TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTGCCCTTTT ACCTGCAAAC CGTGCTCGGG
CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC GATGGTGATG
GCTCCGCTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG TGCTTTAGGG
TTGTTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCCGCGTC ACCTGCGGAT
ATCAATATTA TCTGGCCGAT GATCTTATGT GGCGCTGGAT TTGGCTTATT CCAGTCACCC
AATAACCACA CCATTATTAC CTCCGCTCCG CGCGAACGTA GCGGTGGGGC CAGTGGCATG
TTAGGAACGG CTCGTCTACT GGGTCAGAGT AGTGGCGCGG CGCTGGTGGC GCTGATGCTA
AATCAGTTCG GTGATAATGG TACGCACGTT TCGCTGATGG CTGCGGCTAT TCTGGCAGTG
ATTGCTGCCT GTGTGAGTGG TTTACGTATC ACTCAGCCAC GATCCAGGGC ATAA
 
Protein sequence
MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY 
QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQMLTLA RVIQGFGGAA
LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG
IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQGQS LTLIAAELVV
MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG
RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD
INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML
NQFGDNGTHV SLMAAAILAV IAACVSGLRI TQPRSRA
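
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code. Table 11 additionally permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA"))
```

Passing the first 30 bases of the gene sequence yields the first ten residues of the protein sequence (MPKVQADGLP); the full gene sequence can be pasted in with its spacing intact, since `clean` strips whitespace before translation.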