Gene EcSMS35_1359 details

Gene Information

Locus tag: EcSMS35_1359
Symbol:
ID: 6146574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1348444
End bp: 1349817
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 52%
IMG OID: 641616237
Product: major facilitator transporter
Protein accession: YP_001743417
Protein GI: 170680826
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
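
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1348444
end_bp = 1349817

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values reproduce the 1374 bp gene length and the 457 aa protein length given in the record.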


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000544866
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0171999
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC 
ATTGTGATTG GTATTTCGAT GGCTGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA
ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT
CAAATCGCCA TTGTCATCTC CCTGCTCTCA TTTTCGTTTC TGGGCGATAT GTTTGGCTAT
CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT
TCTGATTCGC TGCAAATGCT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG
TTGATGAGCG TTAATACCGC ACTTATCCGC CTGATCTATC CACAACGTTT TCTGGGTAGA
GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT
GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTGGGT
ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCCA ATGGTTCTCG CGCCAGTAAA
CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATTACT
GCATTGAGTG GTTTCGCTCA GAGGCAATCG CTGACATTGA TTGGTGCGGA ACTGGTGGTA
ATGGTTGTCG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCCGT ACCGCTGCTA
CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT
TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTTCCCTTTT ACCTGCAAAC CGTGCTCGGG
CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC AATGGTGATG
GCTCCACTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG TGCTTTAGGG
TTATTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCAGCGTC ACCTGCGGAT
ATCAATATTA TCTGGCCGAT GATCTTATGT GGTGCTGGAT TTGGCTTGTT CCAGTCACCC
AATAACCACA CCATTATTAC CTCCGCTCCT CGCGAACGTA GCGGTGGAGC CAGTGGCATG
TTAGGGACGG CTCGTCTTCT GGGTCAGAGT AGCGGCGCGG CTCTGGTAGC GCTGATGCTA
AATCAGTTTG GTGATAATGG TACGCACGTC TCGCTGATGG CTGCGGCTAT TCTGGCGGTG
ATTGCAGCCT GTGTCAGTGG TTTACGTATC ACTCAGCCAC GATCCATGGC ATAA
 
Protein sequence
MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY 
QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQMLTLA RVIQGFGGAA
LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG
IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQRQS LTLIGAELVV
MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG
RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD
INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML
NQFGDNGTHV SLMAAAILAV IAACVSGLRI TQPRSMA
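
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted sequence might be cleaned, translated, and scanned for GC content. It uses only the first three blocks of the gene sequence, and the codon table is deliberately partial: it holds just the codons needed for this excerpt, which have the same meaning in translation table 11 as in the standard code. The function names are hypothetical.

```python
# Partial codon table: only the codons that occur in the excerpt below.
CODONS = {
    "ATG": "M", "CCA": "P", "AAA": "K", "GTT": "V", "CAG": "Q",
    "GCC": "A", "GAC": "D", "GGC": "G", "CTG": "L",
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display formatting."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons; raises KeyError for codons not in CODONS."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))

# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA"

dna = clean(first_blocks)
peptide = translate(dna)
gc = gc_fraction(dna)

print(peptide)
print(f"GC fraction of excerpt: {gc:.2f}")
```

The translated excerpt corresponds to the first block of the protein sequence (MPKVQADGLP). The GC fraction printed here covers only these 30 bases, so it is not expected to equal the 52% reported for the whole gene. Running the same functions on the full sequence would require a complete codon table.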