Gene EcSMS35_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1833 
SymboleefD 
ID6144437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1853956 
End bp1855119 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content49% 
IMG OID641616709 
Productmultidrug efflux transport protein EefD 
Protein accessionYP_001743887 
Protein GI170680921 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.531383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0000436585 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTAGAG TCTCTCTTTC ATGGGCATTG ATTCTTGGTC TTTTAGCCGG TATCGGCCCG 
ATGTGTACCG ATCTTTATTT GCCGGCTTTG CCGGAGATGT CTGAGCAACT GGCGGCAACC
ACGACCATAA CGCAATTAAC CCTGACCGCA TCACTGATTG GTCTTGGCGT CGGACAACTG
TTATTTGGCC CTCTGAGCGA CAAAATAGGG CGTAAACGTC CGTTAATCTT GTCGTTGCTA
TTGTTTATTG TTTCTTCCAT TTTGTGCGCG ACAACGAACA ATATTTACTG GCTGGTGGTC
TGGCGTTTTA TTCAAGGGAT CGCGGGGGCG GGTGGTTCGG TGCTCTCTCG TTCTATTGCT
CGTGACAAAT ATCAGGGAGT AACGTTGACC CAGTTTTTTG CGCTGTTAAT GACGGTGAAT
GGCCTGGCAC CGGTGTTGTC GCCAGTGCTG GGCGGGTACA TTGTTAGCAC TTTTGACTGG
CGCACTTTAT TCTGGGTAAT GGCTGAAATT AGCACCGTAC TGTTGCTGGG CTGCCTGTTA
TTTATTAATG AGACCTTGCC AGAAAATAAA AGGGGCTCAT CATTGCTATT AACCGGACGA
AGCGTGGTGC AGAACCGCCG TTTTATGCGC TTTTGCCTGA TTCAAAGTTT TATGCTGGCC
GGTTTGTTTG CATATATCGG CTCTTCGTCG TTCGTGTTGC AGAAGGAATT TGGCTTTAGT
CCAATGCAAT TTAGCCTGGT GTTTGGCCTT AACGGCATCG GACTTATCAT TGCTTCATGG
ATCTTCTCGC GCCTGGCGCG ACGGATTAAC GCGATGACAT TGTTGCGAGG TGGCCTGATA
GCGGCAATTT TGTGTGCATT GCTCACGGTC TTATGCGCAT GGACACAATT GCCCATTCCG
GCACTGGTGG CATTATTTTT CACCATCGCA TTTTGTAGCG GCATCGGCAC TGTTGGCGGG
GCAGAGGCTA TGAGTGCAGT AGGGACGCAG GAATCTGGAA CGGCGTCTGC GTTGATGGGG
ATGAGCATGT TTGTCTTCGG CGGTATAGCC GCGCCATTGT CGGGAATTGG CGGAGAAACA
CTGTTAAAAA TGAGTCTGGC AATTACGGTG TGTTATACGC TGGCATTGCT GGTTGCTCTC
ACCAGAATCG ACAATCAAAA GTAA
 
Protein sequence
MARVSLSWAL ILGLLAGIGP MCTDLYLPAL PEMSEQLAAT TTITQLTLTA SLIGLGVGQL 
LFGPLSDKIG RKRPLILSLL LFIVSSILCA TTNNIYWLVV WRFIQGIAGA GGSVLSRSIA
RDKYQGVTLT QFFALLMTVN GLAPVLSPVL GGYIVSTFDW RTLFWVMAEI STVLLLGCLL
FINETLPENK RGSSLLLTGR SVVQNRRFMR FCLIQSFMLA GLFAYIGSSS FVLQKEFGFS
PMQFSLVFGL NGIGLIIASW IFSRLARRIN AMTLLRGGLI AAILCALLTV LCAWTQLPIP
ALVALFFTIA FCSGIGTVGG AEAMSAVGTQ ESGTASALMG MSMFVFGGIA APLSGIGGET
LLKMSLAITV CYTLALLVAL TRIDNQK