Gene EcSMS35_1537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1537 
Symbol 
ID6146334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1521737 
End bp1522948 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content53% 
IMG OID641616414 
Productinner membrane transport protein YdhC 
Protein accessionYP_001743592 
Protein GI170680618 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0900835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0201427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCTG GGAAAAGATT TTTAGTCTGG CTGGCAGGTT TGAGCGTACT CGGTTTTCTG 
GCAACCGATA TGTATCTGCC TGCTTTCGCC GCCATACAGG CCGACCTGCA AACGCCTGCG
TCTGCTGTCA GTGCCAGCCT TAGTCTGTTC CTTGCCGGAT TTGCCGCAGC CCAGCTTCTG
TGGGGGCCGC TCTCCGACCG TTATGGTCGT AAACCGGTAT TATTCATCGG CCTGACAATT
TTTGCGTTAG GTAGTCTGGG GATGCTGTGG GTAGAAAACG CCGCTACGCT GCTGGTATTG
CGTTTTGTAC AGGCTGTGGG TGTCTGCGCC GCGGCGGTTA TCTGGCAAGC GTTAGTGACG
GATTATTATC CTTCACAGAA AGTTAACCGT ATTTTTGCGA CCATCATGCC GCTGGTGGGT
CTATCTCCGG CACTGGCTCC TCTGTTAGGA AGCTGGCTGC TGGTCCATTT TTCCTGGCAG
GCGATTTTCG CCACCCTGTT TGCCATTACC GTGGTGCTGA TTCTGCCTAT TTTCTGGCTC
AAACCCACGA CGAAGGCCCG TAACAATAGT CAGGATGGTC TGACCTTTAC CGACCTGCTA
CGTTCTAAAA CCTATCGCGG CAACGTGCTG ATATATGCGG CCTGTTCAGC CAGTTTTTTT
GCATGGCTGA CCGGCTCACC GTTCATCCTT AGTGAAATGG GTTACAGCCC GGCAGTTATT
GGTTTAAGTT ATGTCCCGCA AACTATCGCG TTTCTGATTG GTGGTTATGG CTGTCGCGCC
GCACTGCAGA AATGGCAAGG CAAGCAGTTA TTACCGTGGT TGCTGGTACT GTTTGCTGTC
AGCGTCATTG CGACCTGGGC TGCGGGCTTC ATTAGCCATG TGTCGCTAGT CGAAATCCTG
ATCCCATTCT GTGTGATGGC GATTGCCAAC GGTGCGATCT ACCCGATTGT TGTCGCTCAG
GCGCTGCGTC CCTTCCCACA TGCAACTGGT CGCGCCGCAG CGTTGCAGAA CACTCTGCAA
CTGGGTCTGT GCTTCCTCGC AAGTCTGGTA GTTTCCTGGC TTATCAGTAT CAGCACGCCA
TTGCTCACCA CCACCAGCGT GATGTTATCA ACAGTAGTGC TGGTCGCTCT GGGTTACATG
ATGCAACGTT GTGAAGAAGT TGGCTGCCAG AATCATGGCA ATGCCGAAGT CGCTCATAGC
GAATCACACT GA
 
Protein sequence
MQPGKRFLVW LAGLSVLGFL ATDMYLPAFA AIQADLQTPA SAVSASLSLF LAGFAAAQLL 
WGPLSDRYGR KPVLFIGLTI FALGSLGMLW VENAATLLVL RFVQAVGVCA AAVIWQALVT
DYYPSQKVNR IFATIMPLVG LSPALAPLLG SWLLVHFSWQ AIFATLFAIT VVLILPIFWL
KPTTKARNNS QDGLTFTDLL RSKTYRGNVL IYAACSASFF AWLTGSPFIL SEMGYSPAVI
GLSYVPQTIA FLIGGYGCRA ALQKWQGKQL LPWLLVLFAV SVIATWAAGF ISHVSLVEIL
IPFCVMAIAN GAIYPIVVAQ ALRPFPHATG RAAALQNTLQ LGLCFLASLV VSWLISISTP
LLTTTSVMLS TVVLVALGYM MQRCEEVGCQ NHGNAEVAHS ESH