Gene EcSMS35_0384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0384 
Symbol 
ID6147081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp399275 
End bp400486 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content58% 
IMG OID641615280 
Productputative 3-hydroxyphenylpropionic transporter MhpT 
Protein accessionYP_001742487 
Protein GI170680668 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACTC GTACCCCTTC ATCATCTTCA TCCCGCCTGA TGCTGACCAT CGGGCTTTGT 
TTTTTGGTCG CTCTGATGGA AGGGCTGGAT CTTCAGGCGG CTGGCATTGC GGCGGGTGGC
ATCGCCCAGG CTTTTGCACT CGATAAAATG CAAATGGGCT GGATATTCAG CGCCGGAATA
CTCGGTTTGC TACCCGGCGC GCTGGTTGGC GGGATGCTGG CGGACCGTTA TGGTCGCAAG
CGCATTTTGA TTGGCTCAGT TGCGCTGTTT GGTTTGTTCT CACTGGCAAC GGCGATTGCC
TGGGATTTCC CCTCACTGGT CTTTGCGCGG CTGATGACCG GTGTCGGGCT GGGGGCGGCG
TTGCCGAATC TGATCGCCCT GACGTCTGAA GCCGCGGGTC CACGTTTTCG TGGGACGGCA
GTGAGCCTGA TGTATTGCGG TGTTCCCATT GGCGCGGCGC TGGCGGCGAC ACTGGGTTTC
GCGGGGGCAA ACTTAGCATG GCAAACGGTG TTTTGGGTAG GTGGTGTGGT GCCGTTGATT
CTGGTGCCGC TGTTAATGCG CTGGCTGCCG GAGTCGGCGG TGTTCGCTGG CGAAAAACAG
GCCGCGCCAC CACTGCGTGC GTTATTTGCG CCAGAAACGG CTACCGCGAC GCTGCTGCTG
TGGTTGTGTT ATTTCTTCAC TCTGCTGGTG GTCTACATGT TGATCAACTG GCTACCGCTG
CTTTTGGTGG AGCAAGGATT CCAGCCATCG CAGGCGGCAG GGGTGATGTT TGCACTGCAA
ATGGGGGCGG CAAGCGGGAC GTTAATGTTG GGCGCATTGA TGGATAAGCT GCGTCCAGTA
ACCATGTCGC TACTGATTTA TAGCGGCATG TTAGCTTCGC TGCTGGCGCT GGGAACGGTG
TCGTCATTTA ACGGTATGTT GCTGGCGGGA TTCGTCGCGG GGTTGTTTGC GACAGGTGGG
CAAAGCGTTT TGTATGCCCT GGCACCGTTG TTTTACAGTT CGCAGATCCG CGCAACAGGT
GTGGGAACAG CCGTGGCGGT AGGGCGTCTG GGGGCTATGA GCGGTCCGTT ACTGGCCGGG
AAAATGCTGG CATTAGGCAC TGGCACGGTT GGCGTAATGG CCGCTTCTGC GCCGGGTATT
CTTGTTGCCG GCCTGGCAGT GTTTATTTTG ATGAGCCGGA GATCACGAAT GCAGCCGTGT
GCAGATGCCT GA
 
Protein sequence
MSTRTPSSSS SRLMLTIGLC FLVALMEGLD LQAAGIAAGG IAQAFALDKM QMGWIFSAGI 
LGLLPGALVG GMLADRYGRK RILIGSVALF GLFSLATAIA WDFPSLVFAR LMTGVGLGAA
LPNLIALTSE AAGPRFRGTA VSLMYCGVPI GAALAATLGF AGANLAWQTV FWVGGVVPLI
LVPLLMRWLP ESAVFAGEKQ AAPPLRALFA PETATATLLL WLCYFFTLLV VYMLINWLPL
LLVEQGFQPS QAAGVMFALQ MGAASGTLML GALMDKLRPV TMSLLIYSGM LASLLALGTV
SSFNGMLLAG FVAGLFATGG QSVLYALAPL FYSSQIRATG VGTAVAVGRL GAMSGPLLAG
KMLALGTGTV GVMAASAPGI LVAGLAVFIL MSRRSRMQPC ADA