Gene Information |
Locus tag | EcSMS35_0384 |
Symbol | |
ID | 6147081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 399275 |
End bp | 400486 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641615280 |
Product | putative 3-hydroxyphenylpropionic transporter MhpT |
Protein accession | YP_001742487 |
Protein GI | 170680668 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
|
|
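The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 399275
end_bp = 400486

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1212 403
```

Under these assumptions the results agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.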
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACTC GTACCCCTTC ATCATCTTCA TCCCGCCTGA TGCTGACCAT CGGGCTTTGT TTTTTGGTCG CTCTGATGGA AGGGCTGGAT CTTCAGGCGG CTGGCATTGC GGCGGGTGGC ATCGCCCAGG CTTTTGCACT CGATAAAATG CAAATGGGCT GGATATTCAG CGCCGGAATA CTCGGTTTGC TACCCGGCGC GCTGGTTGGC GGGATGCTGG CGGACCGTTA TGGTCGCAAG CGCATTTTGA TTGGCTCAGT TGCGCTGTTT GGTTTGTTCT CACTGGCAAC GGCGATTGCC TGGGATTTCC CCTCACTGGT CTTTGCGCGG CTGATGACCG GTGTCGGGCT GGGGGCGGCG TTGCCGAATC TGATCGCCCT GACGTCTGAA GCCGCGGGTC CACGTTTTCG TGGGACGGCA GTGAGCCTGA TGTATTGCGG TGTTCCCATT GGCGCGGCGC TGGCGGCGAC ACTGGGTTTC GCGGGGGCAA ACTTAGCATG GCAAACGGTG TTTTGGGTAG GTGGTGTGGT GCCGTTGATT CTGGTGCCGC TGTTAATGCG CTGGCTGCCG GAGTCGGCGG TGTTCGCTGG CGAAAAACAG GCCGCGCCAC CACTGCGTGC GTTATTTGCG CCAGAAACGG CTACCGCGAC GCTGCTGCTG TGGTTGTGTT ATTTCTTCAC TCTGCTGGTG GTCTACATGT TGATCAACTG GCTACCGCTG CTTTTGGTGG AGCAAGGATT CCAGCCATCG CAGGCGGCAG GGGTGATGTT TGCACTGCAA ATGGGGGCGG CAAGCGGGAC GTTAATGTTG GGCGCATTGA TGGATAAGCT GCGTCCAGTA ACCATGTCGC TACTGATTTA TAGCGGCATG TTAGCTTCGC TGCTGGCGCT GGGAACGGTG TCGTCATTTA ACGGTATGTT GCTGGCGGGA TTCGTCGCGG GGTTGTTTGC GACAGGTGGG CAAAGCGTTT TGTATGCCCT GGCACCGTTG TTTTACAGTT CGCAGATCCG CGCAACAGGT GTGGGAACAG CCGTGGCGGT AGGGCGTCTG GGGGCTATGA GCGGTCCGTT ACTGGCCGGG AAAATGCTGG CATTAGGCAC TGGCACGGTT GGCGTAATGG CCGCTTCTGC GCCGGGTATT CTTGTTGCCG GCCTGGCAGT GTTTATTTTG ATGAGCCGGA GATCACGAAT GCAGCCGTGT GCAGATGCCT GA
|
Protein sequence | MSTRTPSSSS SRLMLTIGLC FLVALMEGLD LQAAGIAAGG IAQAFALDKM QMGWIFSAGI LGLLPGALVG GMLADRYGRK RILIGSVALF GLFSLATAIA WDFPSLVFAR LMTGVGLGAA LPNLIALTSE AAGPRFRGTA VSLMYCGVPI GAALAATLGF AGANLAWQTV FWVGGVVPLI LVPLLMRWLP ESAVFAGEKQ AAPPLRALFA PETATATLLL WLCYFFTLLV VYMLINWLPL LLVEQGFQPS QAAGVMFALQ MGAASGTLML GALMDKLRPV TMSLLIYSGM LASLLALGTV SSFNGMLLAG FVAGLFATGG QSVLYALAPL FYSSQIRATG VGTAVAVGRL GAMSGPLLAG KMLALGTGTV GVMAASAPGI LVAGLAVFIL MSRRSRMQPC ADA
|
| |
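The sequence fields above are printed in space-separated blocks of 10 characters, so they need cleaning before any length or composition check. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database API, and the demo string is only the first two blocks of the Gene sequence field.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the Gene sequence field, used here only as a short demo.
demo = clean_sequence("ATGTCGACTC GTACCCCTTC")
print(len(demo), gc_fraction(demo))
```

Applied to the full Gene sequence field, the cleaned length would be expected to match the Gene Length field, and the GC fraction to be close to the listed GC content, provided the record is internally consistent.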