Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3757 |
Symbol | |
ID | 6144980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3822264 |
End bp | 3823514 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618583 |
Product | major facilitator superfamily transporter |
Protein accession | YP_001745723 |
Protein GI | 170681523 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACT GTTGTAAAAA TGTGGTGATC CTCATGCCCG AACCCGTAGC CGAACCCGCG CTAAACGGAT TGCGCCTGAA TTTGCGCATT GTCTCCATTG TCATGTTTAA CTTCGCCAGC TACCTCACCA TCGGGTTGCC GCTCGCTGTA TTACCGGGCT ATGTCCATGA TGTGATGGGA TTTAGTGCCT TCTGGGCAGG GTTGGTTATC AGCCTGCAAT ATTTCGCCAC CTTGCTGAGC CGTCCTCATG CCGGACGTTA CGCCGATTTG CTGGGACCCA AAAAGATTGT CGTCTTCGGT TTATGCGGCT GCTTTTTGAG CGGTCTGGGA TATCTGACGG CGGGATTAAC CGCCAGTCTG CCCGTCATCA GCCTGTTATT ACTGTGCCTG GGGCGCGTAA TCCTTGGGAT TGGGCAAAGT TTTGCCGGAA CGGGATCGAC CCTGTGGGGT GTTGGCGTGG TTGGTTCGCT ACATATCGGG CGGGTGATTT CGTGGAACGG TATTGTCACT TACGGGGCGA TGGCGATGGG TGCGCCGTTA GGTGTCGTGT TTTATCACTG GGGCGGATTG CAGGCGTTAG CGTTAATCAT TATGGGCGTG GCGCTGGTGG CCATTTTGTT GGCGATCCCG CGTCCGATGG TAAAAGCCAG TAAAGGCAAA CCGCTGCCGT TTCGTGCGGT GCTTGGGCGC GTCTGGCTGT ACGGTATGGC GCTGGCACTG GCTTCCGCCG GATTTGGCGT TATCGCCACC TTTATCACGC TGTTTTATGA CGCTAAAGGT TGGGACGGTG CGGCTTTCGC GCTGACGCTG TTTAGCTGTG CGTTTGTCGG TACGCGTTTG TTATTCCCTA ACGGCATTAA CCGTATCGGC GGCTTAAACG TGGCGATGAT TTGCTTTAGC GTTGAGATAA TCGGCCTGCT ACTGGTTGGC GTGGCGACTA TGCCGTGGAT GGCGAAAATC GGCGTCTTAC TCGCGGGGGC GGGGTTTTCG CTGGTGTTCC CGGCATTGGG CGTAGTGGCG GTAAAAGCGG TTCCGCAGCA AAATCAGGGG GCGGCGCTGG CAACTTACAC CGTATTTATG GATTTATCGC TTGGCGTGAC CGGACCACTG GCTGGGCTGG TGATGAGTTG GGCGGGCGTC CCGGTGATTT ATCTGGCGGC GGCGGGACTG GTCGCAATCG CGTTATTACT GACGTGGCGA TTAAAAAAAC GGCCTCCGGT GGAAGTACCT GAGGCCATCT CATCATCTTA A
|
Protein sequence | MKHCCKNVVI LMPEPVAEPA LNGLRLNLRI VSIVMFNFAS YLTIGLPLAV LPGYVHDVMG FSAFWAGLVI SLQYFATLLS RPHAGRYADL LGPKKIVVFG LCGCFLSGLG YLTAGLTASL PVISLLLLCL GRVILGIGQS FAGTGSTLWG VGVVGSLHIG RVISWNGIVT YGAMAMGAPL GVVFYHWGGL QALALIIMGV ALVAILLAIP RPMVKASKGK PLPFRAVLGR VWLYGMALAL ASAGFGVIAT FITLFYDAKG WDGAAFALTL FSCAFVGTRL LFPNGINRIG GLNVAMICFS VEIIGLLLVG VATMPWMAKI GVLLAGAGFS LVFPALGVVA VKAVPQQNQG AALATYTVFM DLSLGVTGPL AGLVMSWAGV PVIYLAAAGL VAIALLLTWR LKKRPPVEVP EAISSS
|
| |