Gene Information |
Locus tag | EcSMS35_0872 |
Symbol | |
ID | 6146471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 878763 |
End bp | 879971 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615760 |
Product | major facilitator transporter |
Protein accession | YP_001742952 |
Protein GI | 170679939 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0738] Fucose permease |
TIGRFAM ID | |
|
|
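The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and both inclusive (an assumption, though it agrees with the reported Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 878763
END_BP = 879971


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end are both inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # 1209 402
```

Both results match the Gene Length (1209 bp) and Protein Length (402 aa) rows. The Strand value (-) does not change the length calculation; it only indicates which strand carries the coding sequence.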
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTAA ATTCTTCACG TAATGCATTG AAACGCCGAA CCTGGGCGCT GTTTATGTTC TTCTTTTTGC CAGGCCTGTT AATGGCGTCC TGGGCAACCC GTACACCTGC TATCCGCGAC ATTCTTTCTG TCTCGATCGC TGAAATGGGT GGAGTCCTCT TTGGTCTGTC GATCGGTTCA ATGAGCGGTA TTCTCTGCTC GGCGTGGTTA GTGAAACGCT TTGGAACGCG TAATGTCATC CTGGTCACGA TGTCCTGCGC ATTGATCGGG ATGATGATAC TAAGTCTGGC ACTCTGGCTG ACATCGCCCC TGCTCTTCGC GGTTGGTCTC GGCGTCTTTG GGGCAAGTTT TGGTTCTGCG GAAGTGGCGA TAAACGTTGA AGGTGCCGCC GTTGAGCGAG AAATGAATAA AACGGTTTTG CCGATGATGC ACGGTTTTTA TAGCCTGGGC ACGCTGGCAG GCGCTGGTGT CGGGATGGCA CTGACGGCCT TTGGCGTTCC GGCAACGGTG CATATTTTAT TGGCGGCGCT GGTAGGCATC GCACCTATTT ATATCGCCAT TCAGGCAATC CCTGACGGTA CGGGCAAAAA TGCTGCCGAT GGCACCCAGC ATGGCGAAAA AGGCGTACCT TTTTATCGCG ATATCCAGTT GCTGTTGATT GGTGTTGTGG TGCTGGCGAT GGCCTTTGCC GAAGGTTCTG CCAACGACTG GTTACCCTTA TTAATGGTTG ACGGTCACGG TTTTAGTCCT ACTTCCGGCT CGCTGATTTA TGCCGGTTTT ACCCTGGGGA TGACTGTTGG ACGCTTTACC GGCGGTTGGT TCATCGACCG TTACAGTCGC GTTGCCGTGG TTCGGGCCAG TGCACTAATG GGGGCGTTGG GTATTGGGAT GATTATTTTT GTCGATAGTG CCTGGGTCGC TGGGGTGTCT GTTGTACTTT GGGGACTGGG TGCCTCGTTG GGCTTCCCGC TGACCATTTC TGCCGCCAGC GATACCGGCC CCGATGCACC GACCCGCGTC AGCGTGGTAG CAACGACCGG TTATCTGGCT TTCCTCGTTG GGCCGCCGCT GCTGGGCTAT CTCGGCGAAC ATTATGGATT ACGTAGTGCA ATGCTGGTTG TACTGGCGCT GGTTATTCTC GCGGCTATTG TCGCGAAAGC CGTCGCCAAA CCCGATACCA AAACGCAGAC GGCGATGGAG AATAGTTGA
|
Protein sequence | MTVNSSRNAL KRRTWALFMF FFLPGLLMAS WATRTPAIRD ILSVSIAEMG GVLFGLSIGS MSGILCSAWL VKRFGTRNVI LVTMSCALIG MMILSLALWL TSPLLFAVGL GVFGASFGSA EVAINVEGAA VEREMNKTVL PMMHGFYSLG TLAGAGVGMA LTAFGVPATV HILLAALVGI APIYIAIQAI PDGTGKNAAD GTQHGEKGVP FYRDIQLLLI GVVVLAMAFA EGSANDWLPL LMVDGHGFSP TSGSLIYAGF TLGMTVGRFT GGWFIDRYSR VAVVRASALM GALGIGMIIF VDSAWVAGVS VVLWGLGASL GFPLTISAAS DTGPDAPTRV SVVATTGYLA FLVGPPLLGY LGEHYGLRSA MLVVLALVIL AAIVAKAVAK PDTKTQTAME NS
|
| |
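The sequence rows above are shown in space-separated blocks of 10 characters. The following is a minimal sketch of how the gene sequence could be checked against the GC content and Protein sequence rows. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, although the Strand field is -) and uses the amino acid assignments of NCBI translation table 11 in their usual T, C, A, G codon ordering. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, ordered by codon
# (first, second, third base each running over T, C, A, G). "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises ValueError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the Gene sequence row above.
    fragment = "ATGACCGTAA ATTCTTCACG"
    print(translate(fragment))  # MTVNSS
```

The fragment translates to MTVNSS, which matches the first six residues of the Protein sequence row. Passing the full Gene sequence row to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content rows.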