Gene EcSMS35_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0872 
Symbol 
ID6146471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp878763 
End bp879971 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content54% 
IMG OID641615760 
Productmajor facilitator transporter 
Protein accessionYP_001742952 
Protein GI170679939 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTAA ATTCTTCACG TAATGCATTG AAACGCCGAA CCTGGGCGCT GTTTATGTTC 
TTCTTTTTGC CAGGCCTGTT AATGGCGTCC TGGGCAACCC GTACACCTGC TATCCGCGAC
ATTCTTTCTG TCTCGATCGC TGAAATGGGT GGAGTCCTCT TTGGTCTGTC GATCGGTTCA
ATGAGCGGTA TTCTCTGCTC GGCGTGGTTA GTGAAACGCT TTGGAACGCG TAATGTCATC
CTGGTCACGA TGTCCTGCGC ATTGATCGGG ATGATGATAC TAAGTCTGGC ACTCTGGCTG
ACATCGCCCC TGCTCTTCGC GGTTGGTCTC GGCGTCTTTG GGGCAAGTTT TGGTTCTGCG
GAAGTGGCGA TAAACGTTGA AGGTGCCGCC GTTGAGCGAG AAATGAATAA AACGGTTTTG
CCGATGATGC ACGGTTTTTA TAGCCTGGGC ACGCTGGCAG GCGCTGGTGT CGGGATGGCA
CTGACGGCCT TTGGCGTTCC GGCAACGGTG CATATTTTAT TGGCGGCGCT GGTAGGCATC
GCACCTATTT ATATCGCCAT TCAGGCAATC CCTGACGGTA CGGGCAAAAA TGCTGCCGAT
GGCACCCAGC ATGGCGAAAA AGGCGTACCT TTTTATCGCG ATATCCAGTT GCTGTTGATT
GGTGTTGTGG TGCTGGCGAT GGCCTTTGCC GAAGGTTCTG CCAACGACTG GTTACCCTTA
TTAATGGTTG ACGGTCACGG TTTTAGTCCT ACTTCCGGCT CGCTGATTTA TGCCGGTTTT
ACCCTGGGGA TGACTGTTGG ACGCTTTACC GGCGGTTGGT TCATCGACCG TTACAGTCGC
GTTGCCGTGG TTCGGGCCAG TGCACTAATG GGGGCGTTGG GTATTGGGAT GATTATTTTT
GTCGATAGTG CCTGGGTCGC TGGGGTGTCT GTTGTACTTT GGGGACTGGG TGCCTCGTTG
GGCTTCCCGC TGACCATTTC TGCCGCCAGC GATACCGGCC CCGATGCACC GACCCGCGTC
AGCGTGGTAG CAACGACCGG TTATCTGGCT TTCCTCGTTG GGCCGCCGCT GCTGGGCTAT
CTCGGCGAAC ATTATGGATT ACGTAGTGCA ATGCTGGTTG TACTGGCGCT GGTTATTCTC
GCGGCTATTG TCGCGAAAGC CGTCGCCAAA CCCGATACCA AAACGCAGAC GGCGATGGAG
AATAGTTGA
 
Protein sequence
MTVNSSRNAL KRRTWALFMF FFLPGLLMAS WATRTPAIRD ILSVSIAEMG GVLFGLSIGS 
MSGILCSAWL VKRFGTRNVI LVTMSCALIG MMILSLALWL TSPLLFAVGL GVFGASFGSA
EVAINVEGAA VEREMNKTVL PMMHGFYSLG TLAGAGVGMA LTAFGVPATV HILLAALVGI
APIYIAIQAI PDGTGKNAAD GTQHGEKGVP FYRDIQLLLI GVVVLAMAFA EGSANDWLPL
LMVDGHGFSP TSGSLIYAGF TLGMTVGRFT GGWFIDRYSR VAVVRASALM GALGIGMIIF
VDSAWVAGVS VVLWGLGASL GFPLTISAAS DTGPDAPTRV SVVATTGYLA FLVGPPLLGY
LGEHYGLRSA MLVVLALVIL AAIVAKAVAK PDTKTQTAME NS