Gene EcSMS35_2942 details

Gene Information

Locus tag: EcSMS35_2942
Symbol: fucP
ID: 6143128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3017534
End bp: 3018850
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 47%
IMG OID: 641617811
Product: L-fucose transporter
Protein accession: YP_001744966
Protein GI: 170683163
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0738] Fucose permease
TIGRFAM ID: [TIGR00885] L-fucose:H+ symporter permease
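
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3017534
END_BP = 3018850
GENE_LENGTH_BP = 1317
PROTEIN_LENGTH_AA = 438


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported Gene Length and Protein Length.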


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.643765
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAACA CATCAATACA AACGCAGAGT TACCGTGCGG TAGATAAAGA TGCAGGGCAA 
AGCAGAAGTT ACATTATTCC ATTTGCACTG CTGTGCTCAC TGTTTTTTCT TTGGGCGGTA
GCCAATAACC TTAACGACAT TTTATTGCCT CAATTCCAGC AGGCTTTTAC GCTGACAAAT
TTCCAGGCTG GCCTGATCCA ATCGGCCTTT TACTTTGGTT ATTTCATTAT CCCAATCCCT
GCCGGGATAT TGATGAAAAA ACTCAGTTAT AAAGCAGGGA TTATTACCGG ATTGTTTTTA
TATGCCTTCG GCGCTGCATT ATTCTGGCCC GCCGCAGAAA TAATGAACTA CACCTTGTTT
TTAGTTGGCC TATTTATTAT TGCAGCCGGA TTAGGTTGTC TGGAAACTGC CGCAAACCCT
TTTGTTACGG TATTAGGGCC GGAAAGCAGT GGTCACTTCC GCTTAAATCT TGCGCAAACA
TTTAACTCAT TTGGCGCAAT TATCGCGGTT GTCTTTGGGC AAAGTCTTAT TTTGTCTAAC
GTGCCACATC AATCGCAAGA CGTTCTCGAT AAAATGTCTC CAGAGCAATT GAGTGCGTAT
AAACACAGCC TGGTATTATC GGTACAGACA CCTTATATGA TCATCGTGGC TATCGTGTTA
CTGGTCGCCC TGCTGATCAT GCTGACGAAA TTCCCGGCAT TGCAGAGTGA TAATCACAGT
GACGCCAAAC AAGGATCGTT CTCCGCATCG CTTTCTCGCC TGGCGCGTAT TCGCCACTGG
CGCTGGGCAG TATTAGCGCA ATTCTGCTAC GTTGGCGCAC AAACGGCCTG CTGGAGCTAT
TTAATTCGCT ACGCTGTAGA AGAAATTCCA GGTATGACAG CAGGCTTTGC CGCTAACTAT
TTAACCGGAA CCATGGTGTG CTTCTTTATT GGTCGTTTCA CCGGTACCTG GCTCATCAGT
CGCTTCGCAC CACACAAAGT CCTGGCAGCC TACGCATTAA TTGCTATGGC ACTGTGCCTG
ATCTCAGCCT TCGCTGGCGG TCATGTGGGT TTAATAGCCC TGACTTTATG CAGTGCCTTT
ATGTCGATTC AGTACCCAAC AATCTTCTCG CTGGGCATTA AGAATCTCGG CCAGGACACC
AAATACGGTT CGTCCTTCAT CGTTATGACC ATTATTGGCG GCGGTATTGT CACTCCGGTC
ATGGGTTTTG TCAGTGACGC TGCGGGCAAC ATCCCCACAG CTGAACTGAT CCCCGCACTC
TGCTTCGCAG TCATCTTTAT CTTTGCCCGT TTCCGTTCTC AAACGGCAAC TAACTGA
 
Protein sequence
MGNTSIQTQS YRAVDKDAGQ SRSYIIPFAL LCSLFFLWAV ANNLNDILLP QFQQAFTLTN 
FQAGLIQSAF YFGYFIIPIP AGILMKKLSY KAGIITGLFL YAFGAALFWP AAEIMNYTLF
LVGLFIIAAG LGCLETAANP FVTVLGPESS GHFRLNLAQT FNSFGAIIAV VFGQSLILSN
VPHQSQDVLD KMSPEQLSAY KHSLVLSVQT PYMIIVAIVL LVALLIMLTK FPALQSDNHS
DAKQGSFSAS LSRLARIRHW RWAVLAQFCY VGAQTACWSY LIRYAVEEIP GMTAGFAANY
LTGTMVCFFI GRFTGTWLIS RFAPHKVLAA YALIAMALCL ISAFAGGHVG LIALTLCSAF
MSIQYPTIFS LGIKNLGQDT KYGSSFIVMT IIGGGIVTPV MGFVSDAAGN IPTAELIPAL
CFAVIFIFAR FRSQTATN
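
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of translation table 11 (identical to the standard code for sense codons), strips the spaces used for 10-base grouping, and stops at the first stop codon. `translate` and `clean` are hypothetical helper names, and only the first and last stretches of the gene are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses these
# same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first 20 residues of the protein.
GENE_START = "ATGGGAAACA CATCAATACA AACGCAGAGT TACCGTGCGG TAGATAAAGA TGCAGGGCAA"
PROTEIN_START = "MGNTSIQTQS YRAVDKDAGQ"

# Final 18 bases of the gene sequence, including the stop codon.
GENE_END = "CAAACGGCAACTAACTGA"

if __name__ == "__main__":
    print(translate(GENE_START) == clean(PROTEIN_START))
    print(translate(GENE_END))
```

Passing the full gene sequence to `translate` in the same way should reproduce the full protein sequence if the record is internally consistent.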