Gene EcHS_A2945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2945 
SymbolfucP 
ID5593999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2952444 
End bp2953760 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content48% 
IMG OID640922063 
ProductL-fucose transporter 
Protein accessionYP_001459573 
Protein GI157162255 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID[TIGR00885] L-fucose:H+ symporter permease 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value0.824406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAACA CATCAATACA AACGCAGAGT TACCGTGCGG TAGATAAAGA TGCAGGGCAA 
AGCAGAAGTT ACATTATTCC ATTCGCGCTG CTGTGCTCAC TGTTTTTTCT TTGGGCGGTA
GCCAATAACC TTAACGACAT TTTATTACCT CAATTCCAGC AGGCTTTTAC GCTGACAAAT
TTCCAGGCTG GCCTGATCCA ATCGGCCTTT TACTTTGGTT ATTTCATTAT CCCAATCCCT
GCTGGGATAT TGATGAAAAA ACTCAGTTAT AAAGCAGGGA TTATTACCGG ATTATTTTTA
TATGCCTTGG GCGCTGCATT ATTCTGGCCC GCTGCAGAAA TAATGAACTA CACCTTATTT
TTAGTTGGCC TATTTATTAT TGCAGCCGGA TTAGGTTGTC TGGAAACTGC CGCAAACCCT
TTTGTTACGG TATTAGGGCC GGAAAGCAGT GGTCACTTCC GCTTAAATCT TGCGCAAACA
TTTAACTCGT TTGGCGCAAT TATCGCGGTT GTCTTTGGGC AAAGTCTTAT TTTGTCTAAC
GTGCCACATC AATCGCAAGA CGTTCTCGAT AAAATGTCTC CAGAGCAATT GAGTGCGTAT
AAACACAGCC TGGTATTATC GGTACAGACA CCTTATATGA TCATCGTGGC TATCGTGTTA
CTGGTCGCCC TGCTGATCAT GCTGACGAAA TTCCCGGCAT TGCAGAGTGA TAATCACAGT
GACGCCAAAC AAGGATCGTT CTCCGCATCG CTTTCTCGTC TGGCGCGTAT TCGTCACTGG
CGCTGGGCGG TATTAGCGCA ATTCTGCTAT GTCGGCGCAC AAACGGCCTG CTGGAGCTAT
TTGATTCGCT ACGCTGTAGA AGAAATTCCA GGTATGACTG CAGGCTTTGC CGCTAACTAT
TTAACCGGAA CCATGGTGTG CTTCTTTATT GGTCGTTTCA CCGGTACCTG GCTCATCAGT
CGCTTCGCAC CACACAAAGT CCTGGCCGCC TACGCATTAA TCGCTATGGC ACTGTGCCTG
ATCTCAGCCT TCGCTGGCGG TCATGTGGGC TTAATAGCCC TGACTTTATG CAGCGCCTTT
ATGTCGATTC AGTACCCAAC AATCTTCTCG CTGGGCATTA AGAATCTCGG CCAGGACACC
AAATACGGTT CGTCCTTCAT CGTTATGACC ATCATTGGCG GCGGTATTGT CACTCCGGTC
ATGGGTTTTG TCAGTGACGC GGCGGGCAAC ATCCCCACTG CTGAACTGAT CCCCGCACTC
TGCTTCGCGG TCATCTTTAT CTTTGCCCGT TTCCGTTCTC AAACGGCAAC TAACTGA
 
Protein sequence
MGNTSIQTQS YRAVDKDAGQ SRSYIIPFAL LCSLFFLWAV ANNLNDILLP QFQQAFTLTN 
FQAGLIQSAF YFGYFIIPIP AGILMKKLSY KAGIITGLFL YALGAALFWP AAEIMNYTLF
LVGLFIIAAG LGCLETAANP FVTVLGPESS GHFRLNLAQT FNSFGAIIAV VFGQSLILSN
VPHQSQDVLD KMSPEQLSAY KHSLVLSVQT PYMIIVAIVL LVALLIMLTK FPALQSDNHS
DAKQGSFSAS LSRLARIRHW RWAVLAQFCY VGAQTACWSY LIRYAVEEIP GMTAGFAANY
LTGTMVCFFI GRFTGTWLIS RFAPHKVLAA YALIAMALCL ISAFAGGHVG LIALTLCSAF
MSIQYPTIFS LGIKNLGQDT KYGSSFIVMT IIGGGIVTPV MGFVSDAAGN IPTAELIPAL
CFAVIFIFAR FRSQTATN