Gene EcDH1_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0887 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp948212 
End bp949528 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content48% 
IMG OID 
ProductL-fucose transporter 
Protein accessionACX38570 
Protein GI260448148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAACA CATCAATACA AACGCAGAGT TACCGTGCGG TAGATAAAGA TGCAGGGCAA 
AGCAGAAGTT ACATTATTCC ATTCGCGCTG CTGTGCTCAC TGTTTTTTCT TTGGGCGGTA
GCCAATAACC TTAACGACAT TTTATTACCT CAATTCCAGC AGGCTTTTAC GCTGACAAAT
TTCCAGGCTG GCCTGATCCA ATCGGCCTTT TACTTTGGTT ATTTCATTAT CCCAATCCCT
GCTGGGATAT TGATGAAAAA ACTCAGTTAT AAAGCAGGGA TTATTACCGG GTTATTTTTA
TATGCCTTGG GTGCTGCATT ATTCTGGCCC GCCGCAGAAA TAATGAACTA CACCTTGTTT
TTAGTTGGCC TATTTATTAT TGCAGCCGGA TTAGGTTGTC TGGAAACTGC CGCAAACCCT
TTTGTTACGG TATTAGGGCC GGAAAGTAGT GGTCACTTCC GCTTAAATCT TGCGCAAACA
TTTAACTCGT TTGGCGCAAT TATCGCGGTT GTCTTTGGGC AAAGTCTTAT TTTGTCTAAC
GTGCCACATC AATCGCAAGA CGTTCTCGAT AAAATGTCTC CAGAGCAATT GAGTGCGTAT
AAACACAGCC TGGTATTATC GGTACAGACA CCTTATATGA TCATCGTGGC TATCGTGTTA
CTGGTCGCCC TGCTGATCAT GCTGACGAAA TTCCCGGCAT TGCAGAGTGA TAATCACAGT
GACGCCAAAC AAGGATCGTT CTCCGCATCG CTTTCTCGCC TGGCGCGTAT TCGCCACTGG
CGCTGGGCGG TATTAGCGCA ATTCTGCTAT GTCGGCGCAC AAACGGCCTG CTGGAGCTAT
TTGATTCGCT ACGCTGTAGA AGAAATTCCA GGTATGACTG CAGGCTTTGC CGCTAACTAT
TTAACCGGAA CCATGGTGTG CTTCTTTATT GGTCGTTTCA CCGGTACCTG GCTCATCAGT
CGCTTCGCAC CACACAAAGT CCTGGCCGCC TACGCATTAA TCGCTATGGC ACTGTGCCTG
ATCTCAGCCT TCGCTGGCGG TCATGTGGGC TTAATAGCCC TGACTTTATG CAGCGCCTTT
ATGTCGATTC AGTACCCAAC AATCTTCTCG CTGGGCATTA AGAATCTCGG CCAGGACACC
AAATATGGTT CGTCCTTCAT CGTTATGACC ATTATTGGCG GCGGTATTGT CACTCCGGTC
ATGGGTTTTG TCAGTGACGC GGCGGGCAAC ATCCCCACTG CTGAACTGAT CCCCGCACTC
TGCTTCGCGG TCATCTTTAT CTTTGCCCGT TTCCGTTCTC AAACGGCAAC TAACTGA
 
Protein sequence
MGNTSIQTQS YRAVDKDAGQ SRSYIIPFAL LCSLFFLWAV ANNLNDILLP QFQQAFTLTN 
FQAGLIQSAF YFGYFIIPIP AGILMKKLSY KAGIITGLFL YALGAALFWP AAEIMNYTLF
LVGLFIIAAG LGCLETAANP FVTVLGPESS GHFRLNLAQT FNSFGAIIAV VFGQSLILSN
VPHQSQDVLD KMSPEQLSAY KHSLVLSVQT PYMIIVAIVL LVALLIMLTK FPALQSDNHS
DAKQGSFSAS LSRLARIRHW RWAVLAQFCY VGAQTACWSY LIRYAVEEIP GMTAGFAANY
LTGTMVCFFI GRFTGTWLIS RFAPHKVLAA YALIAMALCL ISAFAGGHVG LIALTLCSAF
MSIQYPTIFS LGIKNLGQDT KYGSSFIVMT IIGGGIVTPV MGFVSDAAGN IPTAELIPAL
CFAVIFIFAR FRSQTATN