Gene ECH74115_5514 details


Gene Information

Locus tag: ECH74115_5514
Symbol: malF
ID: 6969860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5161159
End bp: 5162703
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 53%
IMG OID: 643389157
Product: maltose transporter membrane protein
Protein accession: YP_002273554
Protein GI: 209400802
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
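
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database record; it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 5161159
END_BP = 5162703
GENE_LENGTH_BP = 1545
PROTEIN_LENGTH_AA = 514


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
print(computed_length == GENE_LENGTH_BP)                               # True
print(expected_protein_length(computed_length) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the reported gene length (1545 bp) and protein length (514 aa) agree with the start and end coordinates.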


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.957845
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGTTA 
GGTCTGCTCG GCCTGCTGGT GGGTTACCTT GTTGTTTTAA TGTACGCACA AGGGGAATAC
CTGTTCGCCA TTACCACGCT GATATTGAGT TCAGCGGGGC TGTATATTTT CGCCAATCGT
AAAGCCTACG CCTGGCGCTA TGTTTACCCG GGAATGGCTG GAATGGGATT ATTCGTCCTC
TTCCCTCTGG TCTGCACCAT CGCCATTGCC TTCACCAACT ACAGCAGCAC TAACCAGCTG
ACTTTTGAAC GTGCACAGGA AGTGTTGTTA GATCGCTCCT GGCAAGCAGG CAAAACCTAT
AACTTTGGTC TTTACCCGGC GGGCGATGAG TGGCAACTGG CGCTCAGCGA CGGCGAAACC
GGCAAAAATT ACCTCTCCGA CGCTTTTAAA TTTGGCAGAG AGCAAAAACT GCAACTGAAA
GAAACGACCG CCCAGCCCGA AGGCGAACGC GCAAATCTAC GCGTGATTAC CCAGAATCGT
CAGGCGCTGA GTGACATTAC CGCCATTCTG CCGGATGGCA ACAAAGTGAT GATGAGCTCC
CTGCGCCAGT TTTCTGGCAC GCAGCCGCTC TACACACTCG ACGGTGACGG CACGTTGACG
AATAATCAGA GCGGCGTGAA ATATCGTCCG AATAACCAAA TTGGCTTTTA CCAGTCCATT
ACCGCCGACG GCAACTGGGG TGATGAAAAG CTAAGCCCCG GTTACACCGT GACCACCGGC
TGGAAAAACT TTACTCGCGT CTTCACCGAC GAAGGCATTC AGAAACCGTT CCTCGCCATT
TTCGTCTGGA CCGTGGTGTT CTCGCTGATC ACTGTCTTTT TAACGGTGGC AGTCGGCATG
GTTCTGGCGT GTCTGGTGCA GTGGGAAGCG TTGCGCGGCA AAGCGGTCTA TCGCGTCCTG
CTGATTCTGC CCTACGCGGT GCCATCGTTC ATTTCAATCT TGATTTTCAA AGGGTTGTTT
AACCAGAGCT TCGGTGAGAT CAACATGATG TTGAGCGCGC TGTTTGGCGT GAAGCCCGCC
TGGTTCAGCG ATCCGACCAC CGCCCGAACG ATGCTGATTA TCGTCAATAC CTGGCTGGGC
TATCCGTACA TGATGATCCT CTGCATGGGC TTGCTGAAAG CGATTCCGGA CGATTTGTAT
GAAGCCTCAG CAATGGATGG CGCAGGTCCG TTCCAGAACT TCTTTAAGAT TACGCTGCCG
CTGCTGATTA AACCGCTGAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAACAAC
TTCGTGCTGA TTCAACTGTT AACCAACGGC GGCCCGGATC GTCTTGGCAC GACCACGCCA
GCCGGTTATA CCGACCTGCT TGTTAACTAC ACCTACCGCA TCGCTTTTGA AGGCGGCGGG
GGTCAGGACT TCGGTCTGGC AGCAGCAATT GCCACGCTGA TCTTCCTGCT GGTGGGTGCG
CTGGCGATAG TGAACCTGAA AGCCACGCGA ATGAAGTTTG ATTAA
 
Protein sequence
MDVIKKKHWW QSDALKWSVL GLLGLLVGYL VVLMYAQGEY LFAITTLILS SAGLYIFANR 
KAYAWRYVYP GMAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQEVLL DRSWQAGKTY
NFGLYPAGDE WQLALSDGET GKNYLSDAFK FGREQKLQLK ETTAQPEGER ANLRVITQNR
QALSDITAIL PDGNKVMMSS LRQFSGTQPL YTLDGDGTLT NNQSGVKYRP NNQIGFYQSI
TADGNWGDEK LSPGYTVTTG WKNFTRVFTD EGIQKPFLAI FVWTVVFSLI TVFLTVAVGM
VLACLVQWEA LRGKAVYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGVKPA
WFSDPTTART MLIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP
LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVNY TYRIAFEGGG
GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD
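
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the tool used to build the record: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for all 64 codons (table 11 differs only in which codons may act as start codons). Sequences are expected to contain only A, C, G, and T, plus the spaces and line breaks used in the listing; any other character raises a `KeyError` during translation.

```python
# Codon table in NCBI order (first, second, third base each cycling T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGATGTCA TTAAAAAGAA ACATTGGTGG"))  # MDVIKKKHWW
# Last two blocks of the gene sequence, ending in the TAA stop codon.
print(translate("ATGAAGTTTG ATTAA"))                  # MKFD
```

Passing the full gene sequence to `translate` should give the listed protein sequence, and `gc_content` on the same input can be compared with the GC content field in the Gene Information section.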