Gene EcSMS35_4495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4495 
SymbolmalF 
ID6144757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4591081 
End bp4592625 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content53% 
IMG OID641619311 
Productmaltose transporter membrane protein 
Protein accessionYP_001746423 
Protein GI170684307 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.654176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGTTA 
GGTCTGCTCG GCCTGCTGGT GGGTTACCTT GTTGTTTTAA TGTACGCACA AGGGGAATAC
CTGTTCGCCA TTACCACGCT GATATTGAGT TCAGCGGGGC TGTATATTTT CGCCAATCGT
AAAGCCTACG CCTGGCGCTA TGTTTACCCG GGAATGGCTG GAATGGGATT ATTCGTCCTC
TTCCCTTTGG TCTGCACCAT CGCCATTGCC TTTACCAACT ACAGCAGCAC TAACCAGCTG
ACTTTTGAAC GTGCGCAGGA AGTGTTGTTA GATCGCTCCT GGCAAGCAGG CAAAACCTAT
AATTTTGGTC TTTACCCGGC GGGCGATGAG TGGCAACTGG CGCTCAGCGA CGGCGAAACC
GGCAAAAATT ACCTCTCCGA CGCGTTTAAA TTTGGCGGCG AGCAAAAACT GCAACTGAAA
GAAACGACCG CCCAGCCCGA AGGCGAACGC GCGAATCTGC GCGTGATTAC CCAGAATCGT
CAGGCACTGA GTGACATTAC CGCCATTCTG CCGGATGGCA ACAAAGTGAT GATGAGCTCC
CTGCGCCAGT TTTCTGGCAC GCAGCCGCTC TACACACTCG ACGGTGACGG CACGCTGACG
AATAATCAGA GCGGCGTAAA ATATCGTCCA AACGACCAGA TTGGTTTTTA CCAGTCCATC
ACTGCTGACG GCAACTGGGG TGATGAAAAG CTAAGCCCCG GTTACACCGT GACCACCGGC
TGGAAAAACT TTACCCGCGT CTTTACCGAC GAAGGCATTC AGAAACCGTT CCTCGCCATT
TTCGTCTGGA CCGTGGTGTT CTCGCTGATC ACTGTCTTTT TAACGGTGGC GGTCGGCATG
GTTCTGGCGT GTCTGGTGCA GTGGGAAGCG TTGCGCGGCA AAGCGGTCTA TCGCGTCCTG
CTTATTCTGC CCTACGCGGT GCCATCGTTC ATTTCAATCT TGATTTTCAA AGGGTTGTTT
AACCAGAGCT TCGGTGAGAT CAACATGATG TTGAGCGCGC TGTTTGGCGT GAAGCCCGCC
TGGTTCAGCG ATCCGACCAC CGCCCGAACG ATGCTGATTA TCGTCAATAC CTGGCTGGGT
TATCCGTACA TGATGATCCT CTGCATGGGC TTGCTGAAAG CGATTCCGGA CGATTTGTAT
GAAGCCTCAG CAATGGATGG CGCAGGTCCG TTCCAGAACT TCTTTAAGAT TACGCTGCCG
CTGCTGATTA AACCGCTGAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAACAAC
TTCGTGCTGA TTCAACTGTT AACCAACGGC GGCCCGGATC GTCTTGGCAC GACCACGCCA
GCCGGTTATA CCGACCTGCT TGTTAACTAC ACCTACCGCA TCGCTTTTGA AGGCGGCGGG
GGTCAGGACT TCGGTCTGGC AGCAGCAATT GCCACGCTGA TCTTCCTGCT GGTGGGTGCG
CTGGCGATAG TGAACCTGAA AGCCACGCGA ATGAAGTTTG ATTAA
 
Protein sequence
MDVIKKKHWW QSDALKWSVL GLLGLLVGYL VVLMYAQGEY LFAITTLILS SAGLYIFANR 
KAYAWRYVYP GMAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQEVLL DRSWQAGKTY
NFGLYPAGDE WQLALSDGET GKNYLSDAFK FGGEQKLQLK ETTAQPEGER ANLRVITQNR
QALSDITAIL PDGNKVMMSS LRQFSGTQPL YTLDGDGTLT NNQSGVKYRP NDQIGFYQSI
TADGNWGDEK LSPGYTVTTG WKNFTRVFTD EGIQKPFLAI FVWTVVFSLI TVFLTVAVGM
VLACLVQWEA LRGKAVYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGVKPA
WFSDPTTART MLIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP
LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVNY TYRIAFEGGG
GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD