Gene EcolC_3996 details

Gene Information

Locus tag: EcolC_3996
Symbol: malF
ID: 6064546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4388834
End bp: 4390378
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 53%
IMG OID: 641603407
Product: maltose transporter membrane protein
Protein accession: YP_001726922
Protein GI: 170021968
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
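
The coordinate and length fields in this record agree with each other, and a short check makes the arithmetic explicit. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4388834, 4390378)
print(length)                     # 1545
print(protein_length_aa(length))  # 514
```

Both values match the Gene Length and Protein Length fields listed in the record.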


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0600719
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.596443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGCTA 
GGTCTGCTCG GCCTGCTGGT GGGTTACCTT GTTGTTTTAA TGTACGCACA AGGGGAATAC
CTGTTCGCCA TTACCACGCT GATATTGAGT TCAGCGGGGC TGTATATTTT CGCCAATCGT
AAAGCCTACG CCTGGCGCTA TGTTTACCCG GGAATGGCTG GAATGGGATT ATTCGTCCTC
TTCCCTCTGG TCTGCACCAT CGCCATTGCC TTCACCAACT ACAGCAGCAC TAACCAGCTG
ACTTTTGAAC GTGCGCAGGA AGTGTTGTTA GATCGCTCCT GGCAAGCAGG CAAAACCTAT
AACTTTGGTC TTTACCCGGC GGGCGATGAG TGGCAACTGG CGCTCAGCGA CGGCGAAACC
GGCAAAAATT ACCTCTCCGA CGCTTTTAAA TTTGGCGGCG AGCAAAAACT GCAACTGAAA
GAAACGACCG CCCAGCCCGA AGGCGAACGC GCGAATCTGC GCGTGATTAC CCAGAATCGT
CAGGCGCTGA GTGACATTAC CGCCATTCTG CCGGATGGCA ACAAAGTGAT GATGAGCTCC
CTGCGCCAGT TTTCTGGCAC GCAGCCGCTC TACACACTCG ACGGTGACGG CACGTTGACG
AATAATCAGA GCGGCGTGAA ATATCGTCCG AATAACCAAA TTGGCTTTTA CCAGTCCATT
ACCGCCGACG GCAACTGGGG TGATGAAAAG CTAAGCCCCG GTTACACCGT GACCACCGGC
TGGAAAAACT TTACCCGCGT CTTTACCGAC GAAGGCATTC AGAAACCGTT CCTCGCCATT
TTCGTCTGGA CCGTGGTGTT CTCGCTGATC ACTGTCTTTT TAACGGTGGC GGTCGGCATG
GTTCTGGCGT GTCTGGTGCA GTGGGAAGCG TTGCGCGGCA AAGCGGTCTA TCGCGTCCTG
CTGATTCTGC CCTACGCGGT GCCATCGTTC ATTTCAATCT TGATTTTCAA AGGGTTGTTT
AACCAGAGCT TCGGTGAAAT CAACATGATG TTGAGCGCGC TGTTTGGCGT GAAGCCCGCC
TGGTTCAGCG ATCCGACCAC CGCCCGCACG ATGCTAATTA TCGTCAATAC CTGGCTGGGT
TATCCGTACA TGATGATCCT CTGCATGGGC TTGCTGAAAG CGATTCCGGA CGATTTGTAT
GAAGCCTCAG CAATGGATGG CGCAGGTCCG TTCCAGAACT TCTTTAAGAT TACGCTGCCG
CTGCTGATTA AACCGCTGAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAACAAC
TTCGTGCTGA TTCAACTGTT AACCAACGGC GGCCCGGATC GTCTTGGCAC GACCACGCCA
GCCGGTTATA CCGACCTGCT TGTTAACTAC ACCTACCGCA TCGCTTTTGA AGGCGGCGGG
GGTCAGGACT TCGGTCTGGC GGCAGCAATT GCCACGCTGA TCTTCCTGCT GGTGGGTGCG
CTGGCGATAG TGAACCTGAA AGCCACGCGA ATGAAGTTTG ATTAA
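
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be compared with the record (1545 bp, 53%). The following is a minimal sketch of one way to do that; the helper names are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGCTA"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be rounded and compared with the GC content field.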
 
Protein sequence
MDVIKKKHWW QSDALKWSVL GLLGLLVGYL VVLMYAQGEY LFAITTLILS SAGLYIFANR 
KAYAWRYVYP GMAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQEVLL DRSWQAGKTY
NFGLYPAGDE WQLALSDGET GKNYLSDAFK FGGEQKLQLK ETTAQPEGER ANLRVITQNR
QALSDITAIL PDGNKVMMSS LRQFSGTQPL YTLDGDGTLT NNQSGVKYRP NNQIGFYQSI
TADGNWGDEK LSPGYTVTTG WKNFTRVFTD EGIQKPFLAI FVWTVVFSLI TVFLTVAVGM
VLACLVQWEA LRGKAVYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGVKPA
WFSDPTTART MLIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP
LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVNY TYRIAFEGGG
GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD
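
The protein sequence can be cross-checked against the gene sequence by translating the CDS. The sketch below uses only the Python standard library and builds the codon table from the amino acid string of NCBI translation table 11, which assigns the same amino acids to codons as the standard table. It assumes the CDS starts with ATG and ends in a stop codon; the special handling of alternative start codons in table 11 is not implemented, and `translate_cds` is an illustrative name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate_cds("ATGGATGTCA TTAAAAAGAA ACATTGGTGG"))  # MDVIKKKHWW
print(translate_cds("ATGAAGTTTG ATTAA"))                  # MKFD
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown above.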