Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4495 |
Symbol | malF |
ID | 6144757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4591081 |
End bp | 4592625 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619311 |
Product | maltose transporter membrane protein |
Protein accession | YP_001746423 |
Protein GI | 170684307 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1175] ABC-type sugar transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.654176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGTTA GGTCTGCTCG GCCTGCTGGT GGGTTACCTT GTTGTTTTAA TGTACGCACA AGGGGAATAC CTGTTCGCCA TTACCACGCT GATATTGAGT TCAGCGGGGC TGTATATTTT CGCCAATCGT AAAGCCTACG CCTGGCGCTA TGTTTACCCG GGAATGGCTG GAATGGGATT ATTCGTCCTC TTCCCTTTGG TCTGCACCAT CGCCATTGCC TTTACCAACT ACAGCAGCAC TAACCAGCTG ACTTTTGAAC GTGCGCAGGA AGTGTTGTTA GATCGCTCCT GGCAAGCAGG CAAAACCTAT AATTTTGGTC TTTACCCGGC GGGCGATGAG TGGCAACTGG CGCTCAGCGA CGGCGAAACC GGCAAAAATT ACCTCTCCGA CGCGTTTAAA TTTGGCGGCG AGCAAAAACT GCAACTGAAA GAAACGACCG CCCAGCCCGA AGGCGAACGC GCGAATCTGC GCGTGATTAC CCAGAATCGT CAGGCACTGA GTGACATTAC CGCCATTCTG CCGGATGGCA ACAAAGTGAT GATGAGCTCC CTGCGCCAGT TTTCTGGCAC GCAGCCGCTC TACACACTCG ACGGTGACGG CACGCTGACG AATAATCAGA GCGGCGTAAA ATATCGTCCA AACGACCAGA TTGGTTTTTA CCAGTCCATC ACTGCTGACG GCAACTGGGG TGATGAAAAG CTAAGCCCCG GTTACACCGT GACCACCGGC TGGAAAAACT TTACCCGCGT CTTTACCGAC GAAGGCATTC AGAAACCGTT CCTCGCCATT TTCGTCTGGA CCGTGGTGTT CTCGCTGATC ACTGTCTTTT TAACGGTGGC GGTCGGCATG GTTCTGGCGT GTCTGGTGCA GTGGGAAGCG TTGCGCGGCA AAGCGGTCTA TCGCGTCCTG CTTATTCTGC CCTACGCGGT GCCATCGTTC ATTTCAATCT TGATTTTCAA AGGGTTGTTT AACCAGAGCT TCGGTGAGAT CAACATGATG TTGAGCGCGC TGTTTGGCGT GAAGCCCGCC TGGTTCAGCG ATCCGACCAC CGCCCGAACG ATGCTGATTA TCGTCAATAC CTGGCTGGGT TATCCGTACA TGATGATCCT CTGCATGGGC TTGCTGAAAG CGATTCCGGA CGATTTGTAT GAAGCCTCAG CAATGGATGG CGCAGGTCCG TTCCAGAACT TCTTTAAGAT TACGCTGCCG CTGCTGATTA AACCGCTGAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAACAAC TTCGTGCTGA TTCAACTGTT AACCAACGGC GGCCCGGATC GTCTTGGCAC GACCACGCCA GCCGGTTATA CCGACCTGCT TGTTAACTAC ACCTACCGCA TCGCTTTTGA AGGCGGCGGG GGTCAGGACT TCGGTCTGGC AGCAGCAATT GCCACGCTGA TCTTCCTGCT GGTGGGTGCG CTGGCGATAG TGAACCTGAA AGCCACGCGA ATGAAGTTTG ATTAA
|
Protein sequence | MDVIKKKHWW QSDALKWSVL GLLGLLVGYL VVLMYAQGEY LFAITTLILS SAGLYIFANR KAYAWRYVYP GMAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQEVLL DRSWQAGKTY NFGLYPAGDE WQLALSDGET GKNYLSDAFK FGGEQKLQLK ETTAQPEGER ANLRVITQNR QALSDITAIL PDGNKVMMSS LRQFSGTQPL YTLDGDGTLT NNQSGVKYRP NDQIGFYQSI TADGNWGDEK LSPGYTVTTG WKNFTRVFTD EGIQKPFLAI FVWTVVFSLI TVFLTVAVGM VLACLVQWEA LRGKAVYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGVKPA WFSDPTTART MLIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVNY TYRIAFEGGG GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD
|
| |