Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03865 |
Symbol | malF |
ID | 8113657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 4148992 |
End bp | 4150536 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644850021 |
Product | hypothetical protein |
Protein accession | YP_003001594 |
Protein GI | 251787290 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1175] ABC-type sugar transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCA TTAAAAAGAA ACATTGGTGG CAAAGCGACG CGCTGAAATG GTCAGTGCTA GGTCTGCTCG GCCTGCTGGT GGGTTACCTT GTTGTTTTAA TGTACGCACA AGGGGAATAC CTGTTCGCCA TTACCACGCT GATATTGAGT TCAGCGGGGC TGTATATTTT CGCCAATCGT AAAGCCTACG CCTGGCGCTA TGTTTACCCG GGAATGGCTG GAATGGGATT ATTCGTCCTC TTCCCTCTGG TCTGCACCAT CGCCATTGCC TTCACCAACT ACAGCAGCAC TAACCAGCTG ACTTTTGAAC GTGCGCAGGA AGTGTTGTTA GATCGCTCCT GGCAAGCAGG CAAAACCTAT AACTTTGGTC TTTACCCGGC GGGCGATGAG TGGCAACTGG CGCTCAGCGA CGGCGAAACC GGCAAAAATT ACCTCTCCGA CGCTTTTAAA TTTGGCGGCG AGCAAAAACT GCAACTGAAA GAAACGACCG CCCAGCCCGA AGGCGAACGC GCGAATCTGC GCGTGATTAC CCAGAATCGT CAGGCGCTGA GTGACATTAC CGCCATTCTG CCGGATGGCA ACAAAGTGAT GATGAGCTCC CTGCGCCAGT TTTCTGGCAC GCAGCCGCTC TACACACTCG ACGGTGACGG CACGTTGACG AATAATCAGA GCGGCGTGAA ATATCGTCCG AATAACCAAA TTGGCTTTTA CCAGTCCATT ACCGCCGACG GCAACTGGGG TGATGAAAAG CTAAGCCCCG GTTACACCGT GACCACCGGC TGGAAAAACT TTACCCGCGT CTTTACCGAC GAAGGCATTC AGAAACCGTT CCTCGCCATT TTCGTCTGGA CCGTGGTGTT CTCGCTGATC ACTGTCTTTT TAACGGTGGC GGTCGGCATG GTTCTGGCGT GTCTGGTGCA GTGGGAAGCG TTGCGCGGCA AAGCGGTCTA TCGCGTCCTG CTGATTCTGC CCTACGCGGT GCCATCGTTC ATTTCAATCT TGATTTTCAA AGGGTTGTTT AACCAGAGCT TCGGTGAAAT CAACATGATG TTGAGCGCGC TGTTTGGCGT GAAGCCCGCC TGGTTCAGCG ATCCGACCAC CGCCCGCACG ATGCTAATTA TCGTCAATAC CTGGCTGGGT TATCCGTACA TGATGATCCT CTGCATGGGC TTGCTGAAAG CGATTCCGGA CGATTTGTAT GAAGCCTCAG CAATGGATGG CGCAGGTCCG TTCCAGAACT TCTTTAAGAT TACGCTGCCG CTGCTGATTA AACCGCTGAC GCCGCTGATG ATCGCCAGCT TCGCCTTTAA CTTTAACAAC TTCGTGCTGA TTCAACTGTT AACCAACGGC GGCCCGGATC GTCTTGGCAC GACCACGCCA GCCGGTTATA CCGACCTGCT TGTTAACTAC ACCTACCGCA TCGCTTTTGA AGGCGGCGGG GGTCAGGACT TCGGTCTGGC GGCAGCAATT GCCACGCTGA TCTTCCTGCT GGTGGGTGCG CTGGCGATAG TGAACCTGAA AGCCACGCGA ATGAAGTTTG ATTAA
|
Protein sequence | MDVIKKKHWW QSDALKWSVL GLLGLLVGYL VVLMYAQGEY LFAITTLILS SAGLYIFANR KAYAWRYVYP GMAGMGLFVL FPLVCTIAIA FTNYSSTNQL TFERAQEVLL DRSWQAGKTY NFGLYPAGDE WQLALSDGET GKNYLSDAFK FGGEQKLQLK ETTAQPEGER ANLRVITQNR QALSDITAIL PDGNKVMMSS LRQFSGTQPL YTLDGDGTLT NNQSGVKYRP NNQIGFYQSI TADGNWGDEK LSPGYTVTTG WKNFTRVFTD EGIQKPFLAI FVWTVVFSLI TVFLTVAVGM VLACLVQWEA LRGKAVYRVL LILPYAVPSF ISILIFKGLF NQSFGEINMM LSALFGVKPA WFSDPTTART MLIIVNTWLG YPYMMILCMG LLKAIPDDLY EASAMDGAGP FQNFFKITLP LLIKPLTPLM IASFAFNFNN FVLIQLLTNG GPDRLGTTTP AGYTDLLVNY TYRIAFEGGG GQDFGLAAAI ATLIFLLVGA LAIVNLKATR MKFD
|
| |