Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4068 |
Symbol | malS |
ID | 5588951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4050118 |
End bp | 4052148 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640927687 |
Product | periplasmic alpha-amylase precursor |
Protein accession | YP_001465047 |
Protein GI | 157158372 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG ACTTCTCCGG GGTTCCCTGC CTTTAGCGAA CAGGGAACGG GAACATTTGT CAGCCACGCG CAGTTGCCCA AAGGTACGCG TCCACTCACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT GCAGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT CAATGGCGAT TGTTCAGAGA CGGCAAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAACCGG TAGCAAACCT GGTCCGTGAA TGCCCGAAAT GGGATGGATT ACCGCTCACG CTGGATGTCA GCGCCACTTT CCCGGAAGGA GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAACG TTACAACCCG CTGCTACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACGCC CCTGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG CTGGCGGATA TGCAGGAATA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAGGTGAAA AAAACGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG CTGGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC TATAAAAACA AAACGGATAC TCACGCTAAA GTCATCGAAG GCTTTACACC TCGCGATTAC TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCGCCGCG CTTCGCGAAT GGAAAAAAGC TTACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGATTG TCTGGCGCAG ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC CTCTCGTCGC ATGATACCCG TCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGCGATG AATCCTCGCG TCCGTTCGGT CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC GGTAAATCTG CCGCCAACGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTCGCTGA AGCAGGGCTA CGGCTTTGTT CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A
|
Protein sequence | MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP ADAIKLNQML SLQPCSNTPP QWRLFRDGKY TLQIDTRSGT PTLMISIQNA AEPVANLVRE CPKWDGLPLT LDVSATFPEG AAVRDYYSQQ IAIVKNGQIT LQPAATSNGL LLLERAETDA PAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KTLGERWSDW KPAAGQTWHS FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF YKNKTDTHAK VIEGFTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASAA LREWKKAYPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG PTGSDPLQGT RSDMNWQDVS GKSAANVAHW QKISQFRARH PAIGAGKQTT LSLKQGYGFV REHGDDKVLV IWAGQQ
|
| |