Gene Information |
Locus tag | EcSMS35_3894 |
Symbol | malS |
ID | 6142972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3963567 |
End bp | 3965597 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618720 |
Product | periplasmic alpha-amylase precursor |
Protein accession | YP_001745859 |
Protein GI | 170684113 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
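The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the terminal stop codon; the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the span ends with a stop codon that is not translated.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    gene_length = end_bp - start_bp + 1  # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the Start bp / End bp rows of the record.
gene_len, prot_len = cds_lengths(3963567, 3965597)
print(gene_len, prot_len)  # 2031 676
```

Under these assumptions the result agrees with the Gene Length (2031 bp) and Protein Length (676 aa) rows.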
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG ACTTCTCCGG GGTTTCCCGC CTTTAGCGAA CAGGGGACGG GAACATTTGT CAGCCACGCA CAGTTGCCCA AAGGTACGCG TCCACTAACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT GCGGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT CAATGGCGAT TGTTCAGGGA CGGCGAATAT ACGCTGCAAC TAGACACCCG CTCCGGTACA CCAACATTGA TGATTTCCAT CCAGAACGCC GTCGAACCGG TAGCAAGCCT GGTCCGTGAA TGCCCGAAAT GGGATGGATT ACCGCTCACA CTGGATGTCA GCGCCACTTT TCCGGAGGGA GCCGCCGTCC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAACG TTACAACCCG CTGCCACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACGCC TCTGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCAGAAATT GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG CTGGGCGTCA ATGCTTTATG GATAAGTGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG ACGAATCTTG ATGCCAATAT GGGCCGCGAA GCCGATCTAC GGACGCTGGT TGATAGCGCG CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG TTACGCCACA CTGGCGGATA TGCAGGAGTT TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA AAAACGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC TGGATCAGAA CCGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG CTGGCCTTTT TGCCGGATAT CAAAACCGAA TCAACGACCG CTTCTGGTCT GCCGGTGTTC TATAAAAACA AAACGGATAC CCACGCTAAA GTCATCGACG GCTTTACCCC TCGCGATTAC TTAACCCACT GGTTAAGTCA GTGGGTTCGC GACTATGGGA TTGATGGTTT TCGGGTCGAC ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCGCCGCG CTTCGCGAAT GGAAAAAAGC TAACTCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGATTG TCTGGCGCAG ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG ATTTCAACGT GTTGAGCTAC CTCTCATCGC ATGATACCCG CCTGTTCCGC GAAGGTGGTA ATAAAGCAGC AGAGTTATTG TTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGTGATG AATCCTCGCG TCCGTTCGGC 
CCCACCGGTT CGGATCCGCT GCAAGGTACG CGTTCGGATA TGAACTGGCA GGATGTTAGC GGTAAATCTG CCGCCAGCGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT CCCGCAATTG GCGAGGGCAA ACAAACGACA CTTTCGATGA AGCAGGGCTA CGGCTTTGTT CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A
|
Protein sequence | MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP ADAIKLNQML SLQPCSNTPP QWRLFRDGEY TLQLDTRSGT PTLMISIQNA VEPVASLVRE CPKWDGLPLT LDVSATFPEG AAVRDYYSQQ IAIVKNGQIT LQPAATSNGL LLLERAETDA SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGRE ADLRTLVDSA HQRGIRILFD VVMNHTGYAT LADMQEFQFG ALYLSGDEVK KTLGERWSDW KPAAGQTWHS FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF YKNKTDTHAK VIDGFTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASAA LREWKKANSD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ MDTTWQQMAE KLQDFNVLSY LSSHDTRLFR EGGNKAAELL LLAPGAVQIF YGDESSRPFG PTGSDPLQGT RSDMNWQDVS GKSAASVAHW QKISQFRARH PAIGEGKQTT LSMKQGYGFV REHGDDKVLV IWAGQQ
|
| |
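The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is an illustration under stated assumptions, not the method used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons (the tables differ in permitted start codons). Only the first and last blocks of the gene sequence are used, so the GC value computed here describes that fragment and not the 53% reported for the whole gene.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last three blocks of the gene sequence in the record.
first_blocks = "ATGAAACTCG CCGCCTGTTT TCTGACACTC"
last_blocks = "ATCTGGGCTG GGCAACAGTG A"

print(translate(first_blocks))  # MKLAACFLTL
print(translate(last_blocks))   # IWAGQQ
print(gc_fraction(first_blocks))
```

The two translated fragments match the first block (MKLAACFLTL) and the final residues (IWAGQQ) of the protein sequence listed above.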