Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0143 |
Symbol | malS |
ID | 6068305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 155299 |
End bp | 157329 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641599543 |
Product | periplasmic alpha-amylase precursor |
Protein accession | YP_001723152 |
Protein GI | 170018198 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG ACTTCTCCGG GGTTTCCCGC CTTTAGCGAA CAGGGGACAG GAACATTTGT CAGCCACGCG CAGTTGCCCA AAGGTACGCG TCCACTAACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT GCGGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT CAATGGCGAT TGTTCAGGGA CGGCGAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAACCGG TAGCAAGCCT GGTCCGTGAA TGCCCGAAAT GGGATGGATT ACCGCTCACA GTGGATGTCA GCGCCACTTT CCCGGAAGGA GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAATG TTACAACCCG CTGCCACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACACA TCCGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG CTGGCGGATA TGCAGGAGTA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA AAATCGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG CTAGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC TATAAAAACA AAATGGATAC CCACGCCAAA GCCATTGACG GCTATACGCC GCGCGATTAC TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCGCCGCG CTTCGCGAAT GGAAAAAAGC TAACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGACTG TCTGGCGCAG ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC CTCTCGTCGC ATGATACTCG CCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGTGATG AATCCTCGCG TCCGTTCGGT CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC GGTAAATCTG CCGCCAGCGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTTGCTGA AGCAGGGCTA CGGCTTTGTT CGTGAGCATG GCGACGATAA AGTGCTGGTC GTCTGGGCAG GGCAACAGTA A
|
Protein sequence | MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP ADAIKLNQML SLQPCSNTPP QWRLFRDGEY TLQIDTRSGT PTLMISIQNA AEPVASLVRE CPKWDGLPLT VDVSATFPEG AAVRDYYSQQ IAIVKNGQIM LQPAATSNGL LLLERAETDT SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KSLGERWSDW KPAAGQTWHS FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF YKNKMDTHAK AIDGYTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASAA LREWKKANPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG PTGSDPLQGT RSDMNWQDVS GKSAASVAHW QKISQFRARH PAIGAGKQTT LLLKQGYGFV REHGDDKVLV VWAGQQ
|
| |