Gene Information |
Locus tag | SbBS512_E3951 |
Symbol | malS |
ID | 6270624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3684229 |
End bp | 3686259 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641727800 |
Product | periplasmic alpha-amylase precursor |
Protein accession | YP_001882233 |
Protein GI | 187734117 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
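The coordinate and length fields above are mutually consistent, which a short check makes explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


length = gene_length_bp(3684229, 3686259)
print(length)                     # 2031
print(protein_length_aa(length))  # 676
```

Both values agree with the Gene Length (2031 bp) and Protein Length (676 aa) rows of the record.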
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCG CCTCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG ACTTCTCCGG GGTTCCCTGT CTTTAGCGAA CAGGGAACGG GAACATTTGT CAGCCACGCG CAGTTGCCCA AAGGTACGCG TCCACTCACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT GCAGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT CAATGGCGAT TGTTCAGGGA CGGCAAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAAGCGG TAGCAAACCT GGTCCGTGAA TGCCCGAAAT GGGATGGATT ACCGCTCACG CTGGATGTCA GCGCCACTTT CCCGGAAGGA GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAACG TTACAACCCG CTGCTACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACACA TCCGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT GGCACTTTTC ACGGCGGCGA TTTACGCGCC CTGATCAATA AACTGGATTA CCTCCAGCAG TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG CTGGCGGATA TGCAGGAGTA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA AAAACGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG CTGGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC TATAAAAACA AAACGGATAC TCACGCTAAA GTCATCGAAG GCTTTACACC TCGCGATTAC TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCTCCGCG CTTCGCGAAT GGAAAAAAAC TAACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGATTG TCTGGCGCAG ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC CTCTCGTCGC ATGATACCCG TCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGCGATG AATCCTCGCG TCCGTTCGGT 
CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC GGTAAATCTG CCGCCAACGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTCGCTGA AGCAGGGCTA CGGCTTTGTT CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A
|
Protein sequence | MKLASCFLTL LPGFAVAASW TSPGFPVFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP ADAIKLNQML SLQPCSNTPP QWRLFRDGKY TLQIDTRSGT PTLMISIQNA AEAVANLVRE CPKWDGLPLT LDVSATFPEG AAVRDYYSQQ IAIVKNGQIT LQPAATSNGL LLLERAETDT SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRA LINKLDYLQQ LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KTLGERWSDW KPAAGQTWHS FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF YKNKTDTHAK VIEGFTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASSA LREWKKTNPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG PTGSDPLQGT RSDMNWQDVS GKSAANVAHW QKISQFRARH PAIGAGKQTT LSLKQGYGFV REHGDDKVLV IWAGQQ
|
| |
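The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch of how the two sequence fields relate, the code below strips the 10-base spacing, computes a GC fraction, and translates codons into residues. It assumes the standard codon assignments, which translation table 11 shares with table 1 (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
first_30 = "ATGAAACTCG CCTCCTGTTT TCTGACACTC"
print(translate(first_30))  # MKLASCFLTL
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content row.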