Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0526 |
Symbol | |
ID | 7978241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 595195 |
End bp | 596883 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644797527 |
Product | alpha amylase catalytic region |
Protein accession | YP_002948701 |
Protein GI | 239826077 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAACGAG CATGGTGGAA AGAGGCGGTT GTGTATCAAA TTTATCCGCG CAGTTTTTAC GATTCCAACG GGGATGGCAT TGGCGATATC CGCGGCATTA TCGCGAAACT TGATTATTTA AAAGAACTTG GCGTGGATGT GGTATGGCTA TCTCCGGTAT ATAAATCGCC AAATGATGAC AACGGATATG ATATAAGCGA TTATCGGGAG ATTATGGATG AATTTGGTAC GATGGAAGAT TGGGAAGAAA TGCTTGAGGA AATGCACAAA CGCGGGATTA AGCTAGTGAT GGATTTAGTC GTCAATCATA TATCAGATGA GCATCCGTGG TTTATCGAAT CAAGAAAGTC AAAGGACAAT CCGTATCGCG ACTACTATAT ATGGCGGCCT GGAAAAGATG GAAAGGAACC GAACAATTGG GAGTCAAATT TTAGCGGTTC AGCGTGGGAA TATGACGAAA CGACGGAAGA ATATTATTTG CATCTTTTTT CAAAAAAACA ACCAGATTTA AACTGGGAAA ATCCAAAAGT GCGCCGCGAA GTGTATGACA TAATGAAGTT TTGGCTTGAT AAAGGCGTCG ACGGCTTCCG AATGGATGTC ATCAACATGA TTTCGAAAGT GCCAGAATTA CCGGACGGAA AACCGCAGGA AGGGAAGAAA TACGCTTCGG GAAGCAAGTA TTTTATGAAC GGTCCGCGCG TTCATGAGTT TTTGCAAGAG ATGAACCGCG AAGTATTGTC AAAATACGAC ATTATGACGG TCGGAGAAAC GCCGGGAGTC ACACCAAAAG AGGGAATTTT ATATACCGAC CCATCGCGTC ATGAGTTGAA CATGGTGTTT CAATTCGAGC ATGTTGGTTT AGATTCCGGA CCTGGAGGAA AATGGGATAT TCGTCCATGG TCGTTGGCCG ACTTGAAAAA AACGATGACA AAATGGCAAA AAGAGCTAGA AGGAAAAGGA TGGAACAGTC TTTACTTAAA CAATCATGAT CAGCCACGCG CTGTTTCTCG CTTTGGCGAT GATGGAAAGT ATCGTGTGGA ATCGGCGAAA ATGCTTGCAA CATTTCTCCA TATGATGCAA GGAACACCGT ATATTTACCA AGGTGAAGAG ATCGGAATGA CCAATGTGCG CTTCCCGTCG ATTGAATACT ACCGCGATAT TGAAACGTTG AACATGTATA AAGAACGTGT GGAAGAATAT GGTGAAGATC CGCAAAAAGT GATGGAGAAA ATTTATTATA AAGGGCGTGA CAACGCGCGC ACACCGATGC AGTGGGATGA CAGCGAAAAC GCAGGATTTA CAACGGGGAC GCCATGGATT CCAGTAAATC CAAATTATAA GGAAATCAAC GTAAAAGAGG CTTTAGCGGA TCCAAATTCG GTGTTTCATT ATTATAAAAA ATTAATTCAA TTTCGCAAGC AGCATGACAT TATTGTCTAT GGAACATATG ACTTAATTTT GGAAGACGAT CCGTATATTT ACGCATATAC ACGCACATTG GGAAATGAAA AGCTGATTGT TATTACTAAT TTTTCTGAAA AAACTCCTGT TTTCCGGCTT CCGGATGATA TCACCTATAA AACAAAAGAG CTGCTTATCA GCAATTACGA TGTTGATGAA ACGGAAGAAC TGAAAGAAAT TCGCCTGCGT CCATGGGAGG CGCGCGTATA TAAAATCCGT TTGTCATGA
|
Protein sequence | MERAWWKEAV VYQIYPRSFY DSNGDGIGDI RGIIAKLDYL KELGVDVVWL SPVYKSPNDD NGYDISDYRE IMDEFGTMED WEEMLEEMHK RGIKLVMDLV VNHISDEHPW FIESRKSKDN PYRDYYIWRP GKDGKEPNNW ESNFSGSAWE YDETTEEYYL HLFSKKQPDL NWENPKVRRE VYDIMKFWLD KGVDGFRMDV INMISKVPEL PDGKPQEGKK YASGSKYFMN GPRVHEFLQE MNREVLSKYD IMTVGETPGV TPKEGILYTD PSRHELNMVF QFEHVGLDSG PGGKWDIRPW SLADLKKTMT KWQKELEGKG WNSLYLNNHD QPRAVSRFGD DGKYRVESAK MLATFLHMMQ GTPYIYQGEE IGMTNVRFPS IEYYRDIETL NMYKERVEEY GEDPQKVMEK IYYKGRDNAR TPMQWDDSEN AGFTTGTPWI PVNPNYKEIN VKEALADPNS VFHYYKKLIQ FRKQHDIIVY GTYDLILEDD PYIYAYTRTL GNEKLIVITN FSEKTPVFRL PDDITYKTKE LLISNYDVDE TEELKEIRLR PWEARVYKIR LS
|
| |