Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0601 |
Symbol | |
ID | 7978790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 667079 |
End bp | 668743 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644797590 |
Product | alpha amylase catalytic region |
Protein accession | YP_002948764 |
Protein GI | 239826140 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000199149 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAG CTTGGTGGAA AGAAGGAGTC GCGTATCAAA TTTATCCGAG AAGCTTCATG GATTCAAATG GCGATGGAAT TGGAGATCTT CGCGGAATTA TCGAGAAGCT TGACTATTTA AAAGACCTTG GAATCGATAT CATATGGATT TGTCCAATTT ACAAGTCGCC AAATGCCGAT AACGGATATG ATATTAGCGA TTATCATGCG ATTATGGAAG AATTCGGGAC AATGGAAGAC TTTGATTTAT TGCTCGAGGA AATTCACCGG CGCGGCATGA AAGTGATTTT AGATTTAGTC ATTAACCATA CAAGCGATGA ACACCCATGG TTTATTGAAT CGCGTTCATC ACGGGATAAT CCGAAACGGG ATTGGTACAT TTGGCGTGAT GGAAAAAATG GCAAAGAGCC GAACAACTGG GAAAGCATCT TTGGCGGATC GGCGTGGGAG TACGATCCGA AAACTGATCA ATATTACTTG CACATATTTG ATGTAAAGCA GCCGGATTTA AACTGGGAAA ACGAAGAAGT GCGCCAAGCG TTGTATAAAA TGATTAATTG GTGGCTGGAT AAAGGAATTG ATGGATTTCG AGTGGATGCC ATTTCACATA TTAAGAAAAA GCCGGGATTG CCGGATTTGC CGAATCCGAA AGGGCTAGAT TATGTTCCGT CTTTTGCTGG CCATATGAAC CAAGAAGGGA TTATGGATTA TTTAAGAGAG CTCAAAATGC AAACGTTTGC ACGCTATGAT ATTATGACGG TTGGAGAAGC GAATGGCGTT ACCGTCGAGG ACGCGGAAGA ATGGGTCGGT GAGGAAAACG GTATTTTCAA TATGATCTTC CAGTTTGAAC ATTTAGGGTT ATGGCAAAAA GGAACGAATG GCGGAGTGGA TGTGCGCCAA TTAAAGCGTA CGTTGACAAA ATGGCAAAAA GGGCTGGAAA ACCGCGGCTG GAACGCATTG TTTTTAGAAA ACCACGATCA GCCTCGTTCT GTCTCGACAT GGGGAAATGA TAAGGAGTAC CTTACCGAAA GTGCGAAGGC GCTTGGTGCG ATGTACTTTT TGATGCAAGG AACACCGTTT ATTTATCAAG GACAGGAGAT TGGGATGACG AACGTCCAAT TTTCGAATAT TGAAGACTAC AATGATGTTG CTATTAAAAG AATGTATCAA ATCGAACGGG AAAAGGGTCG CTCCCATGAA GAAATTATGA AAGTGATTTG GAAAACAGGG CGCGACAATT CGCGAACCCC AATGCAATGG TCCGACGCGC CAAACGCGGG GTTTACGACG GGAACGCCAT GGATGAAAGT GAACGAAAAC TATAAGACCA TTAACGTCGA GGCGCAATTG CGCGACCCGA ACTCCGTCCT TCAGTTTTAC AAAAAAATGA TTCGGCTTCG CAAGGAGAAC GAAGTGTTTA TTTATGGAAC GTACGATTTG ATTTTAGAAA AGCATCCGAC GATTTATGCG TACACAAGAA CGCTTGGAAA CGAAAAAGCG ATGGTTATTG TCAATTTAAG CGACAAGCCA GCGCTGTATC GATATGATGG CATTCGATTA AGCTCAGAAA ATTTGGTGCT GCAAAACTAT GATGTAAAAC CGCATAAAAA CGCAACACGC TTTAAGCTGA AACCGTATGA AGCACGCGTA TATTTGCTGA AATAG
|
Protein sequence | MKKAWWKEGV AYQIYPRSFM DSNGDGIGDL RGIIEKLDYL KDLGIDIIWI CPIYKSPNAD NGYDISDYHA IMEEFGTMED FDLLLEEIHR RGMKVILDLV INHTSDEHPW FIESRSSRDN PKRDWYIWRD GKNGKEPNNW ESIFGGSAWE YDPKTDQYYL HIFDVKQPDL NWENEEVRQA LYKMINWWLD KGIDGFRVDA ISHIKKKPGL PDLPNPKGLD YVPSFAGHMN QEGIMDYLRE LKMQTFARYD IMTVGEANGV TVEDAEEWVG EENGIFNMIF QFEHLGLWQK GTNGGVDVRQ LKRTLTKWQK GLENRGWNAL FLENHDQPRS VSTWGNDKEY LTESAKALGA MYFLMQGTPF IYQGQEIGMT NVQFSNIEDY NDVAIKRMYQ IEREKGRSHE EIMKVIWKTG RDNSRTPMQW SDAPNAGFTT GTPWMKVNEN YKTINVEAQL RDPNSVLQFY KKMIRLRKEN EVFIYGTYDL ILEKHPTIYA YTRTLGNEKA MVIVNLSDKP ALYRYDGIRL SSENLVLQNY DVKPHKNATR FKLKPYEARV YLLK
|
| |