Gene GWCH70_0601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0601 
Symbol 
ID7978790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp667079 
End bp668743 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content43% 
IMG OID644797590 
Productalpha amylase catalytic region 
Protein accessionYP_002948764 
Protein GI239826140 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000199149 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAG CTTGGTGGAA AGAAGGAGTC GCGTATCAAA TTTATCCGAG AAGCTTCATG 
GATTCAAATG GCGATGGAAT TGGAGATCTT CGCGGAATTA TCGAGAAGCT TGACTATTTA
AAAGACCTTG GAATCGATAT CATATGGATT TGTCCAATTT ACAAGTCGCC AAATGCCGAT
AACGGATATG ATATTAGCGA TTATCATGCG ATTATGGAAG AATTCGGGAC AATGGAAGAC
TTTGATTTAT TGCTCGAGGA AATTCACCGG CGCGGCATGA AAGTGATTTT AGATTTAGTC
ATTAACCATA CAAGCGATGA ACACCCATGG TTTATTGAAT CGCGTTCATC ACGGGATAAT
CCGAAACGGG ATTGGTACAT TTGGCGTGAT GGAAAAAATG GCAAAGAGCC GAACAACTGG
GAAAGCATCT TTGGCGGATC GGCGTGGGAG TACGATCCGA AAACTGATCA ATATTACTTG
CACATATTTG ATGTAAAGCA GCCGGATTTA AACTGGGAAA ACGAAGAAGT GCGCCAAGCG
TTGTATAAAA TGATTAATTG GTGGCTGGAT AAAGGAATTG ATGGATTTCG AGTGGATGCC
ATTTCACATA TTAAGAAAAA GCCGGGATTG CCGGATTTGC CGAATCCGAA AGGGCTAGAT
TATGTTCCGT CTTTTGCTGG CCATATGAAC CAAGAAGGGA TTATGGATTA TTTAAGAGAG
CTCAAAATGC AAACGTTTGC ACGCTATGAT ATTATGACGG TTGGAGAAGC GAATGGCGTT
ACCGTCGAGG ACGCGGAAGA ATGGGTCGGT GAGGAAAACG GTATTTTCAA TATGATCTTC
CAGTTTGAAC ATTTAGGGTT ATGGCAAAAA GGAACGAATG GCGGAGTGGA TGTGCGCCAA
TTAAAGCGTA CGTTGACAAA ATGGCAAAAA GGGCTGGAAA ACCGCGGCTG GAACGCATTG
TTTTTAGAAA ACCACGATCA GCCTCGTTCT GTCTCGACAT GGGGAAATGA TAAGGAGTAC
CTTACCGAAA GTGCGAAGGC GCTTGGTGCG ATGTACTTTT TGATGCAAGG AACACCGTTT
ATTTATCAAG GACAGGAGAT TGGGATGACG AACGTCCAAT TTTCGAATAT TGAAGACTAC
AATGATGTTG CTATTAAAAG AATGTATCAA ATCGAACGGG AAAAGGGTCG CTCCCATGAA
GAAATTATGA AAGTGATTTG GAAAACAGGG CGCGACAATT CGCGAACCCC AATGCAATGG
TCCGACGCGC CAAACGCGGG GTTTACGACG GGAACGCCAT GGATGAAAGT GAACGAAAAC
TATAAGACCA TTAACGTCGA GGCGCAATTG CGCGACCCGA ACTCCGTCCT TCAGTTTTAC
AAAAAAATGA TTCGGCTTCG CAAGGAGAAC GAAGTGTTTA TTTATGGAAC GTACGATTTG
ATTTTAGAAA AGCATCCGAC GATTTATGCG TACACAAGAA CGCTTGGAAA CGAAAAAGCG
ATGGTTATTG TCAATTTAAG CGACAAGCCA GCGCTGTATC GATATGATGG CATTCGATTA
AGCTCAGAAA ATTTGGTGCT GCAAAACTAT GATGTAAAAC CGCATAAAAA CGCAACACGC
TTTAAGCTGA AACCGTATGA AGCACGCGTA TATTTGCTGA AATAG
 
Protein sequence
MKKAWWKEGV AYQIYPRSFM DSNGDGIGDL RGIIEKLDYL KDLGIDIIWI CPIYKSPNAD 
NGYDISDYHA IMEEFGTMED FDLLLEEIHR RGMKVILDLV INHTSDEHPW FIESRSSRDN
PKRDWYIWRD GKNGKEPNNW ESIFGGSAWE YDPKTDQYYL HIFDVKQPDL NWENEEVRQA
LYKMINWWLD KGIDGFRVDA ISHIKKKPGL PDLPNPKGLD YVPSFAGHMN QEGIMDYLRE
LKMQTFARYD IMTVGEANGV TVEDAEEWVG EENGIFNMIF QFEHLGLWQK GTNGGVDVRQ
LKRTLTKWQK GLENRGWNAL FLENHDQPRS VSTWGNDKEY LTESAKALGA MYFLMQGTPF
IYQGQEIGMT NVQFSNIEDY NDVAIKRMYQ IEREKGRSHE EIMKVIWKTG RDNSRTPMQW
SDAPNAGFTT GTPWMKVNEN YKTINVEAQL RDPNSVLQFY KKMIRLRKEN EVFIYGTYDL
ILEKHPTIYA YTRTLGNEKA MVIVNLSDKP ALYRYDGIRL SSENLVLQNY DVKPHKNATR
FKLKPYEARV YLLK