Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0698 |
Symbol | |
ID | 7978878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 767743 |
End bp | 769275 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797682 |
Product | alpha amylase catalytic region |
Protein accession | YP_002948856 |
Protein GI | 239826232 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAATC GATTGCTCGC TCTGTTTATA CTTCCGTTTC TTCTTTTTTA TGCGTTTCCG GTACAAGCCG CTGAAAAAGA AGAACGAACC TGGCAGGATG AAGCAATTTA TTTCATCATG GTTGACCGCT TTAACAACAT GGACCCAAGC AATGATTACA ATGTCAACGT CAATGACCCG AAAGGCTATT TCGGCGGTGA TTTAAAAGGA GTGACTGCCA AACTGGACTA CATTAAAGAG ATGGGGTTTA CGGCGATTTG GCTCACGCCG ATTTTCAAAA ACGAACCGGG CGGGTACCAT GGCTATTGGA TTCAAGACTT TTACAAAGTC GATCCGCATT TTGGCACGAT GGAAGATTTA AAAACGTTAG TGAAAGAAGC GCATAAACGC AACATGAAGG TGATTCTGGA TTTTGTCGCC AACCATACCG GCTACCACCA TCCATGGCTG AACGACCCAG CGAAGAAAGA TTGGTTTCAT GAGAAAAAAG AAATTTTCGA CTGGAATGAC CAAAAACAAT CAGAAAATGG ATGGATATAC GGACTGCCTG ATTTAGCGCA AGAAAACCCT GAGGTGAAGC GGTATTTAAT TGATGCGGCA AAGTGGTGGA TCAAACAGAC CGATATTGAC GGCTATCGTT TAGATGCTGT TCGCCATGTG CCAAAATCAT TTTGGAAAGA GTTTTCCAAA GAAGTAAAAT CAGTTAAAAA AGACTTCTTG CTGCTTGGAG AAGTATGGGC TGATGATCCG CGCTATATTG CGGATTACGG AAAATACGGA ATTGACGGTT TCATCGATTA CCCGCTTTAC AATGCGGTAA CGACGACATT GACGAAACGG GATCAATCAT TGCGGCCGCT TTATGACGTA TGGGAGTATA ATAAAACATT TTACGACCGT CCGTATTTGC TGGGAACATT TTTAGACAAC CACGATACGG TCCGGTTTAC GAAGCTCGCT TTGGACAACA AACAAAACCC GATTTCCAGA ACAAAGCTTG CTTTATCATA TTTATTCTCA GCTCCTGGCA TTCCGATTAT GTATTATGGA ACGGAAATTG CGATGAACGG GGGAGAAGAC CCGGATAACC GCCGTTTAAT GGATTTTCGC GCCGACCGGG AAATCATTGA TTACATTAAA AAACTTGGCG AATTGCGGCA AAAACTTCCT TCCTTGCGGC GCGGTGATTT TACGTTGCTG TATGAAAAAG ACGGCATGGC CGTATTTAAG CGCCATTACA AAGACGAAAC AACGGTGATC GCCATTAATA ATACAGGAAA AACACAAAAG GTGCATATCA CGAATGACCA ACTGACGCCG GGAAAAGAGC TGCGCGGGCT GCTTGCCGGC GATTTAGTGC GGAGCGGCCG CGATGGCTAT GATATTATTA TCAATCGCGA AACAGCGGAA ATTTACGCGC TTGCCGATAA AACAGGGATC AATATTCCTT TTATCATGGC GATTGTCGCT GTTTACGTAT TATTTATCCT ATTTTTGTAC CTTGTTAAAA GGCGGTCGAA ACAGGCGACG TAA
|
Protein sequence | MGNRLLALFI LPFLLFYAFP VQAAEKEERT WQDEAIYFIM VDRFNNMDPS NDYNVNVNDP KGYFGGDLKG VTAKLDYIKE MGFTAIWLTP IFKNEPGGYH GYWIQDFYKV DPHFGTMEDL KTLVKEAHKR NMKVILDFVA NHTGYHHPWL NDPAKKDWFH EKKEIFDWND QKQSENGWIY GLPDLAQENP EVKRYLIDAA KWWIKQTDID GYRLDAVRHV PKSFWKEFSK EVKSVKKDFL LLGEVWADDP RYIADYGKYG IDGFIDYPLY NAVTTTLTKR DQSLRPLYDV WEYNKTFYDR PYLLGTFLDN HDTVRFTKLA LDNKQNPISR TKLALSYLFS APGIPIMYYG TEIAMNGGED PDNRRLMDFR ADREIIDYIK KLGELRQKLP SLRRGDFTLL YEKDGMAVFK RHYKDETTVI AINNTGKTQK VHITNDQLTP GKELRGLLAG DLVRSGRDGY DIIINRETAE IYALADKTGI NIPFIMAIVA VYVLFILFLY LVKRRSKQAT
|
| |