Gene Information |
Locus tag | GWCH70_0017 |
Symbol | |
ID | 7978456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 25162 |
End bp | 26448 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644796972 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_002948225 |
Protein GI | 239825601 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
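The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record, although both fit the listed values. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(25162, 26448)
print(length, protein_length(length))
```

With the values listed in this record, the two results agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields.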
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000350058 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAATTC ACGTTGTACA AAGCGGGCAA ACGTTAAGTG GAATTGCTCA AGCGTATAAT ACGACTCCTG AAGAAATTAT TCGCGCAAAC GAGCTCCCAA ATCCAAACGA TCTTGTTGTA GGTCAAGCGA TTGTTATTCC GATTGTTGGA AGCTTTTATT GGGTTCAGCG CGGCGATAGT TTATGGTCCA TCTCCCGGAA ATTTTCCATT CCTGCGCAGC GTCTTGCGGA AATTAACCGC ATTTCCTTAA ATAGCCCGCT GCAAGTTGGA CAACGTTTAT ACATTCCGCC AAAAGCAAAA CGGAGAGCCG AGTTTAACGG ATACATTGAG CCGCGAGGAA CGACTGTCAG CCCAGCGCTG GAGGCAAGTG CTCGTCAAGC AGCCCCATAT TTAACATACT TAGCTCCGTT TCAGTTCCAA ATTCAGCGAA ATGCAGCACT CAAAGAACCT CCATTGAATA ATTTCCCATC TATCGCTCGC GCCAATCGCG TTACTTTAAT CATGGTCGTT ACCAATATCG AAAACGATCA ATTTAGCGAT GAACTAGGAG CGCTTATTTT AAACAACGAA CAATTACAAA ACCGTCTATT AGAAAACATT GTAACAACAG CAAAAAAATA CGGATTTCGC GATATTCACT TTGATATGGA ATATTTGCGT CCCGAAGACC GCGAAGCGTA TAACGCGTTT TTGCGAAAAG CAAAACGGCG ATTTGAACGT GAAGGGTGGC TTATGTCTAC CGCGTTGGCG CCAAAAACAA GCGCGACACA AAAAGGCCGC TGGTATGAAG CGCATGATTA CCGCGCCCAC GGACAAATTG TTGATTTTGT CGTCATTATG ACATATGAAT GGGGATATAG CGGTGGACCG CCAATGGCGG TTTCTCCAAT TGGCCCTGTT CGGCAGGTGA TCGAATATGC CATTTCCGAA ATGCCCGCTT CTAAAATTAT GATGGGACAA AACTTATACG GATACGATTG GACGCTTCCA TACGTACCGG GTGGACCATA TGCACGGGCG ATTAGTCCGC AACAAGCGAT CCGTCTTGCT GCTCAATACA ATGTCGCCAT TGAATATGAC ACCAAAGCGC AAGCTCCTCA TTTCCGCTAT CGGGACGAAA ACGGAAAAGA ACACGAAGTT TGGTTTGAAG ATGCCCGTTC GATTCAAGCA AAATTTGACT TAGTAAAAGA ACTCGGCCTA CGGGGAATCA GCTATTGGAA ACTAGGATTA GATTTTCCGC AAAATTGGTT GCTGCTAACA GATAACTTTA CTGTTGTAAA AAGGTAA
Protein sequence | MQIHVVQSGQ TLSGIAQAYN TTPEEIIRAN ELPNPNDLVV GQAIVIPIVG SFYWVQRGDS LWSISRKFSI PAQRLAEINR ISLNSPLQVG QRLYIPPKAK RRAEFNGYIE PRGTTVSPAL EASARQAAPY LTYLAPFQFQ IQRNAALKEP PLNNFPSIAR ANRVTLIMVV TNIENDQFSD ELGALILNNE QLQNRLLENI VTTAKKYGFR DIHFDMEYLR PEDREAYNAF LRKAKRRFER EGWLMSTALA PKTSATQKGR WYEAHDYRAH GQIVDFVVIM TYEWGYSGGP PMAVSPIGPV RQVIEYAISE MPASKIMMGQ NLYGYDWTLP YVPGGPYARA ISPQQAIRLA AQYNVAIEYD TKAQAPHFRY RDENGKEHEV WFEDARSIQA KFDLVKELGL RGISYWKLGL DFPQNWLLLT DNFTVVKR
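The relationship between the two sequences can be illustrated by translating the ends of the gene sequence and comparing them with the protein sequence. This is a minimal sketch, not the portal's own method. It assumes the gene sequence is already given 5' to 3' on the coding strand (the record lists the strand as "-", so no reverse complement is applied here). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a standard codon map is used. The `translate` helper is hypothetical.

```python
from itertools import product

# Codon map shared by the standard code and translation table 11,
# enumerated in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGCAAATTC ACGTTGTACA AAGCGGGCAA"))
print(translate("GTTGTAAAAAGGTAA"))
```

The first call yields the opening ten residues of the protein sequence (MQIHVVQSGQ) and the second yields its final four (VVKR), with the terminal TAA read as the stop codon.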