Gene GWCH70_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0017 
Symbol 
ID7978456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp25162 
End bp26448 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content45% 
IMG OID644796972 
Productglycoside hydrolase family 18 
Protein accessionYP_002948225 
Protein GI239825601 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000350058 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATTC ACGTTGTACA AAGCGGGCAA ACGTTAAGTG GAATTGCTCA AGCGTATAAT 
ACGACTCCTG AAGAAATTAT TCGCGCAAAC GAGCTCCCAA ATCCAAACGA TCTTGTTGTA
GGTCAAGCGA TTGTTATTCC GATTGTTGGA AGCTTTTATT GGGTTCAGCG CGGCGATAGT
TTATGGTCCA TCTCCCGGAA ATTTTCCATT CCTGCGCAGC GTCTTGCGGA AATTAACCGC
ATTTCCTTAA ATAGCCCGCT GCAAGTTGGA CAACGTTTAT ACATTCCGCC AAAAGCAAAA
CGGAGAGCCG AGTTTAACGG ATACATTGAG CCGCGAGGAA CGACTGTCAG CCCAGCGCTG
GAGGCAAGTG CTCGTCAAGC AGCCCCATAT TTAACATACT TAGCTCCGTT TCAGTTCCAA
ATTCAGCGAA ATGCAGCACT CAAAGAACCT CCATTGAATA ATTTCCCATC TATCGCTCGC
GCCAATCGCG TTACTTTAAT CATGGTCGTT ACCAATATCG AAAACGATCA ATTTAGCGAT
GAACTAGGAG CGCTTATTTT AAACAACGAA CAATTACAAA ACCGTCTATT AGAAAACATT
GTAACAACAG CAAAAAAATA CGGATTTCGC GATATTCACT TTGATATGGA ATATTTGCGT
CCCGAAGACC GCGAAGCGTA TAACGCGTTT TTGCGAAAAG CAAAACGGCG ATTTGAACGT
GAAGGGTGGC TTATGTCTAC CGCGTTGGCG CCAAAAACAA GCGCGACACA AAAAGGCCGC
TGGTATGAAG CGCATGATTA CCGCGCCCAC GGACAAATTG TTGATTTTGT CGTCATTATG
ACATATGAAT GGGGATATAG CGGTGGACCG CCAATGGCGG TTTCTCCAAT TGGCCCTGTT
CGGCAGGTGA TCGAATATGC CATTTCCGAA ATGCCCGCTT CTAAAATTAT GATGGGACAA
AACTTATACG GATACGATTG GACGCTTCCA TACGTACCGG GTGGACCATA TGCACGGGCG
ATTAGTCCGC AACAAGCGAT CCGTCTTGCT GCTCAATACA ATGTCGCCAT TGAATATGAC
ACCAAAGCGC AAGCTCCTCA TTTCCGCTAT CGGGACGAAA ACGGAAAAGA ACACGAAGTT
TGGTTTGAAG ATGCCCGTTC GATTCAAGCA AAATTTGACT TAGTAAAAGA ACTCGGCCTA
CGGGGAATCA GCTATTGGAA ACTAGGATTA GATTTTCCGC AAAATTGGTT GCTGCTAACA
GATAACTTTA CTGTTGTAAA AAGGTAA
 
Protein sequence
MQIHVVQSGQ TLSGIAQAYN TTPEEIIRAN ELPNPNDLVV GQAIVIPIVG SFYWVQRGDS 
LWSISRKFSI PAQRLAEINR ISLNSPLQVG QRLYIPPKAK RRAEFNGYIE PRGTTVSPAL
EASARQAAPY LTYLAPFQFQ IQRNAALKEP PLNNFPSIAR ANRVTLIMVV TNIENDQFSD
ELGALILNNE QLQNRLLENI VTTAKKYGFR DIHFDMEYLR PEDREAYNAF LRKAKRRFER
EGWLMSTALA PKTSATQKGR WYEAHDYRAH GQIVDFVVIM TYEWGYSGGP PMAVSPIGPV
RQVIEYAISE MPASKIMMGQ NLYGYDWTLP YVPGGPYARA ISPQQAIRLA AQYNVAIEYD
TKAQAPHFRY RDENGKEHEV WFEDARSIQA KFDLVKELGL RGISYWKLGL DFPQNWLLLT
DNFTVVKR