Gene Information |
Locus tag | GWCH70_2024 |
Symbol | |
ID | 7978977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2083069 |
End bp | 2083914 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644798846 |
Product | Cof-like hydrolase |
Protein accession | YP_002950016 |
Protein GI | 239827392 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
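The start and end coordinates, the gene length, and the protein length in the table above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption, although it reproduces the listed 846 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2083069, 2083914)
print(length, protein_length(length))
```

With the values in this record, the sketch yields 846 bp and 281 aa, which agrees with the Gene Length and Protein Length fields.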
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.4946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTAATCG CATTGGATAT GGACGGGACG CTGCTCAATT CGAATGGACA GGTAAGTAGA AGAAACAAAG AGGCCATTGT GACAGCGCAA AAGCAAGGCC ATATCGTGGC GATCGTGACG GGCAGGGCGT ATAAAGACGC GCGCGCGCCG CTTCGGGATG CGGGGCTTGT TTGCCCGATT ATGAGTTTAA ATGGCGCGGT GATGACATTA GAAGATGGGA CGGTGCTTGG CGATGTGCCG CTTGACAAGG AAAAGTTGAT TCCGGCGCTC GAATGGGTGC GCGCGCAGCC GGATTTATAT TGCGAAATTT ATACGGGCGA TGCCGTCTAT GTCGGGCTCC ATAACCGCGC GCATCTTGAG GCGATGGCAG AAAAGGCAAG CGATATCGCG CCTGAATTAA AGCGCATCGT CGAAAAGCAG TTCCAGCAGG CGCGCGTGAC GTATGTCGAT GACATCCGCG CCATCTGGGA GGAGCGGCAA ACCGTGTTTT ACAAGGTGCT CATTTTTTCT CTCGATCAAG AACGTTTACA AGAAGCGGCC GCCCAGTTTG CCGCCATCTC CGGCATTACT GTCACCTCGT CGCATCCAAA CAACATCGAA ATTAACCATG AGCAGGCGAC GAAAGGGGAG GCGCTTGTAA AACTGGCCGC TCATTACGGC ATCGACATGA AAGATACGGT TGTTTTCGGT GACAGCCATA ACGATTTATC GATGTTCGCC GTCGCTGGAT ACCGCGTCGC GATGGAAAAC GCCGCACCGG GATTAAAAGA AGTCAGCGAC ATGGTCACCG CGTCACACGA GGAAGACGGC GTGGCGGTCG TATTGGAAGA GCTAATTGGC AAGTGA
|
Protein sequence | MLIALDMDGT LLNSNGQVSR RNKEAIVTAQ KQGHIVAIVT GRAYKDARAP LRDAGLVCPI MSLNGAVMTL EDGTVLGDVP LDKEKLIPAL EWVRAQPDLY CEIYTGDAVY VGLHNRAHLE AMAEKASDIA PELKRIVEKQ FQQARVTYVD DIRAIWEERQ TVFYKVLIFS LDQERLQEAA AQFAAISGIT VTSSHPNNIE INHEQATKGE ALVKLAAHYG IDMKDTVVFG DSHNDLSMFA VAGYRVAMEN AAPGLKEVSD MVTASHEEDG VAVVLEELIG K
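The protein sequence follows from the gene sequence under translation table 11, with the GTG start codon read as methionine. The sketch below illustrates this in plain Python. It assumes the sequence is given 5' to 3' on the coding strand, as shown in this record, and it treats only ATG, GTG, and TTG as initiators (table 11 permits a few others). The helper name `translate_cds` is hypothetical, and only the first 30 nt of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over TCAG x TCAG x TCAG.
# Table 11 uses the same codon-to-amino-acid mapping as the standard code;
# it differs in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces between 10-base blocks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a trailing stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


first_30_nt = "GTGTTAATCG CATTGGATAT GGACGGGACG"
print(translate_cds(first_30_nt))
```

For the first 30 nt this gives MLIALDMDGT, matching the first block of the protein sequence above. Passing the full gene sequence should give the complete protein, provided it is pasted without errors.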
|
| |