Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0909 |
Symbol | |
ID | 7979343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 966979 |
End bp | 967785 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644797867 |
Product | Cof-like hydrolase |
Protein accession | YP_002949040 |
Protein GI | 239826416 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000198866 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTAAAC TGATTGTCAG CGATTTGGAC GGAACGCTGC TTGGATTGGA TAACCACGTT AAAAAAGAAG ATAAACTCGC CATTCAATTA GCAGTGGAAC AAGGAATGGA TTTTGCAATT GCATCCGGCC GAATGGATAA TGAAATTTTA GAAGTGTTGC AAGAATTGGA ACAAAAAGCG CATCGTATTA GTCAAAATGG GGCGTTCATT TACACAAAAA CGAATTTGCC GCTTCATGCA ACTACTTTTT CTCCGCAGCT AGCACAACAA ATTTATCGGC AAGCAAAGGA ATTGGAAGCG ATTGTACTTG TTTGCAATGA AAATACATAT TTTATTGAAG AAACGAACGA AAATACAGCG GAAATAGAAA AACGTCTATT TTATGGGCTA TATGAATGTC CAAACATCCT TGATTCCATC GGCAAAGATA TTCATCCATC GAAAATTACT GTATTAGCAG CAAATGAACA AATTGTCGAA TTCGAACAAA AAATAACGCG TCAGTTTGGC AATCACATCG ATACGTTTAT TTCAGAACCG AACTGTCTTG ATATTATGCC AAAAAACATT AGCAAAGGAA ATGCGGTTCG CACTTTAATG GAACATCTTC GCTTGCAGCC GAATGAAATC GCTTGTTTCG GCGATGCGTT TAACGATACT CCAATGTTTC GTCTAACCCC TTACAGCTTT GCCATGTCTC ATGCTCATCC TAATGTAAAA AAAGAAGCAC AATATGTGAT CAATTCAGTA AGTGAAGGGA TACACATGAT TCTTAAAGAC ATTCCTACAC CAAATATGAT GCAATAA
|
Protein sequence | MIKLIVSDLD GTLLGLDNHV KKEDKLAIQL AVEQGMDFAI ASGRMDNEIL EVLQELEQKA HRISQNGAFI YTKTNLPLHA TTFSPQLAQQ IYRQAKELEA IVLVCNENTY FIEETNENTA EIEKRLFYGL YECPNILDSI GKDIHPSKIT VLAANEQIVE FEQKITRQFG NHIDTFISEP NCLDIMPKNI SKGNAVRTLM EHLRLQPNEI ACFGDAFNDT PMFRLTPYSF AMSHAHPNVK KEAQYVINSV SEGIHMILKD IPTPNMMQ
|
| |