Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0219 |
Symbol | |
ID | 7977976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 243115 |
End bp | 244128 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644797213 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002948416 |
Protein GI | 239825792 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCATG ACATATATGT TCTTGGAATT GAAACGAGCT GCGATGAAAC CGCGGCGGCA GTTGTGAAAA ATGGAACGGA AATTTTGTCT AATATTGTGG CTTCCCAAAT GGAAAGCCAT AAACGATTTG GCGGAGTCGT ACCAGAAATC GCCTCGCGCC ACCATGTCGA GCAGATTACG CTCGTTATTG AAGAAGCGAT GCGTAAGGCG GATGTTTCGT TTCAACAGTT GAGTGCTATT GCAGTGACGC AAGGTCCAGG ATTAGTCGGC GCTCTTTTGA TTGGTGTGAA CGCGGCGAAA GCGTTAGCGT TTGCCCATGG TCTTCCGCTT GTAGGCGTTC ATCATATCGC CGGGCATATT TACGCGAATC GGCTTGTTAC GGAAATGAAA TTTCCGTTGT TATCTCTTGT GGTTTCAGGA GGACATACGG AGCTTGTGTA CATGGAGGGG CATGGCAAGT TTCAAGTCAT CGGCGAAACG AGAGACGATG CGGCGGGAGA AGCGTATGAC AAGGTGGCGA GAGCATTAAA CCTTCCTTAT CCGGGCGGAC CGCATATTGA CCGTCTTGCA CAGGAAGGAA AGGTGACCAT TGATTTGCCT CGTGCATGGC TGGAAGAAGG GTCATATGAT TTTAGCTTCA GCGGGTTGAA ATCGGCGGTA TTAAACACGC TTCATAACGC GAATCAGCGT GGAGAAATCA TCGACCCGAA AGATATGGCA GCCAGCTTTC AAGCGAGCGT AATCGATGTG CTTGTGACCA AAACCGTCAA CGCAGCAAAA GAATATAATG TCCGTCAAGT GCTGCTTGCG GGCGGAGTCG CCGCAAACAG AGGATTACGA GCGGAGTTGG AGCGAAAAAT GGCTGAATTA GACCATATCG AATTGGTTAT TCCGCCTTTA TCGCTTTGTA CCGACAATGC GGCGATGATT GCGGCGGCAG GAACTGTTTT GTTTGAACAA GGCAAACGCG CAGATATGGC GTTAAACGCG GATCCAAGTT TAGAGCTAGA TTAA
|
Protein sequence | MNHDIYVLGI ETSCDETAAA VVKNGTEILS NIVASQMESH KRFGGVVPEI ASRHHVEQIT LVIEEAMRKA DVSFQQLSAI AVTQGPGLVG ALLIGVNAAK ALAFAHGLPL VGVHHIAGHI YANRLVTEMK FPLLSLVVSG GHTELVYMEG HGKFQVIGET RDDAAGEAYD KVARALNLPY PGGPHIDRLA QEGKVTIDLP RAWLEEGSYD FSFSGLKSAV LNTLHNANQR GEIIDPKDMA ASFQASVIDV LVTKTVNAAK EYNVRQVLLA GGVAANRGLR AELERKMAEL DHIELVIPPL SLCTDNAAMI AAAGTVLFEQ GKRADMALNA DPSLELD
|
| |