Gene GWCH70_0219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0219 
Symbol 
ID7977976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp243115 
End bp244128 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content49% 
IMG OID644797213 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_002948416 
Protein GI239825792 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCATG ACATATATGT TCTTGGAATT GAAACGAGCT GCGATGAAAC CGCGGCGGCA 
GTTGTGAAAA ATGGAACGGA AATTTTGTCT AATATTGTGG CTTCCCAAAT GGAAAGCCAT
AAACGATTTG GCGGAGTCGT ACCAGAAATC GCCTCGCGCC ACCATGTCGA GCAGATTACG
CTCGTTATTG AAGAAGCGAT GCGTAAGGCG GATGTTTCGT TTCAACAGTT GAGTGCTATT
GCAGTGACGC AAGGTCCAGG ATTAGTCGGC GCTCTTTTGA TTGGTGTGAA CGCGGCGAAA
GCGTTAGCGT TTGCCCATGG TCTTCCGCTT GTAGGCGTTC ATCATATCGC CGGGCATATT
TACGCGAATC GGCTTGTTAC GGAAATGAAA TTTCCGTTGT TATCTCTTGT GGTTTCAGGA
GGACATACGG AGCTTGTGTA CATGGAGGGG CATGGCAAGT TTCAAGTCAT CGGCGAAACG
AGAGACGATG CGGCGGGAGA AGCGTATGAC AAGGTGGCGA GAGCATTAAA CCTTCCTTAT
CCGGGCGGAC CGCATATTGA CCGTCTTGCA CAGGAAGGAA AGGTGACCAT TGATTTGCCT
CGTGCATGGC TGGAAGAAGG GTCATATGAT TTTAGCTTCA GCGGGTTGAA ATCGGCGGTA
TTAAACACGC TTCATAACGC GAATCAGCGT GGAGAAATCA TCGACCCGAA AGATATGGCA
GCCAGCTTTC AAGCGAGCGT AATCGATGTG CTTGTGACCA AAACCGTCAA CGCAGCAAAA
GAATATAATG TCCGTCAAGT GCTGCTTGCG GGCGGAGTCG CCGCAAACAG AGGATTACGA
GCGGAGTTGG AGCGAAAAAT GGCTGAATTA GACCATATCG AATTGGTTAT TCCGCCTTTA
TCGCTTTGTA CCGACAATGC GGCGATGATT GCGGCGGCAG GAACTGTTTT GTTTGAACAA
GGCAAACGCG CAGATATGGC GTTAAACGCG GATCCAAGTT TAGAGCTAGA TTAA
 
Protein sequence
MNHDIYVLGI ETSCDETAAA VVKNGTEILS NIVASQMESH KRFGGVVPEI ASRHHVEQIT 
LVIEEAMRKA DVSFQQLSAI AVTQGPGLVG ALLIGVNAAK ALAFAHGLPL VGVHHIAGHI
YANRLVTEMK FPLLSLVVSG GHTELVYMEG HGKFQVIGET RDDAAGEAYD KVARALNLPY
PGGPHIDRLA QEGKVTIDLP RAWLEEGSYD FSFSGLKSAV LNTLHNANQR GEIIDPKDMA
ASFQASVIDV LVTKTVNAAK EYNVRQVLLA GGVAANRGLR AELERKMAEL DHIELVIPPL
SLCTDNAAMI AAAGTVLFEQ GKRADMALNA DPSLELD