Gene Mthe_0247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0247 
Symbol 
ID4462070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp242387 
End bp243361 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content59% 
IMG OID639699253 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_842684 
Protein GI116753566 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGTCC TTGGTATTGA GGGCACTGCC TGGAATCTCA GTGCAGCGAT CGTGAATGAG 
GATGATGTGA TCATAGAGAG GGCTGCAACA TACACGCCTG CAAGGGGAGG TATTCACCCA
AGAGAGGCGG CGCAGCACCA CTCAGAGCAC ATCGGCCCGC TGCTGAGAGA GGTGATCCAG
GGCGCCAGAG ATCTCGGAAT AAAGATCGAC GGAGTCGCGT TCTCTCAGGG GCCCGGACTC
GGGCCGTGCC TGAGGACGGT CGCGACTGCG GCCAGGGTTC TCGCTTTGAA GCTCAATGTC
CCTCTCGTCG GCGTGAACCA CTGCATAGCT CATATCGAGA TTGGGAAATG GAAGACCGGA
GCCAGGGATC CTGCGGTGCT CTACGTGAGC GGCGGAAACT CCCAGGTTCT CGCCCTCAGG
CGCGGTCGCT ACAGGATCTT TGGCGAGACC CTGGACATCA GCGTCGGGAA CATGCTTGAC
AAGTTCGCAC GCTCAGTCGG TCTTCCGCAC CCTGGAGGGC CGCGGATAGA GGAGCTCGCC
AGGAATGCAA AGGAATACAT ACCGCTTCCC TACACCGTCA AGGGAATGGA CTTCTCCTTC
TCAGGGCTTG CGACCGCTGC AGCAGAGGCC GCCAGGAGAT ACGATCTGGA GGACGTCTGC
TACAGCCTCC AGGAGACCGC ATTCGCGATG CTCGTCGAGG TCACAGAGCG CGCGATGGCA
CATGCTGAGA AGAAGGAGGC AATGCTTGTC GGTGGAGTCG GGGCGAACCG GCGGCTCGGA
GAGATGCTTA GGCTGATGTG TGAGGAGCGC GGCGCGAGAT TTTATCTCCC TGAAAGGCGT
TTCATGGGCG ATAACGGATC GATGATAGCA TATACAGGGC TGGTGATGCT CAAGAGCGGC
GTGAGCACGC CGATTGAGAG CTCAGGCGTC AGGCCTAATT ACAGGACAGA TGAGGTCGAG
GTGAGATGGG CCTGA
 
Protein sequence
MYVLGIEGTA WNLSAAIVNE DDVIIERAAT YTPARGGIHP REAAQHHSEH IGPLLREVIQ 
GARDLGIKID GVAFSQGPGL GPCLRTVATA ARVLALKLNV PLVGVNHCIA HIEIGKWKTG
ARDPAVLYVS GGNSQVLALR RGRYRIFGET LDISVGNMLD KFARSVGLPH PGGPRIEELA
RNAKEYIPLP YTVKGMDFSF SGLATAAAEA ARRYDLEDVC YSLQETAFAM LVEVTERAMA
HAEKKEAMLV GGVGANRRLG EMLRLMCEER GARFYLPERR FMGDNGSMIA YTGLVMLKSG
VSTPIESSGV RPNYRTDEVE VRWA