Gene Mlab_0653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0653 
Symbol 
ID4795757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp625013 
End bp626590 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content57% 
IMG OID640099314 
ProductO-sialoglycoprotein endopeptidase/protein kinase 
Protein accessionYP_001030093 
Protein GI124485477 
COG category[O] Posttranslational modification, protein turnover, chaperones
[T] Signal transduction mechanisms 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity
[COG3642] Mn2+-dependent serine/threonine protein kinase 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAA AAAGGGTCCT CGGCATTGAA GGAACTGCAT GGAACTTCAG TGCCGCTGTT 
TTTGCAGAAG ATTTAGTTTG TCTTCACTCT GCACCGTATG TTCCGCCAAC CGGCGGTATT
CACCCGCGTG AAGCTGCCCA GCATCATGCG TCTGTTGCGT CGGATGTTAT CAGAAAAGCT
CTGGATGAAG CGGGAGAGAA AATCGACGCC GTTGCTTTTT CCATTGGCCC GGGTCTTGGC
CCCTCGCTTC GAATCGCGGC TACGACAGCC CGCACTCTTG CGCTGAAGCT TGGCGTCCCG
CTTATCGGCG TGAATCACTG TGTGGCCCAT GTCGAGATCG GCAGATGGTA CACAAAATTC
GCCGACCCTA TCGTGCTCTA CGCATCGGGT GCAAACACCC AGGTCCTTGG CTTTCTGAAC
GGAAAGTACC GGATCTTCGG CGAGACGCTG GATATCGGAC TCGGCAACGC ACTCGACAAA
TTCGCGCGGA GTCATAACCT CCCCCACCCA GGCGGCCCCA TCATCGAAAA GATGGCGAAG
GACGGATCTT ATATTCACCT TCCCTACACG GTGAAAGGCA TGGACCTTGC ATTCTCCGGC
TTGATGAGTG CCGCAAAGGA AGCGACCCAG CGGGGCGAAT CCATGGAAGA TGTCTGCTTC
AGTTTCCAGG AAACGGCCTT TGCCATGTGT GTGGAAGTGA CCGAACGGGC CCTTGCCCAC
ACAGGAAAGG ATGAGGTCAT TTTAGTCGGC GGCGTCGGGG CAAACGCACG CCTCCAGGAG
ATGCTCGCAA AAATGTGCGA GGAACGCGGA GCGAAGTTCA TGGCTCCCCC AAGAGTCTAT
ATGGGTGACA ACGGCGCGAT GATCGCCTAC ACCGGAAAGA TCATGCTCGA AGCCGGATCC
ACGATCCCGA TTGCTGAATC CGTGGTGAAT CCCGGATTCA GATCCGATCA GGTCGAGGTG
ACCTGGCGGC ACGATGCCGG CCAGCTCTTC GCTCCCGGCC AATCGGAAAC TGCCGAGCGC
GGTGCGGAGG CTTCGGTCAA TCTCACGGAC AAGGACGTGG TGAAAACCCG TCTTGCCAAA
GGATACCGGG TACCCGAACT CGACCGGCAT CTCATTGCCG AACGAACCCG GGCCGAAGCC
CGTGCGATTT CAGCAGCGAG ACGCGGCGGC GTGCCGGTTC CGGTGATCCG CGATGTGACC
GATCACGAGA TCGTGATGGA AAAACTCGAT GGTGATGTTC TCAAGTACGT CATGAACGAA
GAGTACGCCA AAGGTGCGGG GCTCACGGTT GGTAAACTCC ACAAGGCGGG GATAACGCAC
GGAGACCTCA CGACCTCGAA CATGATCTGG CATAACGACC GCGTGTATCT GATCGACTTT
GGTCTCTCGC AGATGACGGA AGAGATCGAA CCGCGCGGCG TCGATCTGCA CGTTCTCTTC
CAGACGCTGG AAAGCACGAC CGAAAATCCG GAGACGCTCA AATCCGCATT CATCAATGGA
TACTGCGCGG CGTTTTCCGA AGCCGAAAAC GTGATCCGGC GCGAACACGA GATCGAACTG
CGCGGGAGAT ACTTATGA
 
Protein sequence
MPEKRVLGIE GTAWNFSAAV FAEDLVCLHS APYVPPTGGI HPREAAQHHA SVASDVIRKA 
LDEAGEKIDA VAFSIGPGLG PSLRIAATTA RTLALKLGVP LIGVNHCVAH VEIGRWYTKF
ADPIVLYASG ANTQVLGFLN GKYRIFGETL DIGLGNALDK FARSHNLPHP GGPIIEKMAK
DGSYIHLPYT VKGMDLAFSG LMSAAKEATQ RGESMEDVCF SFQETAFAMC VEVTERALAH
TGKDEVILVG GVGANARLQE MLAKMCEERG AKFMAPPRVY MGDNGAMIAY TGKIMLEAGS
TIPIAESVVN PGFRSDQVEV TWRHDAGQLF APGQSETAER GAEASVNLTD KDVVKTRLAK
GYRVPELDRH LIAERTRAEA RAISAARRGG VPVPVIRDVT DHEIVMEKLD GDVLKYVMNE
EYAKGAGLTV GKLHKAGITH GDLTTSNMIW HNDRVYLIDF GLSQMTEEIE PRGVDLHVLF
QTLESTTENP ETLKSAFING YCAAFSEAEN VIRREHEIEL RGRYL