Gene Mboo_0240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0240 
Symbol 
ID5410555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp224096 
End bp225679 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content57% 
IMG OID640867456 
ProductO-sialoglycoprotein endopeptidase/protein kinase 
Protein accessionYP_001403405 
Protein GI154149787 
COG category[O] Posttranslational modification, protein turnover, chaperones
[T] Signal transduction mechanisms 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity
[COG3642] Mn2+-dependent serine/threonine protein kinase 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.211158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.534479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGT TTGGGCAGAT ACTGGGCATT GAGGGAACAG CATGGAACCT CAGTGCTGCT 
CTTTTTGATC GTGATCTTCT GGCCCTGTGT TCCCGTCCCT ACTCACCCGA ACATGGTGGC
ATCCACCCGA GGGAGGCAGC CCAGCACCAT GCTTCTGCCA TGAGGGAGGT GATTGCCACT
GTCACAAAAG AACCGGAGAA GATCACCGGC ATTGCATTTT CCCAGGGGCC CGGCCTTGGC
CCCTGCCTGA GAACGGTTGC AACGGCCGCT CGTTCGCTTG CCCTTGCTCT CGAAGTTCCG
CTGATCGGGG TCAACCATTG CGTAGCCCAC GTGGAGATCG GAAGTTGGGC AACTGGGTGC
AGGGATCCCA TTGTCCTGTA CGCGAGCGGA GCGAATACTC AGGTGATCGG GTACCTCAAC
GGGCGGTACC GGATCTTTGG GGAGACGCTC GATATCGGGA TCGGAAACGC CCTCGACAAG
TTTGCGCGGG CAAAAGACCT GCCCCACCCG GGGGGGCCGC TGATCGAGGC ACAGGCAAAA
TCCGGGACCT ATTTTGAACT CCCCTATACC GTCAAAGGCA TGGATCTCGC ATTCTCCGGT
CTGGTCTCGG CCGCAAAGGA TTCCAGAAAA TTGCTCTCCG ATGTCTGCTG CAGCCTGCAG
GAAACTGCGT TTGCCATGTG CGTGGAAGTG ACCGAACGGG CGCTCTCGCT GACCGGGAAA
GACGAGGTGC TGCTCGTGGG CGGCGTGGGT GCCAATGCAC GCCTGCAGGA GATGCTTCGG
ATCATGTGCG AAGAGAGGGG TGCACACTTC TTTGTCCCCG AACGAAAATA TCTCGGGGAC
AATGGCGCGA TGATCGCATA CACCGGCAAG CTGATGCTTG AAAGCGGCCA GACGCTTGCC
ATCGAAAACT CGCAGGTCAA CCCCTCCTTC CGTTCAGACG ATGTTGAAGT GACCTGGAAG
CATGAAGCCA CACCTGTACC AGGACAGGAG AATCGCGGCA GGACTGACCA GAAACGCGGT
GCAGAAGCGG TTGTCCTGTT CAGGAACGGG AGTGCGATCA AACAAAGGCT CTCCAAAACG
TACCGCGTAC CTGCACTGGA CCGGAAACTG ATCACGGAGC GCACACGGGC CGAGGCCCGT
ATCATCCATA TGGCCCGGAA AGGAGGCGTT CCCACACCTA TCATGAGCGA CATCACGGGA
GATTCCATTG TCATGGAAGA GATCACCGGC ACGCTCCTGA CCCATGCACT CTCAGAAGCA
AACTGCGAAA AAGCCGGAGA GATGACAGGA CAGTTGCACA CGGCCGGAAT CATGCACGGG
GATCTCACCA CAAGCAACAT GATCCTCCGG GACACAGATG GCAAATGCGT GCTCATCGAT
TTCGGCCTTT CTCAGGTCAC GTCCGAGATT GAGCAGAGAG GCGTGGATAT CCATGTGATG
TTTCAGACCC TCACCAGTAC CCATCCAAAA GATGCCGGCC GGTTGCAGGC TGCATTTGCC
AAAGGGTACT TGGCAACGTT TGCCAGTGCA CGTGAGGTTC TGGAACGCGA AAAGGAGATC
GAACAGCGGG GGAGATACCT GTGA
 
Protein sequence
MPEFGQILGI EGTAWNLSAA LFDRDLLALC SRPYSPEHGG IHPREAAQHH ASAMREVIAT 
VTKEPEKITG IAFSQGPGLG PCLRTVATAA RSLALALEVP LIGVNHCVAH VEIGSWATGC
RDPIVLYASG ANTQVIGYLN GRYRIFGETL DIGIGNALDK FARAKDLPHP GGPLIEAQAK
SGTYFELPYT VKGMDLAFSG LVSAAKDSRK LLSDVCCSLQ ETAFAMCVEV TERALSLTGK
DEVLLVGGVG ANARLQEMLR IMCEERGAHF FVPERKYLGD NGAMIAYTGK LMLESGQTLA
IENSQVNPSF RSDDVEVTWK HEATPVPGQE NRGRTDQKRG AEAVVLFRNG SAIKQRLSKT
YRVPALDRKL ITERTRAEAR IIHMARKGGV PTPIMSDITG DSIVMEEITG TLLTHALSEA
NCEKAGEMTG QLHTAGIMHG DLTTSNMILR DTDGKCVLID FGLSQVTSEI EQRGVDIHVM
FQTLTSTHPK DAGRLQAAFA KGYLATFASA REVLEREKEI EQRGRYL