Gene Mboo_1572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1572 
Symbol 
ID5410097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1639467 
End bp1641965 
Gene Length2499 bp 
Protein Length832 aa 
Translation table11 
GC content62% 
IMG OID640868806 
Productpeptidase U32 
Protein accessionYP_001404732 
Protein GI154151114 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.812702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCTGA GCGGCAGACG GTTTGGGGCA CGGAAGTTCG CGCAGAACTT CTCCGAAGAA 
GAGATCGCAG GGGCGATCGC ATATGCCCAT GCCCGGGGGG TCCGGGTGTA TGTGACGGTC
AACACGCTGA TCCATGACCG GGAGCTCCCT GAAGCACTTG CGTACCTGGT CCGGCTGTAC
GCGGCCGGCG CGGATGCGGT GCTCGTGCAG GATGCAGGTC TTGCAGCGCT TGCAAAGGAG
ATCGTGCCGG ATCTCGTGCT CCATGCCTCA ACCCAGCTCA CGATCCATAA TGCCGAAGGG
GTGAGGTGGG CGCACACGAT GGGGTTCTCA CGTGTGGTGC TTGCCCGTGA GCTGCCCCTC
CATGAAGTGG AGGCGATTGC CCGGGCCACG TCAGATACCG GCGTTGGGCT CGAAGTTTTT
GCCCACGGGG CCCTCTGCTA CAGTTATTCC GGCCAGTGCC TCCTCTCATC GGTGATCGGC
GGGCGGAGCG GGAACCGGGG CATGTGCGCC CAGCCGTGCC GGAAACCCTA TGCCCTTGTT
ACGGCACAGA CAGACCGGTA CGGCCGGCCT GTTGATCCAA AAGACATGCC CGTGCCAGGT
CAGTATCTCC TCTCGCCAAA GGACCTCTGC ACGTACCGGG ATATTCCAAA ACTCGTGGAT
TCCACGGTAG CAGCCCTCAA GATCGAGGGC CGGATGAAAT CGGCAGAGTA CGTGGCAATC
GTGGTCTCGA CCTACCGGCG GGCACTCGAT GCTGCGGCTG CCGGTACCTT TGTTTCGGAC
GAAAATGAGA TCCGCGATCT TGCCCTTGCC TTCAACCGGG GTTTCACCCG GGGCTATCTC
TTTGGCGACA AAAAAGGAAA AATCATGGCC CGGGACCGGC CGGACAACCG CGGGGTCCTT
ATCGGTACGG TCATCAGGTA TGACCCAATA AAGAGGGTGG CAATCATATG TCCGGACAAG
CCCGTAACCC TTCGCCCCGG CGACGGTTTC CTCTTTTCTC TTCCAAAAAG CCCGGAGAAG
GAGTGGGGAT TTTCCCTCAA TACCGAACCG GTCATACTTC CGGAGGGGAT CGCACTGTTT
GTCCCCCGCC CGGCAGAGAA GGGAGAACAG GTCTGTCTCA CCTCTTCGAT CGATCTCCTG
GCCGGGGCCC GGCAGATCGT AAAACAGGAA AATCCCTTGC TGCATCACCC GGTCCCGGCA
GCGCTTGCTG CCGCAGTCAG CCCGGACGGG GACCTCACCC TTACCGGAAT CCTGTACCCT
CCGGGAAAAG CGCCGGTGGT TATATCTGCG GCCGGGGAGC TCCGGCTCGA ACCGGCAAAG
ACACGCCCGC TTACCCGGGA GCAGCTGGCC GGGCAGCTCG AAAAGACCGG GGGAACGCCC
TTTGTCCTTT CGGACCTGGC GCTTGAGTAT GACGGCACCC GCTTTGCCCC GGTCCGGGAG
ATCAACCGGG TCCGGCGGGA GTTTTTTGCC CGGGCAGAGG CAGCCCTTGT GGCGGCCTCG
CGCCCCGGGG AAAAAGCCGT CAAAGATGCA GAGCGGCGCC TGGACCGGTG GCGTACGGGA
AAAACAGCAC CGATATCCAC GGAACGCGCC GGGCGCACAC CGGAAATAAT TCTCTGGGCA
GACAACCTTG AGGCGATTGA AGCCGGTGTA CAGGCCGGGG CAGGAACAAT CTGCTATGAA
CCCTGCGGAA AGGGAACGGA GCCGGAGCTG GCAGCCGCGA TCGATTCAGC CCTTGACCTC
TGCAAGGCAC AGGACACCCG GCTTATCTGG AAACTGCCCC GGATCACCCG GGAAGCAGAG
ATCATGCTTA TACGTCAGGT CCTGCCGCGG CTGTACGCAG CAGGCCTGCG GACCTGCATG
GCAGACAACC CGGGCGCGGC CATTGCAGTT GCAGATGCCG TGCCGGGCAT GGAACTGGCA
GGCTCGATCG GGCTTAACGT TTTCAATGCG GAGACCGCCC GGGCTTTTGG GACCCTGCCC
TTTACGCTGC TCACGCTCTC GCCGGAATTG TCCGCAACTG AGATCGAAGA CCTCGCACAC
GCTCACGGGG AGGGCCCGGC GCTTGCAGTA TTTGCGCAGG GAAATCTTGA GGCCATGGTT
ACCGGCGACT GTCTCATCTC CCCTCTGGAG CGGTGCCATG GCGGTACCGG GCCCTGTTCC
AAAGGGAGGT GGTATGGGAT CCGGGACGGG ACCGGCCATC TCCTGCCGGT CCGGACCGAC
AGTACCTGCC AGGGCCATAT CTTCAATGCT GCCGAGACCT GCCTTGTGGC TGCGGTACCG
GACCTTGTCC GGCAGGGGAT TCGTGGCTTT GTGATCGATG CCCGGGGGCG GACTGCAGCC
TATGCAGGAG AGATGGTCGG CATCTACCGG GAAGCGATCG CTATGGCATT ACCGGGCAGT
ACCGGAAGCA ACGTTGCCGC TCTCCGCGAA CGGGCAAAAG CGATCGCCCT TGGGGGCATT
ACCGCAGGCC ACTACCAGAG GGGACTTACC GGAGAATAG
 
Protein sequence
MYLSGRRFGA RKFAQNFSEE EIAGAIAYAH ARGVRVYVTV NTLIHDRELP EALAYLVRLY 
AAGADAVLVQ DAGLAALAKE IVPDLVLHAS TQLTIHNAEG VRWAHTMGFS RVVLARELPL
HEVEAIARAT SDTGVGLEVF AHGALCYSYS GQCLLSSVIG GRSGNRGMCA QPCRKPYALV
TAQTDRYGRP VDPKDMPVPG QYLLSPKDLC TYRDIPKLVD STVAALKIEG RMKSAEYVAI
VVSTYRRALD AAAAGTFVSD ENEIRDLALA FNRGFTRGYL FGDKKGKIMA RDRPDNRGVL
IGTVIRYDPI KRVAIICPDK PVTLRPGDGF LFSLPKSPEK EWGFSLNTEP VILPEGIALF
VPRPAEKGEQ VCLTSSIDLL AGARQIVKQE NPLLHHPVPA ALAAAVSPDG DLTLTGILYP
PGKAPVVISA AGELRLEPAK TRPLTREQLA GQLEKTGGTP FVLSDLALEY DGTRFAPVRE
INRVRREFFA RAEAALVAAS RPGEKAVKDA ERRLDRWRTG KTAPISTERA GRTPEIILWA
DNLEAIEAGV QAGAGTICYE PCGKGTEPEL AAAIDSALDL CKAQDTRLIW KLPRITREAE
IMLIRQVLPR LYAAGLRTCM ADNPGAAIAV ADAVPGMELA GSIGLNVFNA ETARAFGTLP
FTLLTLSPEL SATEIEDLAH AHGEGPALAV FAQGNLEAMV TGDCLISPLE RCHGGTGPCS
KGRWYGIRDG TGHLLPVRTD STCQGHIFNA AETCLVAAVP DLVRQGIRGF VIDARGRTAA
YAGEMVGIYR EAIAMALPGS TGSNVAALRE RAKAIALGGI TAGHYQRGLT GE