Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1572 |
Symbol | |
ID | 5410097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1639467 |
End bp | 1641965 |
Gene Length | 2499 bp |
Protein Length | 832 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640868806 |
Product | peptidase U32 |
Protein accession | YP_001404732 |
Protein GI | 154151114 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.812702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTATCTGA GCGGCAGACG GTTTGGGGCA CGGAAGTTCG CGCAGAACTT CTCCGAAGAA GAGATCGCAG GGGCGATCGC ATATGCCCAT GCCCGGGGGG TCCGGGTGTA TGTGACGGTC AACACGCTGA TCCATGACCG GGAGCTCCCT GAAGCACTTG CGTACCTGGT CCGGCTGTAC GCGGCCGGCG CGGATGCGGT GCTCGTGCAG GATGCAGGTC TTGCAGCGCT TGCAAAGGAG ATCGTGCCGG ATCTCGTGCT CCATGCCTCA ACCCAGCTCA CGATCCATAA TGCCGAAGGG GTGAGGTGGG CGCACACGAT GGGGTTCTCA CGTGTGGTGC TTGCCCGTGA GCTGCCCCTC CATGAAGTGG AGGCGATTGC CCGGGCCACG TCAGATACCG GCGTTGGGCT CGAAGTTTTT GCCCACGGGG CCCTCTGCTA CAGTTATTCC GGCCAGTGCC TCCTCTCATC GGTGATCGGC GGGCGGAGCG GGAACCGGGG CATGTGCGCC CAGCCGTGCC GGAAACCCTA TGCCCTTGTT ACGGCACAGA CAGACCGGTA CGGCCGGCCT GTTGATCCAA AAGACATGCC CGTGCCAGGT CAGTATCTCC TCTCGCCAAA GGACCTCTGC ACGTACCGGG ATATTCCAAA ACTCGTGGAT TCCACGGTAG CAGCCCTCAA GATCGAGGGC CGGATGAAAT CGGCAGAGTA CGTGGCAATC GTGGTCTCGA CCTACCGGCG GGCACTCGAT GCTGCGGCTG CCGGTACCTT TGTTTCGGAC GAAAATGAGA TCCGCGATCT TGCCCTTGCC TTCAACCGGG GTTTCACCCG GGGCTATCTC TTTGGCGACA AAAAAGGAAA AATCATGGCC CGGGACCGGC CGGACAACCG CGGGGTCCTT ATCGGTACGG TCATCAGGTA TGACCCAATA AAGAGGGTGG CAATCATATG TCCGGACAAG CCCGTAACCC TTCGCCCCGG CGACGGTTTC CTCTTTTCTC TTCCAAAAAG CCCGGAGAAG GAGTGGGGAT TTTCCCTCAA TACCGAACCG GTCATACTTC CGGAGGGGAT CGCACTGTTT GTCCCCCGCC CGGCAGAGAA GGGAGAACAG GTCTGTCTCA CCTCTTCGAT CGATCTCCTG GCCGGGGCCC GGCAGATCGT AAAACAGGAA AATCCCTTGC TGCATCACCC GGTCCCGGCA GCGCTTGCTG CCGCAGTCAG CCCGGACGGG GACCTCACCC TTACCGGAAT CCTGTACCCT CCGGGAAAAG CGCCGGTGGT TATATCTGCG GCCGGGGAGC TCCGGCTCGA ACCGGCAAAG ACACGCCCGC TTACCCGGGA GCAGCTGGCC GGGCAGCTCG AAAAGACCGG GGGAACGCCC TTTGTCCTTT CGGACCTGGC GCTTGAGTAT GACGGCACCC GCTTTGCCCC GGTCCGGGAG ATCAACCGGG TCCGGCGGGA GTTTTTTGCC CGGGCAGAGG CAGCCCTTGT GGCGGCCTCG CGCCCCGGGG AAAAAGCCGT CAAAGATGCA GAGCGGCGCC TGGACCGGTG GCGTACGGGA AAAACAGCAC CGATATCCAC GGAACGCGCC GGGCGCACAC CGGAAATAAT TCTCTGGGCA GACAACCTTG AGGCGATTGA AGCCGGTGTA CAGGCCGGGG CAGGAACAAT CTGCTATGAA CCCTGCGGAA AGGGAACGGA GCCGGAGCTG GCAGCCGCGA TCGATTCAGC CCTTGACCTC TGCAAGGCAC AGGACACCCG GCTTATCTGG AAACTGCCCC GGATCACCCG GGAAGCAGAG ATCATGCTTA TACGTCAGGT CCTGCCGCGG CTGTACGCAG CAGGCCTGCG GACCTGCATG GCAGACAACC CGGGCGCGGC CATTGCAGTT GCAGATGCCG TGCCGGGCAT GGAACTGGCA GGCTCGATCG GGCTTAACGT TTTCAATGCG GAGACCGCCC GGGCTTTTGG GACCCTGCCC TTTACGCTGC TCACGCTCTC GCCGGAATTG TCCGCAACTG AGATCGAAGA CCTCGCACAC GCTCACGGGG AGGGCCCGGC GCTTGCAGTA TTTGCGCAGG GAAATCTTGA GGCCATGGTT ACCGGCGACT GTCTCATCTC CCCTCTGGAG CGGTGCCATG GCGGTACCGG GCCCTGTTCC AAAGGGAGGT GGTATGGGAT CCGGGACGGG ACCGGCCATC TCCTGCCGGT CCGGACCGAC AGTACCTGCC AGGGCCATAT CTTCAATGCT GCCGAGACCT GCCTTGTGGC TGCGGTACCG GACCTTGTCC GGCAGGGGAT TCGTGGCTTT GTGATCGATG CCCGGGGGCG GACTGCAGCC TATGCAGGAG AGATGGTCGG CATCTACCGG GAAGCGATCG CTATGGCATT ACCGGGCAGT ACCGGAAGCA ACGTTGCCGC TCTCCGCGAA CGGGCAAAAG CGATCGCCCT TGGGGGCATT ACCGCAGGCC ACTACCAGAG GGGACTTACC GGAGAATAG
|
Protein sequence | MYLSGRRFGA RKFAQNFSEE EIAGAIAYAH ARGVRVYVTV NTLIHDRELP EALAYLVRLY AAGADAVLVQ DAGLAALAKE IVPDLVLHAS TQLTIHNAEG VRWAHTMGFS RVVLARELPL HEVEAIARAT SDTGVGLEVF AHGALCYSYS GQCLLSSVIG GRSGNRGMCA QPCRKPYALV TAQTDRYGRP VDPKDMPVPG QYLLSPKDLC TYRDIPKLVD STVAALKIEG RMKSAEYVAI VVSTYRRALD AAAAGTFVSD ENEIRDLALA FNRGFTRGYL FGDKKGKIMA RDRPDNRGVL IGTVIRYDPI KRVAIICPDK PVTLRPGDGF LFSLPKSPEK EWGFSLNTEP VILPEGIALF VPRPAEKGEQ VCLTSSIDLL AGARQIVKQE NPLLHHPVPA ALAAAVSPDG DLTLTGILYP PGKAPVVISA AGELRLEPAK TRPLTREQLA GQLEKTGGTP FVLSDLALEY DGTRFAPVRE INRVRREFFA RAEAALVAAS RPGEKAVKDA ERRLDRWRTG KTAPISTERA GRTPEIILWA DNLEAIEAGV QAGAGTICYE PCGKGTEPEL AAAIDSALDL CKAQDTRLIW KLPRITREAE IMLIRQVLPR LYAAGLRTCM ADNPGAAIAV ADAVPGMELA GSIGLNVFNA ETARAFGTLP FTLLTLSPEL SATEIEDLAH AHGEGPALAV FAQGNLEAMV TGDCLISPLE RCHGGTGPCS KGRWYGIRDG TGHLLPVRTD STCQGHIFNA AETCLVAAVP DLVRQGIRGF VIDARGRTAA YAGEMVGIYR EAIAMALPGS TGSNVAALRE RAKAIALGGI TAGHYQRGLT GE
|
| |