Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_2053 |
Symbol | |
ID | 5410686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 2129150 |
End bp | 2131132 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640869297 |
Product | thimet oligopeptidase |
Protein accession | YP_001405210 |
Protein GI | 154151592 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.499816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTCC GTCTCCCCCT CTCCCCGTTA AAGACCAGCT ACCGGCCCGG CGAGATCACT GTTCTGTGCG ATACTGCAAT CAGAACCGCC ACCACAGCCC TTGACCGGAT TGCCGTCCTT CCGCCGGAAA CGCGATCTGT AGAAACCACC CTGCTCGCGT TTGAGACCGC GATGGCGGAT TTTTCGGACG CAACCCTGCC CCTGACCCTT ATGGGTTACG TGTATCCCGA CCCGGGAGTG GCAGCAGAAG GATCGGCAAG CGAGGAGAAA ACAGGAAAGT TTGCCATTGG CGTCTTTACC CGGCGGGATC TCTATGATGC AATCCGCGGA GTTGTCCCGC GGAATCCTGC AGAGACACGC CTCCTCTCAG AAACGCTGCG GCAGTTCAAA AAGAACGGGC TTGCGCTTTC CAATGAGGGC CTTGCCCGGG TCCGGGCCTT AAAAGAGCAG ATCACCGGAC TGGAAGTGAA GTTCTCTGCA AACCTCAACA ACGATACCAC CACTTTGGAT TTTTCTGCAG AGGAACTGGG GGGTGTCCCG CAGGAAGTGC TTGCCACCTT TGCGCAGACC CCCGACGGGA AATACCGGGT CACGACCAAG TACCCGGACT ACATCCCGGT GATGCAGAAC GCTGAAAGTG CGGCGACAAG AAAACAGCTG TACGCCGCGT TTGTGAACCG GCAGGCCGTT CCCAACACGG CGCTTCTCGA GGAGGCGATC CGGGTGCGGC AGGAGTGTGC CCGGGAGCTG GGCTATGCAA GCTGGGCAGA CTACCGGCTC GATGGCCGGA TGGCACAGGA CACCGCCACC GTCCGTTCGT TTCTCTCAAG GCTTGAAGCG CCGGTCAAAG AGAAGATCCG ATCTGACCTG GCCATGCTCC TTACCCTCAA GCAGGAACTC GTACCGGGAG CAGATCGGGT CGATCCATGG GATCTCGCGT TCCTTTCAGA ACGGGAAAGG AAACAGAAAT TTGCGCTCGA CAACGAGGAG ATCCGTAAGT ATTTCCCGTT CGATCTCGTC CTTGAAGGAA TGTTCCGTTG CTTCGGCCCG CTCTTTGGGG CCCGGTTTGC CGTGGTACCT GAAGCCCCGG CCTGGGCACC GGGGGTCCGG CTGATCCGCA TCTTTGATCA GGATGACGAT CGAACCCTCG CATACCTCTA CCTTGATATG TTTCCCCGGG ACGGCAAGTA CGGGCATATG ATGATGTCCC CCCTGATCGC AGGCAGGGAA AGAGAAGGAG GATATTCCGT GCCGGTCACC GCCATCGTGG GGAACTTCCG GGCACCTTCG GGTGACATCC CCTCGCTTCT CACCCATGAC GATGTCGAGG GTCTCTTCCA CGAGTTCGGC CACGCGCTCC ATGGCTGCCT TACCAAAGCC CCCTATGCCA GCCTTGCCGG ATCGAGCGTG GAGTGGGACT TTGTCGAGAC CCCTTCGCAG GCGCTGGAGA GCTGGGTCTG GGAGCCGGAG GTGCTCGATG CGATCTCCGG CCACTATGCA CATCCTGCAG AAAAACTCCC GGCCCCGCTC CGGGACCGGA TCATCGCGGC ACGCGACCTC GGCGCCGGGC TGAGGTACAC CCGGATGCTC GTGATCTCGA CCGAGGACAT GGAATTCCAT ACCGCAAAAG GGCCGGTTGA TGTGACCGCG ACTGCCAACC GTATCTACCG GGAGCTCATG GGCATCTCGC CACTCGAAGG GGACCACGAG CCGGCCACCA TCGGCCATTT CATGGGGGGA TACGATGCCG GTTACTACAG TTACCTCTGG GCCGAAGTCT ACGCCCTGAA TATCTTTGCC CGGTTCAAAA AAGACGGCCT GTTCAATGCT GCCACCGGGG CCGCGTACCG TCACTGGATC CTCGAACAGG GAAACATGCA GGATGGAAAG GCGCTCCTTG CAGGATTCCT GGGAAAAGAG CCCGGCATGG ATGTCTTCTA CGAGAGGCTC CATATCCACC CACCCTCACC CACATCCCCG TAA
|
Protein sequence | MTFRLPLSPL KTSYRPGEIT VLCDTAIRTA TTALDRIAVL PPETRSVETT LLAFETAMAD FSDATLPLTL MGYVYPDPGV AAEGSASEEK TGKFAIGVFT RRDLYDAIRG VVPRNPAETR LLSETLRQFK KNGLALSNEG LARVRALKEQ ITGLEVKFSA NLNNDTTTLD FSAEELGGVP QEVLATFAQT PDGKYRVTTK YPDYIPVMQN AESAATRKQL YAAFVNRQAV PNTALLEEAI RVRQECAREL GYASWADYRL DGRMAQDTAT VRSFLSRLEA PVKEKIRSDL AMLLTLKQEL VPGADRVDPW DLAFLSERER KQKFALDNEE IRKYFPFDLV LEGMFRCFGP LFGARFAVVP EAPAWAPGVR LIRIFDQDDD RTLAYLYLDM FPRDGKYGHM MMSPLIAGRE REGGYSVPVT AIVGNFRAPS GDIPSLLTHD DVEGLFHEFG HALHGCLTKA PYASLAGSSV EWDFVETPSQ ALESWVWEPE VLDAISGHYA HPAEKLPAPL RDRIIAARDL GAGLRYTRML VISTEDMEFH TAKGPVDVTA TANRIYRELM GISPLEGDHE PATIGHFMGG YDAGYYSYLW AEVYALNIFA RFKKDGLFNA ATGAAYRHWI LEQGNMQDGK ALLAGFLGKE PGMDVFYERL HIHPPSPTSP
|
| |