Gene Mboo_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2053 
Symbol 
ID5410686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2129150 
End bp2131132 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content59% 
IMG OID640869297 
Productthimet oligopeptidase 
Protein accessionYP_001405210 
Protein GI154151592 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.499816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTCC GTCTCCCCCT CTCCCCGTTA AAGACCAGCT ACCGGCCCGG CGAGATCACT 
GTTCTGTGCG ATACTGCAAT CAGAACCGCC ACCACAGCCC TTGACCGGAT TGCCGTCCTT
CCGCCGGAAA CGCGATCTGT AGAAACCACC CTGCTCGCGT TTGAGACCGC GATGGCGGAT
TTTTCGGACG CAACCCTGCC CCTGACCCTT ATGGGTTACG TGTATCCCGA CCCGGGAGTG
GCAGCAGAAG GATCGGCAAG CGAGGAGAAA ACAGGAAAGT TTGCCATTGG CGTCTTTACC
CGGCGGGATC TCTATGATGC AATCCGCGGA GTTGTCCCGC GGAATCCTGC AGAGACACGC
CTCCTCTCAG AAACGCTGCG GCAGTTCAAA AAGAACGGGC TTGCGCTTTC CAATGAGGGC
CTTGCCCGGG TCCGGGCCTT AAAAGAGCAG ATCACCGGAC TGGAAGTGAA GTTCTCTGCA
AACCTCAACA ACGATACCAC CACTTTGGAT TTTTCTGCAG AGGAACTGGG GGGTGTCCCG
CAGGAAGTGC TTGCCACCTT TGCGCAGACC CCCGACGGGA AATACCGGGT CACGACCAAG
TACCCGGACT ACATCCCGGT GATGCAGAAC GCTGAAAGTG CGGCGACAAG AAAACAGCTG
TACGCCGCGT TTGTGAACCG GCAGGCCGTT CCCAACACGG CGCTTCTCGA GGAGGCGATC
CGGGTGCGGC AGGAGTGTGC CCGGGAGCTG GGCTATGCAA GCTGGGCAGA CTACCGGCTC
GATGGCCGGA TGGCACAGGA CACCGCCACC GTCCGTTCGT TTCTCTCAAG GCTTGAAGCG
CCGGTCAAAG AGAAGATCCG ATCTGACCTG GCCATGCTCC TTACCCTCAA GCAGGAACTC
GTACCGGGAG CAGATCGGGT CGATCCATGG GATCTCGCGT TCCTTTCAGA ACGGGAAAGG
AAACAGAAAT TTGCGCTCGA CAACGAGGAG ATCCGTAAGT ATTTCCCGTT CGATCTCGTC
CTTGAAGGAA TGTTCCGTTG CTTCGGCCCG CTCTTTGGGG CCCGGTTTGC CGTGGTACCT
GAAGCCCCGG CCTGGGCACC GGGGGTCCGG CTGATCCGCA TCTTTGATCA GGATGACGAT
CGAACCCTCG CATACCTCTA CCTTGATATG TTTCCCCGGG ACGGCAAGTA CGGGCATATG
ATGATGTCCC CCCTGATCGC AGGCAGGGAA AGAGAAGGAG GATATTCCGT GCCGGTCACC
GCCATCGTGG GGAACTTCCG GGCACCTTCG GGTGACATCC CCTCGCTTCT CACCCATGAC
GATGTCGAGG GTCTCTTCCA CGAGTTCGGC CACGCGCTCC ATGGCTGCCT TACCAAAGCC
CCCTATGCCA GCCTTGCCGG ATCGAGCGTG GAGTGGGACT TTGTCGAGAC CCCTTCGCAG
GCGCTGGAGA GCTGGGTCTG GGAGCCGGAG GTGCTCGATG CGATCTCCGG CCACTATGCA
CATCCTGCAG AAAAACTCCC GGCCCCGCTC CGGGACCGGA TCATCGCGGC ACGCGACCTC
GGCGCCGGGC TGAGGTACAC CCGGATGCTC GTGATCTCGA CCGAGGACAT GGAATTCCAT
ACCGCAAAAG GGCCGGTTGA TGTGACCGCG ACTGCCAACC GTATCTACCG GGAGCTCATG
GGCATCTCGC CACTCGAAGG GGACCACGAG CCGGCCACCA TCGGCCATTT CATGGGGGGA
TACGATGCCG GTTACTACAG TTACCTCTGG GCCGAAGTCT ACGCCCTGAA TATCTTTGCC
CGGTTCAAAA AAGACGGCCT GTTCAATGCT GCCACCGGGG CCGCGTACCG TCACTGGATC
CTCGAACAGG GAAACATGCA GGATGGAAAG GCGCTCCTTG CAGGATTCCT GGGAAAAGAG
CCCGGCATGG ATGTCTTCTA CGAGAGGCTC CATATCCACC CACCCTCACC CACATCCCCG
TAA
 
Protein sequence
MTFRLPLSPL KTSYRPGEIT VLCDTAIRTA TTALDRIAVL PPETRSVETT LLAFETAMAD 
FSDATLPLTL MGYVYPDPGV AAEGSASEEK TGKFAIGVFT RRDLYDAIRG VVPRNPAETR
LLSETLRQFK KNGLALSNEG LARVRALKEQ ITGLEVKFSA NLNNDTTTLD FSAEELGGVP
QEVLATFAQT PDGKYRVTTK YPDYIPVMQN AESAATRKQL YAAFVNRQAV PNTALLEEAI
RVRQECAREL GYASWADYRL DGRMAQDTAT VRSFLSRLEA PVKEKIRSDL AMLLTLKQEL
VPGADRVDPW DLAFLSERER KQKFALDNEE IRKYFPFDLV LEGMFRCFGP LFGARFAVVP
EAPAWAPGVR LIRIFDQDDD RTLAYLYLDM FPRDGKYGHM MMSPLIAGRE REGGYSVPVT
AIVGNFRAPS GDIPSLLTHD DVEGLFHEFG HALHGCLTKA PYASLAGSSV EWDFVETPSQ
ALESWVWEPE VLDAISGHYA HPAEKLPAPL RDRIIAARDL GAGLRYTRML VISTEDMEFH
TAKGPVDVTA TANRIYRELM GISPLEGDHE PATIGHFMGG YDAGYYSYLW AEVYALNIFA
RFKKDGLFNA ATGAAYRHWI LEQGNMQDGK ALLAGFLGKE PGMDVFYERL HIHPPSPTSP