Gene Mboo_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0199 
Symbol 
ID5411524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp191233 
End bp192891 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content58% 
IMG OID640867414 
Productthermosome 
Protein accessionYP_001403365 
Protein GI154149747 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.730772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGC AACTTGGAGG ACAACCGATT TTTATTCTCA AGGAAGGCAC CAACCGCACC 
CGCGGTCGCG ATGCCCAGGG TATGAACATC ACTGCCGCAA AAGCCGTGGC AGCTGCAGTC
AGGACTACGC TCGGCCCCAA GGGCATGGAC AAGATGCTTG TCGACACCAT CGGTGATGTT
GTTATCACCA ACGATGGTGT CACGATCCTT AAGGAAATGG ACATCGAGCA CCCGGCCGCA
AAGATGATGG TCGAGGTCGC AAAGACCCAG GACGACGAGG TCGGCGACGG GACCACCACC
GCTGTAGTCA TCGGCGGCGA GCTCCTCAAG AAGGCCGAGG ACCTCCTTGA ACAGGACGTC
CACCCGACCG TGATCACTCA CGGTTACCGC ATGGCAGCCG AGAAGGCCCA GGAGTTCTTA
AAGGATATCG CGTTCGATGT CAAGGCAAAC GACAAGGCGC TCTTAAAGAA CATCGCGGGA
ACCGCCATGA CCGGCAAGAG TGCAGAGGCA AGCAAGGAAA AGCTCTGTGA CCTCGTGGTC
AAGGCAGTCA TCATGGTAGC CGAAGAAGAC GGCACCGTGG ACATCGAGAA CATCAAGGTT
GAGAAGAAGA CCGGCGGCAG CATCGAGGAC TCCGAGATCG TTGAGGGCGT CCTTGTGGAC
AAGGAACGTG TCCACCCTGC AATGCCAAAG AAGGTCACCA ATGCGAAGAT CCTTCTCTTA
AACGCAGCCG TCGAGTTCAA GAAGACCGAA GTCGACGCCG AGATCAACAT CACCCACCCC
GACCAGCTCC AGGCATTCCT CGACGAAGAG GAGCGCATGG TCAAGGGGAT CGTCGATAAG
ATCCAGAAGA GCGGCGCAAA CGTCCTCTTC TGCCAGAAGG GTATCGACGA CATCGCCCAG
CACTACCTTG CAAAGGCCGG CATCTTTGCC GTCCGCCGTG TCAAGAAGAG CGACATGGAG
AAGCTTGCCC GGGCAACCGG TGGCAGCCTA GTCTCCTCCA TCGACGCGAT CAGCAAGGAA
GAGCTTGGCA AGGCAGGTAT TGTCGAGGAG CGCAAGGTCT CCGGCGAAGA GATGACCTTT
GTCGAACAGT GCAAGAACCC GAAAGCAGTC TCCATCATTG TCAAGGGCGG CACTGAACAT
GTTGTCGACG AGCTTGAGCG TGCCATCCAC GATGCACTCC GCGTTGTCGG TGTTGTTGTC
GAGGACAAGA AAGTTGTAGC CGGTGGCGGC GCACCCGAGA CCGAGCTCTC GCTCCGTCTC
CACGAGTATG CAGCAACGGT CGGCGGCAAA GAGCAGCTCG CCATCGAGGC GTTTGCACAG
GCTCTTGAGA TCATTCCCCG CACCCTTGCA GAGAACGCAG GCCTTGACCC GATCGACATG
CTTGTCGAGA TCCGGGCAAC CCACGAGAAG GGCAAGAAGA CCTACGGCCT GAATGTTTTC
GAAGGAAAAG CTGTCGACAT GAAGGCAGCC GGTGTTGTCG AGCCGCTCCG GGTCAAGACC
CAGGCGATCT CGTCAGCCGC AGAAGCCGCG ATCATGATCC TCAGGATCGA CGATGTTATC
GCATCATCCA GGTCCCCCGA ACCCGCAGGC GGCGCCGGTG GAATGCCCCC GGGCGGTATG
GGTGGCATGG GCGGAATGCC CGGTATGGGC GACTTCTAA
 
Protein sequence
MSQQLGGQPI FILKEGTNRT RGRDAQGMNI TAAKAVAAAV RTTLGPKGMD KMLVDTIGDV 
VITNDGVTIL KEMDIEHPAA KMMVEVAKTQ DDEVGDGTTT AVVIGGELLK KAEDLLEQDV
HPTVITHGYR MAAEKAQEFL KDIAFDVKAN DKALLKNIAG TAMTGKSAEA SKEKLCDLVV
KAVIMVAEED GTVDIENIKV EKKTGGSIED SEIVEGVLVD KERVHPAMPK KVTNAKILLL
NAAVEFKKTE VDAEINITHP DQLQAFLDEE ERMVKGIVDK IQKSGANVLF CQKGIDDIAQ
HYLAKAGIFA VRRVKKSDME KLARATGGSL VSSIDAISKE ELGKAGIVEE RKVSGEEMTF
VEQCKNPKAV SIIVKGGTEH VVDELERAIH DALRVVGVVV EDKKVVAGGG APETELSLRL
HEYAATVGGK EQLAIEAFAQ ALEIIPRTLA ENAGLDPIDM LVEIRATHEK GKKTYGLNVF
EGKAVDMKAA GVVEPLRVKT QAISSAAEAA IMILRIDDVI ASSRSPEPAG GAGGMPPGGM
GGMGGMPGMG DF