Gene Mboo_1607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1607 
Symbol 
ID5412202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1681534 
End bp1682601 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content59% 
IMG OID640868841 
Productchaperone protein DnaJ 
Protein accessionYP_001404767 
Protein GI154151149 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.231275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.231372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAC GCGACTATTA TGAGATCCTT GGCGTAAAAC GGGATGCCTC AGCGGATGAT 
CTCAAAAAAG CATTCAGGCA CCTTGCACGA AAGTATCACC CGGACCTGAA CAAGGGCAGT
AAAGAAGCGG AAGAAAAATT CAAGGAGATC AATGAGGCAT ACCAGGTGCT CGGCGACCCC
GCAAAGAAAG CCCAGTACGA CCAGGGAGGC AATGTCGAGT TTGCCCCAGG AGAAACGGCG
GGATACCGTC CGCCCAGTTA CGACGACCTG TTCCGGGACT ATGGCCTTGG GGATATTTTC
AACGCGTTCT CGGGGGGACC CCGGGGGATG CGGCAGCGGG GCGGGGCGGA TCTCCGCTAT
GATATCGAGA TATCCATCGC CGATGCTTTC AGCGGGACGA AGAATACCGT TGCCGTACCG
CATGATTACG AATGCGGCAC CTGCCATGGA ACGGGTGCAG AACCCGGTCA CGTGAAGGAC
TGTCCAACCT GCCATGGGAC GGGTGAGATC CGGAACCCGC GGAAAGTAGG TAACCGCACG
ATGATGAGCA TCGCCCAGTG CCCGGACTGT GGCGGAACCG GCAAAATCAT TGACAAGCCG
TGCAGTACCT GCAAAGGTTC GGGAACTGTG CAGAAGATGC GGCGGATCGA AGTGACGATC
CCGCGGGGCG TTGAGGACGG GCAGTTTTTG CGCATTGCCG GGGAAGGAGA ACCCGGGGAG
AACCAGGGCC CTCCCGGCGA TCTCTATGTT GTGGTTCACG TAAAGAAGGA TGAGATCTTC
GAACGGCACG GTGCCGACCT GCAGACCACA GCAGCGGTCG GCCTTGCCAC CGCGCTCCTG
GGAGGAGAGA TCACGGTACC GACGATTACC GGGAGTGCGT CCTTAAAAAT TCCCCGCGGT
ACGCAGAGCC ATACCCTTTT CCGGCTCCGG GGCCAGGGGA TGCCGTCCCT CAGTTCCGGG
AACCGCGGGG ATCTCCTTGT CAGGGTGATC GTGAAAATCC CGGCAAACCT GACAAAAAAG
CAGGAGGAAC TGATGAAGGA AGCGTTCGTG GGTGGGCCTG CAGGGTGA
 
Protein sequence
MATRDYYEIL GVKRDASADD LKKAFRHLAR KYHPDLNKGS KEAEEKFKEI NEAYQVLGDP 
AKKAQYDQGG NVEFAPGETA GYRPPSYDDL FRDYGLGDIF NAFSGGPRGM RQRGGADLRY
DIEISIADAF SGTKNTVAVP HDYECGTCHG TGAEPGHVKD CPTCHGTGEI RNPRKVGNRT
MMSIAQCPDC GGTGKIIDKP CSTCKGSGTV QKMRRIEVTI PRGVEDGQFL RIAGEGEPGE
NQGPPGDLYV VVHVKKDEIF ERHGADLQTT AAVGLATALL GGEITVPTIT GSASLKIPRG
TQSHTLFRLR GQGMPSLSSG NRGDLLVRVI VKIPANLTKK QEELMKEAFV GGPAG