Gene Mboo_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1413 
Symbol 
ID5412015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1442248 
End bp1443243 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content60% 
IMG OID640868647 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_001404574 
Protein GI154150956 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0720065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.131563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCA ACCTGATGCA CGGTGCCGGG GGAGAAGTAA TGGGCGAACT CCTGCAGACG 
CTCACAAAAT TCTCCCACAA CAATGCCGGC GGGATCGGAT TAGAGTCCCT TGACGACGGG
GCCGTAATTC CGATCAACGG TACAAACATT GTTTTTACTA CGGACTCCCA CGTGGTCCGC
CCGCTCTTTT TCCCGGGCGG GGACATCGGG AGGATATCGG TCTGCGGTAC CGTGAACGAT
CTTACCATGA TGGGGGGCCG GCCGGTGGCA CTCTCGTGCG GGATGGTGAT CGAAGAGGGT
TTCGATGTGG CCGATCTCGC CCGGATCGTT GCTTCGATGG ACGAGGCGCT GGGGGAAGCC
GGGGCATGCC TTGTAACCGG CGACACAAAA GTGGTTGAAC GGGGATCGCT TGACGGGATT
GTTATTAACA CCGCAGGGAT TGGTGTTGCA AAGACCGTTG TACGGGACAA CGGACTTGTC
CCGGGTGATG TGATCATCGT TTCGGGGACG CTGGGCGATC ATGGGATCGC GATCATGGCC
CACCGTGAGG GCTTCGATCT TGGCGAGCAG ATCCATTCCG ATGTTGCCCC GCTGTGGGGA
ATGATGGAGG GGGTTCTTGC CGCCGGCACC ATCCACGCGA TGAAGGATCC GACACGGGGC
GGGTTTGCCA GTGCCATCAA CGAGATGGCC AAAAAGAGCC GGGTTCAGGT AAGGATCGAA
GAGGACCGCA TCCCGCTGCG CCGGAGCGTG AAGAGTGCGG CAGGGATGCT CGGGATCGAT
CCGCTCGAAG TGGCAAACGA AGGAAAGGTC GTAATGGGAG TGCCGGCAGC CGATGCAGAT
GCGATCCTCG CCGCACTGCA CTCACACAAA TACGGCAAAG ATGCAGCAGT TGTCGGCAGG
GTGGTTGCCG GGTCCCACGT GATCATGGAG ACGGCGATTG GCGGCGAGCG GTTCATCGAG
CCGCCCATGG GCGATCCGGT GCCCCGGGTC TGCTGA
 
Protein sequence
MKVNLMHGAG GEVMGELLQT LTKFSHNNAG GIGLESLDDG AVIPINGTNI VFTTDSHVVR 
PLFFPGGDIG RISVCGTVND LTMMGGRPVA LSCGMVIEEG FDVADLARIV ASMDEALGEA
GACLVTGDTK VVERGSLDGI VINTAGIGVA KTVVRDNGLV PGDVIIVSGT LGDHGIAIMA
HREGFDLGEQ IHSDVAPLWG MMEGVLAAGT IHAMKDPTRG GFASAINEMA KKSRVQVRIE
EDRIPLRRSV KSAAGMLGID PLEVANEGKV VMGVPAADAD AILAALHSHK YGKDAAVVGR
VVAGSHVIME TAIGGERFIE PPMGDPVPRV C