Gene Mboo_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2023 
Symbol 
ID5411828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2096061 
End bp2097377 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content64% 
IMG OID640869265 
Productnickel-dependent hydrogenase, large subunit 
Protein accessionYP_001405180 
Protein GI154151562 
COG category[C] Energy production and conversion 
COG ID[COG3259] Coenzyme F420-reducing hydrogenase, alpha subunit 
TIGRFAM ID[TIGR03295] coenzyme F420 hydrogenase, subunit alpha 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.113638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGG TCATTTCCAT CTCGCCAACG ATACGCCATG AAGGTAAGTC GGGACTCGTC 
CTGGATGTTG ACGAAAAGGG GATTGTTACC CGGGGGGACT GGGTCGGGTT GTCACCGGTC
CGGGGGATCG AGCGGTTCTG TACCGGAAAG AAGATGCACC AGGTACCAAA GATTGCCTCC
CGGACCTGCG GTATCTGCCC GGTGCCCCAT GTGCTCGCGG GCGTCGGGGC CATGGAAGCC
TCGATCGGCT GCGAGGTCCC AAAAGACGCT CTCCTCCTGC GCAGGATCAT CCATAGTGCG
TCGCGCCTCT CGGTCCATGC CCTCCACGCC TTTATGGTGC TCCCGGACCT GTATTACCCC
GGCACCGATA CCCGGATCAA CCCATACTCC CCCGAGCCCC GGGCCCGCGC CATCGCAGAC
CGGATCCAGC GGATCCGGGA GATCGGGCAG GACTGCGTGC AGATAGCGGG CGGCGAGGCG
ATCCACCCGG GCAACCCCCG GGTGGGCGGG ATGTACCGCA ATATCTCCCC GCAGGCAAAA
ACAAAACTCT TCGATCTGGC AAAAGAGGGA AATGTCCTTG CCCACGAACA CCTGGACTGC
ATGCTTGCCC TGATCCGGGA TTTTTCCCGG CGGGAGTGGG TGGAGATCGG TGGCGCCCGG
GTGCCGGTCC CAAAGAACCT CGGATACCAC AACCAGGGCT ATCTTGCCAC GGACCCGCTG
TACGGCACCT CAAGCCTTGA GGAGCACCCG TCCTTTGACC TTGCACGGTA TGCGGAAGTA
TCGCCTGAAC ACTGGTACCG GGGGCCGGGG GAGGTTACGT ACGGGGATCC CACCTATCCC
GGGGGCGGGA CGTTACCTGA GGGGACCGCG TTTGATCCCG CACGGGAGAT GTGCCCGGCC
GTGCCCATCT ATGACGGGCA GCCGGTCGAG GTCGGGGCTG CGGCACGGCT CCGGCGTTTT
TCGAATTTTG ACGAGAAAGG CACGATCGGG CAGCTCGTGG CCCGGCAGAT GGAGTGCATC
CCGGCCGTAA CCGAACTGGA GGACTGCATT GACCGGCTCA ATCCCGCAGG GGCAGTCCGT
GCGGGCACAC TTCCCCCCGG CGACGGGAAG CCGGGCTGGG CAGCAAACGA GGCCCCCCGC
GGGACACTGG TCCACATCAC CCGGGTCAAG GATCGGAAGG TCCGGTTCTT CAAGATGATC
GTTCCCACGT CATGGAACAT GCCGACCGCC GGCCTTGCGC TTGCCGGTTC GCCCTGGCAG
CTTGCCGAGT TTGTTATCCG GGGCTACGAC CCGTGCATAT CCTGTGCGTC GCACTAA
 
Protein sequence
MTRVISISPT IRHEGKSGLV LDVDEKGIVT RGDWVGLSPV RGIERFCTGK KMHQVPKIAS 
RTCGICPVPH VLAGVGAMEA SIGCEVPKDA LLLRRIIHSA SRLSVHALHA FMVLPDLYYP
GTDTRINPYS PEPRARAIAD RIQRIREIGQ DCVQIAGGEA IHPGNPRVGG MYRNISPQAK
TKLFDLAKEG NVLAHEHLDC MLALIRDFSR REWVEIGGAR VPVPKNLGYH NQGYLATDPL
YGTSSLEEHP SFDLARYAEV SPEHWYRGPG EVTYGDPTYP GGGTLPEGTA FDPAREMCPA
VPIYDGQPVE VGAAARLRRF SNFDEKGTIG QLVARQMECI PAVTELEDCI DRLNPAGAVR
AGTLPPGDGK PGWAANEAPR GTLVHITRVK DRKVRFFKMI VPTSWNMPTA GLALAGSPWQ
LAEFVIRGYD PCISCASH