Gene Mboo_0122 details


Gene Information

Locus tag: Mboo_0122
Symbol:
ID: 5411296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 111163
End bp: 112404
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 59%
IMG OID: 640867337
Product: UbiD family decarboxylase
Protein accession: YP_001403289
Protein GI: 154149671
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
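
The coordinate and length fields listed above are mutually consistent, which a short Python sketch can confirm. It assumes that the coordinates are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 111163
end_bp = 112404

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1242 413
```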


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.695479
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAGT TTATCGAACG GATGCGTAGA AAAGGGCTTG TTATCGATAT AACCGAGCCG 
TCTTCCCCCG ATGATATGAA AGCGGCACAG CATGCGGCCG GGACGGATAA GCTGCTCTTT
TTCCATAACG TGGGCGGGGC GCGGGCAGTG ATGAATGTGA CCGCGGACCG GCAGGCACTT
GCGCTTGCCC TTGGCATGGA CGAAAAGGAG ATGGTAAAAA AGCTCGCGGA TGCAGCGTTC
GACGGGAAGA TCGTCCATGA CGGGAAACTG TCGATGAAAA AACCGGATCT TTCCCTCATC
CCGGTCATGC ACTTTTTCCC CAAAGACGCC GGCCGGTACT TTACCGCCGG CATTGTCTTC
TCGAAATGGG ATGGCGTAGA GAATGCGTCC ATCCACCGTA TGCTTGTGCT CGACGACACC
AGAGTTGCTG CCCGGCTCGT GGAAGGCCGG CATACTCATG TAATGCACAA AAAGGCCCTT
GCCTGCGGCG AGCGCCTGCC GATCGCGGTG GTGGTCGGGG TGCATCCGGC CGTGACGTTT
GCCAGCTGTA CCCGGGTCCC GGCCGGGAAG GAACTGGCGT ATGCGGCAGA ACTCATGGGC
GGGACGATCC ATGTAAAGGA ATGCAGCAAT GGCGTGCTCG TTCCTGCCGA CGCGGAGATA
GTACTCGAAG GGTTCATCGG TCCAGATGTC ACCGATGAGG GCCCGTTTGT CGATATCACG
GGCACCTACG ACCCGGTCCG GCGCCAGCCG GTGATCGAGT TCACCGGGAT GCATGTAAAA
CCGGACTTCA TCTACCACAG CATCCTGCCC GGGGGCGACG AGCACAAGAT CCTGATGGGA
TGCCCCTACG AGCCTAAGAT CTACCGGGCC GTTGCCGGTG TAACCGAGGT CAGAAATGTC
GTACTCACCA AGGGTGGCTG CGGGTACCTC CATGCAGTGA TCCAGGTAAA AAAGAGCACG
CAGGGCGATG GCAAGAACGC AATCATGGCA GCCTTTGCCG CCCATACTTC GCTCAAACAC
GTAGTCGTCG TGGATGAGGA TATTGACCCG AGCGATCCGA GCGAGGTCGA GTACGCGATT
GCAACTAGAG TGAGCGGCGA CCGGGACGTG ATGGTGGTCA CGGGCGTCCG GGGGTCATCG
CTCGATCCCT GCCAGAAAGA GGATGGCACC AACGTGAAGA TCGGTGTAGA CGCGACCATG
GTGATGGGCC ACGAAGACGA GTTCCGGCGT GCAACATGGT AG
 
Protein sequence
MREFIERMRR KGLVIDITEP SSPDDMKAAQ HAAGTDKLLF FHNVGGARAV MNVTADRQAL 
ALALGMDEKE MVKKLADAAF DGKIVHDGKL SMKKPDLSLI PVMHFFPKDA GRYFTAGIVF
SKWDGVENAS IHRMLVLDDT RVAARLVEGR HTHVMHKKAL ACGERLPIAV VVGVHPAVTF
ASCTRVPAGK ELAYAAELMG GTIHVKECSN GVLVPADAEI VLEGFIGPDV TDEGPFVDIT
GTYDPVRRQP VIEFTGMHVK PDFIYHSILP GGDEHKILMG CPYEPKIYRA VAGVTEVRNV
VLTKGGCGYL HAVIQVKKST QGDGKNAIMA AFAAHTSLKH VVVVDEDIDP SDPSEVEYAI
ATRVSGDRDV MVVTGVRGSS LDPCQKEDGT NVKIGVDATM VMGHEDEFRR ATW
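
The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any analysis. The following Python sketch shows one possible way to strip the grouping, compute GC content, and translate the coding sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table assumes that translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons. Only the first line of the gene sequence is used here, to keep the example short.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the
# standard code; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues in the listing."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the listing.
first_line = "ATGCGTGAGT TTATCGAACG GATGCGTAGA AAAGGGCTTG TTATCGATAT AACCGAGCCG"
dna = clean_sequence(first_line)

print(len(dna))        # 60
print(translate(dna))  # MREFIERMRRKGLVIDITEP
```

The translated prefix matches the first two blocks of the protein sequence. Applying the same functions to the full cleaned gene sequence should allow the reported gene length, protein length, and GC content to be checked.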