Gene Mboo_1307 details

Gene Information

Locus tag: Mboo_1307
Symbol:
ID: 5411952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1330963
End bp: 1332393
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 59%
IMG OID: 640868538
Product: betaine-aldehyde dehydrogenase
Protein accession: YP_001404468
Protein GI: 154150850
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
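
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 1330963
END_BP = 1332393


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1431 476
```

Both results agree with the Gene Length and Protein Length fields of the record.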


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.424355
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0731768
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGATGC TTATCAACGG GAAAGCGGAC GCAGGTAACG GTTCCGCATG GATTGAGGTA 
ATAAACCCGG CAACCGGGGA ACTGGTCGAG CGGGTACCGG CCGGGTCAAA GGATGATGTG
AATGCGGCAG TTGAATCTGC GGATCTCGCA TTTGGGGCAT GGTCGGAAAA AACCATGAGG
GAGCGTGGCC TTCTCCTTGG CCGGGCCGCA GAACTGGTGC GCCGGGACCA CAAACAACTG
GCCGAATTGT TAACCCGGGA ACAGGGAAAA CCGATCCGGG AAGCCATTGA TGAGGTACGG
GGCTGTGCCA ATATCCTTGA GTACTATGCC TCTATTGCAG GCAAGCCGGC CGGCGAAGCG
GTCAGCCTGG GTAAAGCCGG CGACTGCCTG GTGACACGTG TCCCCTTGGG AGTCTGCGGC
GCGATCATCC CATGGAACAT GCCGGTGATC ATCATGGGCT GGAAGGTGGG CCCGGCGCTC
CTTTCCGGAA ATACCCTTGT CCTCAAACCG GCATCCACTG CACCCCTCAC AAATCTGAAG
ATCGCCGGGC TCTTTTCCGA GGCGGGTCTC CCGGCCGGAG TGCTCAACGT GGTGACCGGA
TCAGGAGAGA TCGCAGGAGA GGCGCTCGTG CAGCACAAGA GGGTAAAAAA GATCTCTTTT
ACCGGGAACG GAATCACCGG CCGGAGGATT CGGGAGCTTA CCGGTAACCG GCTTGCAGCG
CTCACGCTGG AGCTGGGCGG CTCAGACCCG ATGATTGTGA TGGCCGATGC AGATGTAAAA
AAAGCGGTCG AAGGGGCGAT CCGGGGCCGG TTCTATAACG CCGGACAGGT CTGCACTGCG
GTAAAACGGC TGTACCTGCA CGAGAAGATT GCCGAAGAGT TCATGCGCGA GCTGACTACA
AAGGTCGAGG GGCTGAAAGT TGGAAACGGG CTTGTACCGG GAACCGATAT GGGCCCGCTC
AACAGCCCTG CCCAGCGCGA CCGGATCGCA ACCGTGGTCC ATGAGATAAC TGCAAACGAT
GAAGGAAAGA TTCTGACCGG CGGCTGTCCG GTACCGGGAA AGGAGTATGA GCGGGGTAAT
TTCTATAAGC CTACACTGGT CGGGGACGTG CCCCCGGATG CGGCACTGCT GAAAAACGAG
ATCTTCGGAC CGGTCTTACC GGTGATGACC TTCCCGGATC TTGCCACCGC GATTGCCGAA
GCAAACCGCT CGCTGTACGG CCTTGGTGCG TCGGTCTGGA CCCGGGACCT TGCCACGGTT
AAAGAGTTTT TTACCCATGT TCACGCCGGG ATCGTCTGGG TGAACCGCCA CCTCACCCTC
CCTCCCGAGG TGCCGTTTGG CGGTACCGAA GAGAGCGGGA TAGGGCGAGA GAACGGCTTC
CATGCCATAG ACAGTTACAC CCAGACAAAG ACGCTCTTTT TAGGCTGGTA A
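
A GC content figure like the one reported in the record can be computed directly from a sequence string. The following is a minimal sketch; the helper name `gc_content` is hypothetical, and how the database rounds its value is not stated in the record. The example runs on the first row of the gene sequence only, so its result is not expected to match the whole-gene figure.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above (60 bases), used only as a demo input.
first_row = "ATGGAGATGC TTATCAACGG GAAAGCGGAC GCAGGTAACG GTTCCGCATG GATTGAGGTA"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full 1431 bp sequence instead of `first_row` gives the value for the whole gene.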
 
Protein sequence
MEMLINGKAD AGNGSAWIEV INPATGELVE RVPAGSKDDV NAAVESADLA FGAWSEKTMR 
ERGLLLGRAA ELVRRDHKQL AELLTREQGK PIREAIDEVR GCANILEYYA SIAGKPAGEA
VSLGKAGDCL VTRVPLGVCG AIIPWNMPVI IMGWKVGPAL LSGNTLVLKP ASTAPLTNLK
IAGLFSEAGL PAGVLNVVTG SGEIAGEALV QHKRVKKISF TGNGITGRRI RELTGNRLAA
LTLELGGSDP MIVMADADVK KAVEGAIRGR FYNAGQVCTA VKRLYLHEKI AEEFMRELTT
KVEGLKVGNG LVPGTDMGPL NSPAQRDRIA TVVHEITAND EGKILTGGCP VPGKEYERGN
FYKPTLVGDV PPDAALLKNE IFGPVLPVMT FPDLATAIAE ANRSLYGLGA SVWTRDLATV
KEFFTHVHAG IVWVNRHLTL PPEVPFGGTE ESGIGRENGF HAIDSYTQTK TLFLGW
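
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative only.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of the genetic code table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the record.
first_row = "ATGGAGATGC TTATCAACGG GAAAGCGGAC GCAGGTAACG GTTCCGCATG GATTGAGGTA"
print(translate(first_row))  # prints: MEMLINGKADAGNGSAWIEV
```

The output matches the first 20 residues of the protein sequence listed above. Translating the full gene sequence the same way stops at the terminal TAA codon.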