Gene Memar_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMemar_1201 
Symbol 
ID4847880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanoculleus marisnigri JR1 
KingdomArchaea 
Replicon accessionNC_009051 
Strand
Start bp1176717 
End bp1178147 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content67% 
IMG OID640115891 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_001047114 
Protein GI126179149 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.18248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATGA TGATCGGCGG CGAGCGGCGC GATTCGGTCT CCGGCAGGGT ATTTCCGGTG 
CACAACCCCG CGACGGGCGA CGTGGTCGGT GAAGCGCCGC TCGGCGGCGA GGACGATGTT
TTAGCCGCGA TCGAGGCCGC CGGGGAGGCG TTTCCCGACT GGGCCGCGAA GAGCCCGAGG
GATCGGGCGA AGATCCTCTT CTTTGCGGCA GAAGAGGTCC GGCGCAGGAA CACCGACCTT
GCGGCGCTCC TGACCGCCGA ACAGGGCAAG CCCATCCGCG AGGCGGTCGA CGAGATCAAC
GGGTTTGCAA ACATCCTCGA GTACTATCAC GCCCTCTCGG CGGGGCTCCG GGGCGAGTTC
GCGAACCTCA AGGGCTACGG CCGGGTTACG GTCCGGAGAC GCCCGCTCGG CATCTGCGGC
GCGATCATCC CCTGGAATAT GCCGGCGATC ATCATGGGCT GGAAGATAGG TGCGGCTCTC
GCGGCGGGGA ACACGATGGT CTTGAAGCCC GCCCGAACCG CCCCGCTGAC CTGCATGAGG
CTCGCGGAGA TCCTCGGGGA GGCGGGTCTC CCGCCGGGGG TCTTAAACGT CGTCACCGGG
CCGGGGGAGA CGGTCGGCCG GGAGATCGCG AGGAACCCAG GCGTTCGGAA GGTCTCGTTC
ACCGGGGAGG TCGGGACCGG GAGGCAGGTC GCCCTTGACG CCGCCCCTGC GATGAAGCGG
CTGACGCTGG AACTCGGGGG GAGCGATCCG ATGATCGTCT GTGACGATGC CGATATCGGA
GCGGCGGTCG AGGGGGTCAT CCGGGGACGT TACTACAACT GCGGCCAGGT CTGCACGGCG
GTCAAAAGGC TCTACGTCTC TGACTCGATC GCGGACGAGT TTGTCCGGCG GCTGACGGCG
AGGGTCGAGA CGTTCGTGGT CGGGAACGGC ATGGACCGCG GCGTCGGGAT GGGCCCGCTG
AACAACCGGG CGGGCTTAAA TCGGGTCGTT AATATTGTCG ATGCCGCCCG GGAGCGGGAC
GAGGGGAAGA TCGTCGCGGG AGGGCGGGCG CCCGAAGGGG AGCAGTACAA GCGCGGCCTC
TTCTTCCTGC CGACGCTCAT TACCGGTGTT CCGCACGACT CCGTCCTCTT CTCCGAGGAG
ATCTTCGGCC CGGTGCTGCC GATCGCCGCC GTTTCCGGTC TCGACGAGGC GCTCGAGCTC
GCGAACAATT CCCGTTACGG CCTCGGGGCG TCGGTATGGA CCCGGAACGC GGACACGATC
GCCCGGGCGA CGGAAGAACT CGAGGCGGGG ATCGTCTGGG TCAACCAGCA CCTGAGGATC
CCGCCGGAGG TGCCGTTCGG GGGGACGAAG GCGAGCGGCA TCGGCAGGGA GAACGGCAGC
CGGGCGCTCG AGGAGTACAC GGAGGAGAAG GCCGTGCTGG TGCGGCTGTA G
 
Protein sequence
MKMMIGGERR DSVSGRVFPV HNPATGDVVG EAPLGGEDDV LAAIEAAGEA FPDWAAKSPR 
DRAKILFFAA EEVRRRNTDL AALLTAEQGK PIREAVDEIN GFANILEYYH ALSAGLRGEF
ANLKGYGRVT VRRRPLGICG AIIPWNMPAI IMGWKIGAAL AAGNTMVLKP ARTAPLTCMR
LAEILGEAGL PPGVLNVVTG PGETVGREIA RNPGVRKVSF TGEVGTGRQV ALDAAPAMKR
LTLELGGSDP MIVCDDADIG AAVEGVIRGR YYNCGQVCTA VKRLYVSDSI ADEFVRRLTA
RVETFVVGNG MDRGVGMGPL NNRAGLNRVV NIVDAARERD EGKIVAGGRA PEGEQYKRGL
FFLPTLITGV PHDSVLFSEE IFGPVLPIAA VSGLDEALEL ANNSRYGLGA SVWTRNADTI
ARATEELEAG IVWVNQHLRI PPEVPFGGTK ASGIGRENGS RALEEYTEEK AVLVRL