Gene Mpal_0566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0566 
Symbol 
ID7270150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp557543 
End bp558973 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content61% 
IMG OID643569212 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002465661 
Protein GI219851229 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.359165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.116957 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGG TGATCAATGG GGAGTGGGTC GGCTCGCTCT CAGACAGAAC GTATACGGTA 
AAGAATCCAG CGAATGGCGA GGCGGTCGGC CCAGCCCCGC TCGGAGTTCG GGATGATGTG
AAGCGGGCGG CCGATGCCGC AGAGGAGGCG CTCTCCCGAT GGGGGGGCAC CTCCTCCCAC
CATCGGGGGC AGATCCTCAC CAGGGCTGCA GCCAGGATCC GGGAACAGGC CGAGGAGATC
GCGATAATTC TGACGGCGGA GCAGGGAAAA CCGCAGCGTG AGGCAATTGA TGAGATCCGG
GGTACAGCCA GGGTCTTTGA ATACTATGCC GGCCTCTCGT CCAATCTCAC GGCCACGGTT
CAGCATCTTG AGGACGGGTC CGAAGCGACG GTAATGCGGG AGCCGATCGG GGTCTGTGGG
GCGATCATCC CCTGGAACAT GCCGGCGCTG TTGATGGCCT GGAAGGTCGG TCCGGCTCTT
CTGACCGGGA ACACCGTCGT ACTCAAGCCG GCCACTGCAA CCCCGCTAAC CCCCCTGATG
CTGGCTGCGG CTCTGCACGA TGTCGGGCTT CCCAACGGCG TTCTGAACGT GGTGACCGGT
TCAGGCGACG AGGTAGGGGA GGAGATCGTC CGTTCCAAAC AAATTCAGAA GGTCTCGTTC
ACCGGCTCGA CCCAGACCGG CAAGCGGATC ATGACACTGG CGGCCCATGA CCTCAAGAGG
TTGACCCTCG AACTTGGGGG GAGTGATCCA ATGATCGTTT GTGGGGATGC TGATATCCCT
AAGGCCGTCG CCGGAGCCGT CGCCGGCAGG TTCTATAATG CCGGACAGAT CTGCACCGCG
GTGAAACGCC TCTATGTCGT CGACTCGGTC GCCGACCAGG TGATCGAACA GATTACCGAG
AAGGTCGGCC AGATCACCAT CGGTGACGGA ATGAAGCCTG AGGTGAAGAT GGGACCGCTC
TCCAGCCTGC AAGGGCGGGA GTCAATCCGC TCGGTCGTCA GGCAGGTGGT CGACCGGGAG
GAAGGTCGAG TGATCGCCGG AGGGGAACTA CCACAGGGGG ATGAGTACAT CCGGGGGAAC
TTCTATACCC CGACGCTGGT GACTGATGTC GTCCCGGATT CGATCCTGCT TCGAGAGGAG
ATCTTTGGAC CGGTACTCCC GATTGTTCGG GTGAAGGATC TGAACGAAGC GATTACTGCC
GCCAACAGCA CACGCTACGG GCTGGGTGCT TCGATCTGGA CCAGTGACCT AAAGACGATT
CGGACTGCAG TCAGCGGGCT GAAGGCCGGT ATTATCTGGG TGAACCAGCA CCTGAAGATC
CCACCGGAGG TGCCCTTCGG AGGCGTGAAG GAGAGTGGGG TCGGTCGGGA GAACGGTCTG
CAGTCTCTGG ATGCCTACAC CGAGGCGAAG ACGGTGCTGG TCAGACTCTG A
 
Protein sequence
MKMVINGEWV GSLSDRTYTV KNPANGEAVG PAPLGVRDDV KRAADAAEEA LSRWGGTSSH 
HRGQILTRAA ARIREQAEEI AIILTAEQGK PQREAIDEIR GTARVFEYYA GLSSNLTATV
QHLEDGSEAT VMREPIGVCG AIIPWNMPAL LMAWKVGPAL LTGNTVVLKP ATATPLTPLM
LAAALHDVGL PNGVLNVVTG SGDEVGEEIV RSKQIQKVSF TGSTQTGKRI MTLAAHDLKR
LTLELGGSDP MIVCGDADIP KAVAGAVAGR FYNAGQICTA VKRLYVVDSV ADQVIEQITE
KVGQITIGDG MKPEVKMGPL SSLQGRESIR SVVRQVVDRE EGRVIAGGEL PQGDEYIRGN
FYTPTLVTDV VPDSILLREE IFGPVLPIVR VKDLNEAITA ANSTRYGLGA SIWTSDLKTI
RTAVSGLKAG IIWVNQHLKI PPEVPFGGVK ESGVGRENGL QSLDAYTEAK TVLVRL