Gene Mpal_2770 details


Gene Information

Locus tag: Mpal_2770
Symbol:
ID: 7270880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2898350
End bp: 2899420
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 59%
IMG OID: 643571356
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_002467749
Protein GI: 219853317
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
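
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; both assumptions are consistent with the reported values but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2898350
end_bp = 2899420

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1071 bp
print(protein_length, "aa")  # 356 aa
```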


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.656616
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAT TCGTAATGAA GAGGATTGGA GAAGTTGGTT GGATGGAGAA AGATAGACCT 
GCGTGCGGTC CACTGGACGC CATCTGCCGG CCACTTGCCC TTGCACCGTG CACGTCCGAT
GTCCACACGG TCTGGGAAGG AGCCCTCGGC GACCGCCACA ACATGACCCT CGGGCACGAG
GCCCTGGGTA TCGTTGACGA GGTGGGAAGT GAGGTCAAGG ACCTCAAGAA AGGCGACCGC
GTCATTGTGC CGGCCATCAC ACCGGACTGG GGCGATGAAG CCTCACAGCG TGGGTACCCC
TCGCAGTCGA CTGGAATGTG TGGCGGCTGG AAGTTCTCGA ACTTCAAGGA CGGTGTCTTC
GCCGAGTTCT TCCACGTGAA CGAGGCGGAC AACAACCTCG CAAAACTCCC TGAAGGCATG
TCCCTCGAGG CAGCCGTCAT GATGCCTGAC ATGATGAGCA CGGGCTTCAT GGCCGCTGAG
AACGCGAGGA TCCCGATCGG TGGCTCGGTC GCGGTCTTCG GCATCGGACC GGTCGGCCTC
TGCGGTATCG CAGGAGCGAA ACTTCGGGGA GCCGGACGGA TCTTCGCCAT CGGAACCCGA
GCCAAACCCA TCGAGGTCGC GAAGGCATAC GGCGCGACCG ATATTATCAG TTACAAGAAC
GGCGACACCG TTAAGCAGAT CATGGATCTG ACCCATGGAG CGGGCGTCGA CTCTGTCATC
GTCTCCGGCG GCGGACCTGA CATCCTCGTG GACGCCATTA ACGTGGCCAA GGCCGGGGGT
GCCATCGGGA ACAACAACTA CTTTGGCAAG GGTATGTTCG ACAAGGATTA CCTGCCAATC
CCTCGTGTAG GCTGGGGCTT TGGTATGGCC AGCAAGGACA TCATCACTGG TCTCTGCCCC
GGCGGAAAGG TCCGGATGGA GCGGCTCGCC GAGATCATCA AGTACAAGCG CATGGATCCA
GGGCTCATGG CAACTCATGT CTACAAAGGC CTCGACAAGG TCGAGGATGC GCTCATGATG
ATGAAGAGCA AGTCTGGCGA TCTGATCAAG CCTGTCGTCA TCTGCGAGTA G
 
Protein sequence
MKAFVMKRIG EVGWMEKDRP ACGPLDAICR PLALAPCTSD VHTVWEGALG DRHNMTLGHE 
ALGIVDEVGS EVKDLKKGDR VIVPAITPDW GDEASQRGYP SQSTGMCGGW KFSNFKDGVF
AEFFHVNEAD NNLAKLPEGM SLEAAVMMPD MMSTGFMAAE NARIPIGGSV AVFGIGPVGL
CGIAGAKLRG AGRIFAIGTR AKPIEVAKAY GATDIISYKN GDTVKQIMDL THGAGVDSVI
VSGGGPDILV DAINVAKAGG AIGNNNYFGK GMFDKDYLPI PRVGWGFGMA SKDIITGLCP
GGKVRMERLA EIIKYKRMDP GLMATHVYKG LDKVEDALMM MKSKSGDLIK PVVICE