Gene Mpal_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1042 
Symbol 
ID7271776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1071328 
End bp1072494 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content59% 
IMG OID643569679 
Productgeranylgeranyl reductase 
Protein accessionYP_002466113 
Protein GI219851681 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR02032] geranylgeranyl reductase family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATGA TGAAACCCTA TGATGTGGTG GTAGTCGGTG CCGGACCGGC AGGGACTGCA 
GCGGCTGCAT CCTGTGCCGC AGCCGGCCTC TCAACGCTGG TGATTGAGGA ACACGGGACG
ATCGGGTATC CGGTCCAGTG TGCCGGTCTC CTCTCGGAGG CTGCCTTTGC CGAGTGCCGG
GTATCAGAAC GGCCGGTGCT GAACACGGTG CGTGGCGCCC GGATTGCTTC GGATCTTGGT
GGTTCACTCC TTTTTGATGC AGGAAAAACC AAGGCTTTCG TGGTCGATCG GGGTTTGCTC
GACCGTGAGA TGGCAGTCGC CGCCGCAGAT GCAGGGGCCG AGTTCATGCT GAAGACCGGG
TTCACCAGTA TATCGGGGGA CAGAGTCATG ACCTGTGGGA TCCATGGAGA ACAATCGATT
GGGTATCGCC TCCTGATAGG GGCCGACGGT CCGCGGTCGA TGGTGACACG ATCGCTCGGC
CTGCCCCGTG CTCAGACCTA TCTTTCAGGG ATTCAGGCAG ACCTCGCGGT GGCTGATCCG
GTTGACCCCA GGTTTGTCGA GATCTATCCG GACGCTTCTC CTGAGTTCTT TGGCTGGCAG
ATCCCGCTCG CACCCCGGAG GATCCGGGTC GGGCTCTGTG GAACCTGCGG GGTCAGAGAG
AAGTTTGAAC GATTCCAGGC ACGATTTCCG GGTTCGTGCC TGCATCTGGT TACCGGTACG
ATCCCGCTCG GTGTGATTGG GAAGACCTAT GGTAAACAGG CTCTGATCGT CGGAGATGCG
GCAGGGTTTG CAAAACCGAC CTCCGGTGGC GGGGTCTACA CTGGTATCCG TTCCGCCCGT
CTGGCTGCAG AGACTGCTAT CGCCTGTATT GAGGAGGGGC GGTTCGATGA CGCAGCCCTG
AGCAGGTATG AGAAGGCCTG GCAGGACGAT TTTGGAAAGG AACTGGCAGT CGGGTACAAA
CTGATCACCG CACGCCAAAA AATGACCAGC CTGGAGATCG ATCAGATTCT CCGGGCGATG
AACGACCCCT CGATCATCGA GTCGATCGTC AACTTTGGGG ATATGGACCG CCCCTCGGCG
CTGATCCGAA AACTGATGTT GAAACCCGCG CTCTGGGGGG CGATGAAGAT ACTGCTCAAT
TCAGGGATTC GACAGATCTT TGGGTGA
 
Protein sequence
MKMMKPYDVV VVGAGPAGTA AAASCAAAGL STLVIEEHGT IGYPVQCAGL LSEAAFAECR 
VSERPVLNTV RGARIASDLG GSLLFDAGKT KAFVVDRGLL DREMAVAAAD AGAEFMLKTG
FTSISGDRVM TCGIHGEQSI GYRLLIGADG PRSMVTRSLG LPRAQTYLSG IQADLAVADP
VDPRFVEIYP DASPEFFGWQ IPLAPRRIRV GLCGTCGVRE KFERFQARFP GSCLHLVTGT
IPLGVIGKTY GKQALIVGDA AGFAKPTSGG GVYTGIRSAR LAAETAIACI EEGRFDDAAL
SRYEKAWQDD FGKELAVGYK LITARQKMTS LEIDQILRAM NDPSIIESIV NFGDMDRPSA
LIRKLMLKPA LWGAMKILLN SGIRQIFG