Gene Mpal_0057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0057 
Symbol 
ID7272226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp59946 
End bp60947 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content64% 
IMG OID643568714 
Productflap endonuclease-1 
Protein accessionYP_002465174 
Protein GI219850742 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTTG CTCTCCGTGA GGTGCTGACC GAGTACAAGC ATCCCAGAAC CTGGGAGACC 
CTGGCCGGGA CCGCCGCCAT CGACGGGAAC AATGCCCTTT ACCAGTTCCT CTCGATCATC
CGGCAACCGG ACGGCACCCC GCTGATGAAC AGCGAAGGGA GGATCACCTC TCACCTCTCC
GGGGTCTTCT TCCGCACGCT CCGGTTCCTT GAGAAGGGGA TCCGCCCGGT GTATATCTTC
GACGGCAAAC CCCCTGCTCT GAAACAGGAG ACGATCGAGA GCCGGCGGGA GGTGAGACGG
GAGGCCGGCG TCCAGTGGGA GGCTGCTCTG GCCCGGGGGG ACCAGGAGGA GGCGTACAAA
CAGGCCCGTG CCTCCTCCCG GGTCACTCCT GAGATCATCG CCACCTCAAA AGAGCTGCTG
ACCCTGATGG GCGTCCCCTG CGTGCAGGCT CCGTCCGAGG GGGAGGCCCA GGCCGCCTCG
ATGGCCGCTT CCGGGGCGGT CACCTACGCC GTCTCGCAGG ACTACGACTC CCTCCTCTTC
GGAGCCCCGC TGCTGGTCAG GAATCTGACC GTCTCGAGCA AACGGCGGGT GCAGGGGAGG
ACGATCGCAG TCCAGCCCGA GTCGATCCGT CTCGATGAGG TGCTCGGGGG ACTCGGGATC
ACCCGTGAAC AGTTGATTGA GGCCGGCATT CTGATCGGCA CCGACTTCAA CCCCGGCATT
AGGGGAGTCG GACCGAAGAC AGCGCTGAAG ATCGTGAAGA AGGACGGGTT CGCCGACATG
ATCGCCGAGA AGTTACCGGA CTTCGACCCG TCCCCGATCC TACAGTTCTT CCGCTCCCCG
CCGGTGATCG CCAACCTCTC CCTGGACTGG CAGCCGCCGG ACCAGGCAGG GATCGAGGAT
CTCCTCTGTG GGGAGTACGG GTTTGCAACA GAACGGGTAA GAACTGCCCT TCAGAAGATC
AGCGGCCCTC CCGGGCAGAA GACCCTGGAC CGCTGGTTCT GA
 
Protein sequence
MGVALREVLT EYKHPRTWET LAGTAAIDGN NALYQFLSII RQPDGTPLMN SEGRITSHLS 
GVFFRTLRFL EKGIRPVYIF DGKPPALKQE TIESRREVRR EAGVQWEAAL ARGDQEEAYK
QARASSRVTP EIIATSKELL TLMGVPCVQA PSEGEAQAAS MAASGAVTYA VSQDYDSLLF
GAPLLVRNLT VSSKRRVQGR TIAVQPESIR LDEVLGGLGI TREQLIEAGI LIGTDFNPGI
RGVGPKTALK IVKKDGFADM IAEKLPDFDP SPILQFFRSP PVIANLSLDW QPPDQAGIED
LLCGEYGFAT ERVRTALQKI SGPPGQKTLD RWF