Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0057 |
Symbol | |
ID | 7272226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 59946 |
End bp | 60947 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643568714 |
Product | flap endonuclease-1 |
Protein accession | YP_002465174 |
Protein GI | 219850742 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR03674] flap structure-specific endonuclease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGTTG CTCTCCGTGA GGTGCTGACC GAGTACAAGC ATCCCAGAAC CTGGGAGACC CTGGCCGGGA CCGCCGCCAT CGACGGGAAC AATGCCCTTT ACCAGTTCCT CTCGATCATC CGGCAACCGG ACGGCACCCC GCTGATGAAC AGCGAAGGGA GGATCACCTC TCACCTCTCC GGGGTCTTCT TCCGCACGCT CCGGTTCCTT GAGAAGGGGA TCCGCCCGGT GTATATCTTC GACGGCAAAC CCCCTGCTCT GAAACAGGAG ACGATCGAGA GCCGGCGGGA GGTGAGACGG GAGGCCGGCG TCCAGTGGGA GGCTGCTCTG GCCCGGGGGG ACCAGGAGGA GGCGTACAAA CAGGCCCGTG CCTCCTCCCG GGTCACTCCT GAGATCATCG CCACCTCAAA AGAGCTGCTG ACCCTGATGG GCGTCCCCTG CGTGCAGGCT CCGTCCGAGG GGGAGGCCCA GGCCGCCTCG ATGGCCGCTT CCGGGGCGGT CACCTACGCC GTCTCGCAGG ACTACGACTC CCTCCTCTTC GGAGCCCCGC TGCTGGTCAG GAATCTGACC GTCTCGAGCA AACGGCGGGT GCAGGGGAGG ACGATCGCAG TCCAGCCCGA GTCGATCCGT CTCGATGAGG TGCTCGGGGG ACTCGGGATC ACCCGTGAAC AGTTGATTGA GGCCGGCATT CTGATCGGCA CCGACTTCAA CCCCGGCATT AGGGGAGTCG GACCGAAGAC AGCGCTGAAG ATCGTGAAGA AGGACGGGTT CGCCGACATG ATCGCCGAGA AGTTACCGGA CTTCGACCCG TCCCCGATCC TACAGTTCTT CCGCTCCCCG CCGGTGATCG CCAACCTCTC CCTGGACTGG CAGCCGCCGG ACCAGGCAGG GATCGAGGAT CTCCTCTGTG GGGAGTACGG GTTTGCAACA GAACGGGTAA GAACTGCCCT TCAGAAGATC AGCGGCCCTC CCGGGCAGAA GACCCTGGAC CGCTGGTTCT GA
|
Protein sequence | MGVALREVLT EYKHPRTWET LAGTAAIDGN NALYQFLSII RQPDGTPLMN SEGRITSHLS GVFFRTLRFL EKGIRPVYIF DGKPPALKQE TIESRREVRR EAGVQWEAAL ARGDQEEAYK QARASSRVTP EIIATSKELL TLMGVPCVQA PSEGEAQAAS MAASGAVTYA VSQDYDSLLF GAPLLVRNLT VSSKRRVQGR TIAVQPESIR LDEVLGGLGI TREQLIEAGI LIGTDFNPGI RGVGPKTALK IVKKDGFADM IAEKLPDFDP SPILQFFRSP PVIANLSLDW QPPDQAGIED LLCGEYGFAT ERVRTALQKI SGPPGQKTLD RWF
|
| |