Gene Information |
Locus tag | Msed_0212 |
Symbol | |
ID | 5104078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 173908 |
End bp | 174810 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506117 |
Product | flap endonuclease-1 |
Protein accession | YP_001190313 |
Protein GI | 146302997 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR03674] flap structure-specific endonuclease |
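The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Msed_0212 record above.
length_bp = gene_length(173908, 174810)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 903 300
```

Under these assumptions the computed values agree with the Gene Length (903 bp) and Protein Length (300 aa) fields.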
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.022826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0048464 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATAGAC AAGGCAAGGT AACTAGTCAT CTAAATGGAG TCTTTTATAG AACCGTGAAC CTTCTTGAGG AAGGGATCAT TCCAATTTAC GTTTTCGATG GGAAGCCTCC TGAGTTGAAG GCCCAGGAAC TGGAGAACAG GAGAAAAATG AAGGAGGAGG CAGAGAAGAA GCTAGAGAAG GCCAAGGAAT CGGGAAAGGT GGAGGAAATG AGGAAGTACT CGCAGATGAC TTCCAGACTG ACCACCGATA TGGCGAAGGA AAGCAAGGAA CTCCTGGAAT ACATGGGAGT CCCCACAGTT CAGGCCCCAT CTGAGGGTGA AGCTGAGGCT GCCTACCTTA ATGCGAAGGG AATTACGTAT GCCTCCGCCA GTCAAGACTA TGATTCTCTG CTCTTCGGCG CAGAGAAGTT GATCAGAAAC CTCACCATAA GCGGGAAGAG GAAACTGCCC AACAAGGACG TGTACGTGGA GGTAAAACCT GAACTCATCG AGACAGCTTC CCTCCTCAAG AAGCTAGAGA TAACAAGGGA GCAACTGATT GACATTGCGA TTCTGGTGGG AACGGACTAC AACCCCGATG GAGTAAGGGG CATAGGCCCC AAGAAAGCCT ATAAACTGAT CAAGACCTAC AAAAAAATTG AAAATATAGA TAAAAGGGAG TTACCTGAGC CTATTTACTT TGACTACGAA AAGATAAGGG AGTTGTTCCT TAAACCTCAA GTAACTCTTC CGTCCACACC ACTTGAGCTG AGCGATCCCG ATCCCAGCAA GATAATTCAG TTTCTGGTGA ACGAGAACGA TTTTAACGAG GAAAGGGTTA GGGGAACGAT AGAGAGGCTT CAGAAGGCTA TGAAGGAGAT TAAGGATATA AAAAGACAAA CAGGGTTGGA TCAGTGGTTT TAA
|
Protein sequence | MNRQGKVTSH LNGVFYRTVN LLEEGIIPIY VFDGKPPELK AQELENRRKM KEEAEKKLEK AKESGKVEEM RKYSQMTSRL TTDMAKESKE LLEYMGVPTV QAPSEGEAEA AYLNAKGITY ASASQDYDSL LFGAEKLIRN LTISGKRKLP NKDVYVEVKP ELIETASLLK KLEITREQLI DIAILVGTDY NPDGVRGIGP KKAYKLIKTY KKIENIDKRE LPEPIYFDYE KIRELFLKPQ VTLPSTPLEL SDPDPSKIIQ FLVNENDFNE ERVRGTIERL QKAMKEIKDI KRQTGLDQWF
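The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any length or composition check. The sketch below is one possible way to do that in Python. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two display blocks of the gene sequence, used only as a short demo.
prefix = clean_sequence("ATGAATAGAC AAGGCAAGGT")
print(len(prefix), prefix.startswith("ATG"), gc_percent(prefix))
```

Pasting the full Gene sequence field into `clean_sequence` allows its length and GC percentage to be compared with the Gene Length and GC content fields reported in the record.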