Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0703 |
Symbol | |
ID | 5773954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 642950 |
End bp | 643972 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641316339 |
Product | flap endonuclease-1 |
Protein accession | YP_001582037 |
Protein GI | 161528211 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR03674] flap structure-specific endonuclease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTAA ATCTAAAAGA TTTAGTTGTC AGAGAAAAAA CCACACTAGA GGCATTTTCA AACAAAGTAA TTGCGATTGA TGCATACAAT GCTATCTACC AATTTTTAGC AAGTATAAGA GGTCCAGACG GGTTACAATT ATCAGATTCA GAAGGCAGAA TTACTAGTCA TCTCAGTGGG TTACTGTACA GAAATGTAAA TTTTCTATCT CTAGGAATAA AACCAGTTTA CGTATTTGAT GGAAAACCAC CATCTCTAAA AACAGCAGAA ATTGAGCGTA GAAAACAAAT CAAAATGGAT GCAACCATAA AATATGAAAA AGCAATTGCA GATGGAAATA TGGAAGATGC TAGAAAATAT GCTCAACAGA CAACAAGTAT GAAAGATGGG ATGGTAAAAG AATCAAAGCA ACTTTTGACA TATTTTGGCA TACCATACAT TGAAGCACCA TCAGAGGGGG AAGCAACTGC AGCCCATCTC ACAAACACAG GTCAAGCATA TGCTTCAGCA AGTCAAGACT TTGACTCAAT TTTGTGTGGA GCAAAAAGAT TGGTGAGAAA TTTTACAAAT AGCGGTAGAA GGAAAATCCC AAACAAGAAC ACATACATCG ATATTGTTCC AGAGATTATT GAAACACAAA AAACATTAGA CTCACTAGAA TTAACACGTG AAGAATTAAT TGATGTTGGA ATTTTAATTG GGACAGACTT TAATCCAAAT GGATTTGAAA GAGTAGGTCC AAAAACCGCA CTAAAAATGA TCAAACAACA TTCAAAGTTG GAAGAGATTC CACAAATTCA AGAGCAGTTA GAAGAAATAG ATTATCAAGA AATTAGAAAA ATATTTTTGA ATCCAGAAGT TGCAGATGTA AAAGAAATTG TTTTTGAGAA TGTCAACTAT GAAGGAATGA GCAATTATCT TGTAAGAGAA AGAAGTTTTT CTGAAGACAG AGTAAATTCA ACATTGAATC GATTGAAAAA GGCATTAGAA AAGAAAAGCC AAAACTTGGA TCAGTGGTTT TGA
|
Protein sequence | MGLNLKDLVV REKTTLEAFS NKVIAIDAYN AIYQFLASIR GPDGLQLSDS EGRITSHLSG LLYRNVNFLS LGIKPVYVFD GKPPSLKTAE IERRKQIKMD ATIKYEKAIA DGNMEDARKY AQQTTSMKDG MVKESKQLLT YFGIPYIEAP SEGEATAAHL TNTGQAYASA SQDFDSILCG AKRLVRNFTN SGRRKIPNKN TYIDIVPEII ETQKTLDSLE LTREELIDVG ILIGTDFNPN GFERVGPKTA LKMIKQHSKL EEIPQIQEQL EEIDYQEIRK IFLNPEVADV KEIVFENVNY EGMSNYLVRE RSFSEDRVNS TLNRLKKALE KKSQNLDQWF
|
| |