Gene Nmar_0703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0703 
Symbol 
ID5773954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp642950 
End bp643972 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content34% 
IMG OID641316339 
Productflap endonuclease-1 
Protein accessionYP_001582037 
Protein GI161528211 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTAA ATCTAAAAGA TTTAGTTGTC AGAGAAAAAA CCACACTAGA GGCATTTTCA 
AACAAAGTAA TTGCGATTGA TGCATACAAT GCTATCTACC AATTTTTAGC AAGTATAAGA
GGTCCAGACG GGTTACAATT ATCAGATTCA GAAGGCAGAA TTACTAGTCA TCTCAGTGGG
TTACTGTACA GAAATGTAAA TTTTCTATCT CTAGGAATAA AACCAGTTTA CGTATTTGAT
GGAAAACCAC CATCTCTAAA AACAGCAGAA ATTGAGCGTA GAAAACAAAT CAAAATGGAT
GCAACCATAA AATATGAAAA AGCAATTGCA GATGGAAATA TGGAAGATGC TAGAAAATAT
GCTCAACAGA CAACAAGTAT GAAAGATGGG ATGGTAAAAG AATCAAAGCA ACTTTTGACA
TATTTTGGCA TACCATACAT TGAAGCACCA TCAGAGGGGG AAGCAACTGC AGCCCATCTC
ACAAACACAG GTCAAGCATA TGCTTCAGCA AGTCAAGACT TTGACTCAAT TTTGTGTGGA
GCAAAAAGAT TGGTGAGAAA TTTTACAAAT AGCGGTAGAA GGAAAATCCC AAACAAGAAC
ACATACATCG ATATTGTTCC AGAGATTATT GAAACACAAA AAACATTAGA CTCACTAGAA
TTAACACGTG AAGAATTAAT TGATGTTGGA ATTTTAATTG GGACAGACTT TAATCCAAAT
GGATTTGAAA GAGTAGGTCC AAAAACCGCA CTAAAAATGA TCAAACAACA TTCAAAGTTG
GAAGAGATTC CACAAATTCA AGAGCAGTTA GAAGAAATAG ATTATCAAGA AATTAGAAAA
ATATTTTTGA ATCCAGAAGT TGCAGATGTA AAAGAAATTG TTTTTGAGAA TGTCAACTAT
GAAGGAATGA GCAATTATCT TGTAAGAGAA AGAAGTTTTT CTGAAGACAG AGTAAATTCA
ACATTGAATC GATTGAAAAA GGCATTAGAA AAGAAAAGCC AAAACTTGGA TCAGTGGTTT
TGA
 
Protein sequence
MGLNLKDLVV REKTTLEAFS NKVIAIDAYN AIYQFLASIR GPDGLQLSDS EGRITSHLSG 
LLYRNVNFLS LGIKPVYVFD GKPPSLKTAE IERRKQIKMD ATIKYEKAIA DGNMEDARKY
AQQTTSMKDG MVKESKQLLT YFGIPYIEAP SEGEATAAHL TNTGQAYASA SQDFDSILCG
AKRLVRNFTN SGRRKIPNKN TYIDIVPEII ETQKTLDSLE LTREELIDVG ILIGTDFNPN
GFERVGPKTA LKMIKQHSKL EEIPQIQEQL EEIDYQEIRK IFLNPEVADV KEIVFENVNY
EGMSNYLVRE RSFSEDRVNS TLNRLKKALE KKSQNLDQWF