Gene Nmar_1509 details

Gene Information

Locus tag: Nmar_1509
Symbol:
ID: 5774177
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1372252
End bp: 1373673
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 40%
IMG OID: 641317160
Product: phosphoesterase DHHA1
Protein accession: YP_001582843
Protein GI: 161529017
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID:
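
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1372252, 1373673)
print(length)                  # 1422
print(protein_length(length))  # 473
```

Under these assumptions the results agree with the Gene Length (1422 bp) and Protein Length (473 aa) fields.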


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0271209
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAT CACTTGATGA GTCACTTTCG TTTTTCAAAG ATAAAGTTAC AGATTGCATA 
AGATCTAAAA AATCAATTTT TGTTACAACC CACATTGATT GTGACGGGTT GACATCTGGA
AGTATCATTA CCAAAGCTCT GATAAGAGCT GGGGCAAATT GTACTGTTAG GACATCAAAA
GAATTTAGCA AAAATGTTGT AGACTCTTTC AAAACAGATT CTAGAGATTT TCACATAGTT
ACTGATCTTG GAGGAGGTTT TGGAAAGGAC CTCAATGAGA CACTTGGAGA TAACTGGATT
GTCTTGGATC ATCACCAAAT CCCAGATGAG GAGATAGAGA ATCCAAATGT GATTAATGCA
TGGAAGTATG GAATCGATGG AGGCCTTGAA ATTTGTGCCG GCGGAATGGC ATATCTAGCA
TCCATGGCAC TTGATGAGAA AAACTCTGAC TTGTCATCAA TTGCAGTAGT ATCTGCTCTT
GGAGACAGAC AGGACCAAGG AGAAAGAAAG TCATTTACTG GAAAGAATTT TGAAATCGCA
AACACTGCAA AAGAACAAGG ACTAGTTGAG ATTGACTTGG ACCTATTATT GGTTGGAAGA
GAGACAAGAC CACTTCCAGA TGCCTTGGCA TTTACATCCC AGCCATTTAT TGAGGGACTT
ACCTGGAACA GAGATGCCTG CCTTTCACTA CTAAATTCAT CAGGAATCCA GCTTAAAGAC
GAGGGCAGAT GGAGGGTTCC AGCAGAGCTA GACGAGGAAG AAAAAAGACA GGTAATCGAG
TCAATCACCA AATTTACAGC TGGCAAAAAT GCCACAGAGA TAATGTCTGA ATTAATCGGA
TACACTTACA CATTTCCTAG AGAAGACAAG AGGAGTTTCT TGAGGGATGG TAGAGAGTTT
TCAACTATGC TAAACTCTTG TGGAAGAATA AACCGCTCCG GAGTCGGAAT GGCAATCTGC
ATGGGAGACA GAAACAAGAT TCTAAGAGAA GGGGAGACAA TCCTGACAGA CTATAGAAAG
ATGATCAGAG AATACATGAA CATTCTATCA AATGAGAGAT GGAGGATTTC TGAAAGTGAG
ACATGTGTTA TGGTAAATGG AGAAGACATT GTCCCTGAAA CAATGACTGG AACCATCTCA
TCACTAATTG CAGGCTCTCC AAAGAATTCT GGTAAAATTG TAATTCTCAG AACAAAGGGA
GAAGAGAACA CTATCAAGTT TTCATCAAGA AAGTCATTTG GTTGCAAATC AGACATCAAC
CTAAGTGATC TGATGAGAGC TGGTGCTGAG AAGTTTGATG GTATTGGAGG AGGTCATGAT
GCAGCAGCTG GAGCAAAAAT AACTAAAGAC AAATTAGATG AGTTTCTCAA TTATTTAGAA
GTAAATGTCG TTAACGTGTC AAGTGCAGAT AGTCCTCAGT AA
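
The sequence above is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be compared with the Gene Information fields. The sketch below shows one way to do that; the function names are hypothetical, and only the first displayed line is used as input to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence (60 of the 1422 bases).
first_line = "ATGACAAAAT CACTTGATGA GTCACTTTCG TTTTTCAAAG ATAAAGTTAC AGATTGCATA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence instead of `first_line` gives values that can be compared with the Gene Length and GC content fields.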
 
Protein sequence
MTKSLDESLS FFKDKVTDCI RSKKSIFVTT HIDCDGLTSG SIITKALIRA GANCTVRTSK 
EFSKNVVDSF KTDSRDFHIV TDLGGGFGKD LNETLGDNWI VLDHHQIPDE EIENPNVINA
WKYGIDGGLE ICAGGMAYLA SMALDEKNSD LSSIAVVSAL GDRQDQGERK SFTGKNFEIA
NTAKEQGLVE IDLDLLLVGR ETRPLPDALA FTSQPFIEGL TWNRDACLSL LNSSGIQLKD
EGRWRVPAEL DEEEKRQVIE SITKFTAGKN ATEIMSELIG YTYTFPREDK RSFLRDGREF
STMLNSCGRI NRSGVGMAIC MGDRNKILRE GETILTDYRK MIREYMNILS NERWRISESE
TCVMVNGEDI VPETMTGTIS SLIAGSPKNS GKIVILRTKG EENTIKFSSR KSFGCKSDIN
LSDLMRAGAE KFDGIGGGHD AAAGAKITKD KLDEFLNYLE VNVVNVSSAD SPQ
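
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first displayed line of the gene; it is not the database's own procedure. It uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits (not needed here, since the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with codon positions cycling through T, C, A, G (third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGACAAAAT CACTTGATGA GTCACTTTCG TTTTTCAAAG ATAAAGTTAC AGATTGCATA"
print(translate(first_line))  # MTKSLDESLSFFKDKVTDCI
```

The output matches the first 20 residues of the protein sequence shown above.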