Gene Nmar_1547 details


Gene Information

Locus tag: Nmar_1547
Symbol:
ID: 5774529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1411775
End bp: 1416979
Gene Length: 5205 bp
Protein Length: 1734 aa
Translation table: 11
GC content: 42%
IMG OID: 641317199
Product: hypothetical protein
Protein accession: YP_001582881
Protein GI: 161529055
COG category:
COG ID:
TIGRFAM ID:
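
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1411775
END_BP = 1416979
GENE_LENGTH_BP = 5205
PROTEIN_LENGTH_AA = 1734


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported gene length (5205 bp) and protein length (1734 aa) are consistent with the listed coordinates.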


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.114504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAACG AAATAGGACG TAAAATAACT AGTCTTACAT TAATGACAAT TATGGTTGCA 
GGTGGACTTA CCTTTGCAAT TCCAGGTGTA ATGCCTGAAG CAATGGCCGC CAACGCCAAC
CTATTTGTTT CCGCAGAAAA CTCACAGTTT GATAACTACA TGTCAGGACC TCAAGTAATT
GAGGTCGTCG TAATTGATAG TGACATCAAT GATACAGATG AGGCAAAAGG TGAACCAGAC
GTAACCGTTA ACGGTAAGGT CCTGAGAATG GTCCAAGCAG TTGATGGTAA CTGGTATGGT
TACTTTGCAG ACAGAGACCA AGCCCAAATT GCAGACTCTA CTGCTACAAC AGCAGATTCT
GGATTGGACT TTGGTGTCTT TTGTGCATCA TCATCTGGTA CAGCAGCCTT AGGATTCTCA
ACAACTGAAA CAGATGGTAT CGCAATTCCA ATTACTATTG CAAATGCAAC AGCAACTGGT
AACGGTACAC AAACCGGTAG CAGCAGTGGT GGTGCAATTA CTACCACATG TGCAGCAAAT
ACACTTGATG CATCAACTGC TAATGGTACA ATTAACGTTG TAAGAGAAGC CAAAGATCCT
GTAGCAGCAT CTGGTAGTGT CAGTGTAGGC CAAATCGGTC TAAAAAATGG TACAGCTAAT
AGCGGTCCAA ACTGGCCTTT CATTCAGCTC TATGAATTAA ACCCAACAGG TAACGTTGTC
GTACAGTACA ACAAAGGTGG TGGTGTTCAA TCAACAACAC TTACTTTTGA TACTGTTGAT
CAATTTGCCG AATTGAGTTT AGATAGAACT GTATTCCCAA GAGTATCACA AGTTCACGCA
ACAATTACTG ACTTATGGTT AAACATTGAC CCAACTGATG AAGACTCTTG GACATTTGCT
ACCAACACGA AGAATACTAC ATCATCCTTC AACGTTGATA CATTTTACCA AGTATTCGAT
GAAAACGGTG CTTCCGGTGG AAGCGCATTA ACCCTGAGAA CAACATTGTC CAGCTTGATG
TGTGAGGACA ATTGTGTATT AACATTAGAT GTCGATGCAC AAAGCTCTGG CACACCAGTT
GTGACAATTC AGGATAACGG CGATTCAATC CTTACCCAAC TTAATGCCTC TTCAAATACT
AATGCAAATA ATGCATCTGC ATTTGGTATT TCAACAGAGA CAGCCAAGTT AGGTACAGGT
TCCATCCCAG TTACCATCAC CGAACAAGGT CCAAACAGTG GTGTCTTTGG TACTTATGAC
GAGTCTGACA AGTCAGTACT AAAGATTACA GATAATGCAA AGAGGGGAAC CTCTGCATCA
CTCGATTACA ATGAAACACC TCAAACTATC CTTGTAGGAT TCTCCTTTGC TAGTATTGAT
ATTCAGCCAG TGACTGATGA ATGGACCTCT GGACAAGAGA TTCCAGTAGT CATCGTTGAT
GCTGATCAAA ACAAAAACAG CAGAGCAGAT GAAGACTTAG ACTTAAACAA CCCAGACGTA
ACCCTGATTC CAGCACTCCG TACTGGTGAT CCATTTACTA TCGATGAGGG TGGAACCCCT
AGCTTGATCT TTACCAATGG TACAAATGGT GATGATAGCA TCTTTGATAC AGGTGCAATA
AACAACACCT CAGCAGGTCA AGTAGGTAAC TTTACACTCA ACATCAATGT AACTAGATTT
TCCAGCGCAA CCAACATCAC TTCAACTGAA TCTATTGATA CATTCAGTAA GAGATTAATC
TCTGCACAGA CTGCCAATAG TTCAGCAAAC TTTGATGTTG ACTTTGCAAT CATTGATCTC
GGTAGTGCAA CATTGGAAAC CCTAAAAGAA ACTGTAGTTG ATGAAGATAA CACCGCAGTC
GGTTTTAACT TCTTTAACTA TGATGTTAGA TCATTAGGTG CAGATACAGT AAGTATCGCA
TTGCTTAACA CCACAGGAAA TATTCTCCCA TGGGTTAACA ACGATACAAG AAATGTTGAC
AAAAACAATG CAATATTGTT AGTCAGCAAT TCAACTAATT CACAGGCTTA CGTTGATTTG
ACCAATGCAG TATCTGATGC AGTTTACGGA TCTACCAATA CTGATAGTAA CGTAAACATC
GGATTTGCAA TGTACTTCAC AGGTGTCGGC GACCTCGCCG CCAAAGAAGT AATCGTCATG
GACTTCTTCT CATTTGGTTT CACTGATGAT GGTGTGCAAT CTTCTGAAAG ATTTGCAAAC
CAAATAATCA GAATTGAAGC TGAAGAAACA GGTGATAACA CAAGTACCTT TGAAGGTTCA
CTTGAGTATG TCATGGTTAA CCAAATTAAC ATACAGGATG CTGGTACCTT TAGTGGTATC
ACACCAATCG CAGATGATCC ATCATTCATT GTAATTGAGG ATCTTACTGA CGAAGATGCA
CCAAGAGTCA ACTATAATGA CTTAGGTGCA GATGGTGTAA CAACTCCTGT ATCTGACCAA
GAAGAGGCTC CAAGCCACTC TGGTGTTGTA TCTCTAAACG CTGATTCATA CAAGATTGCT
GACACTGTAG TAATAACTGT AGAAGACTTA GATCTTAACG TAGATTCTGA TCTTATTGAC
ATCTTTACTG TTGTTTCTGA TAATTCAAAA GCAACAGATG ACGCCGTTGG TTCTGCCACA
ACTCAATCTT TGAGCTTTGG TGAACTCGGT AGATTATTAG ATGTTACATT TGATGATGTT
ATCTGGTCAA CTCCTGACGG TGCAAACAAT ACTGCAACTG GTAATGACAG TGACACATGT
TCCACTGAAC TTAGCAATGC AGGAATTACT GATACCGGAC TTGGAGCAAC TGGATTCACT
CTAGTTGAAA CCGGCGCAGC AACTGGTGTA TTTGTTGGTG ATTTCCAAAT CCCATCATTT
TGGTGTAGAG TCTCTGACAC TACAACAACA CCATATACCT ACGCAGGTGA CGAAGAAACA
ACAACCGGAC TCGATATCGA AGTTAACTAT GTTGACTTCA GAGATGCATC TGGTGAAATC
GTCGAAGTCG GCGACTCAGC AGGTGTTAGA GCAAACACCG GTTCCGTTAG CCTTGATAGA
ACTGTCTATC CAGTACCATT TGGTACAATA GCAGATTCTT CAAAAGCCGC TAACGCAGCA
CCAAATGGAA GATCAGTATT CCCAATTCAC GCAACTGGAA TCACTAGTAC TATTGATTCC
ACTGAAGAAT TACCTACAGG AGATCTAACT ATCCACGTCA GAATTAACGA TCCAGACTTT
GATGAAAACC CAGCTGGTGA AGATGCAATG GACCAAGATA ATGCACTCAA AATCTCTGTT
ATCAGAGGTT CTGATAGTGT AGTTCTCGGC TATGCAGGCG CTTCTGAAAG AACCGGAAAG
ATTGATGTTG GTGGTAACAA TGGAACCATC TCAAACATCA GAAGCTTCGG TGAAATGGAC
GAAATCGCAC CAGATGCAGG TATTTTCGAA CTGGATGTAA ACATCAAATT CACTGACGGT
CCAGCATCAG CACAATGTAA CAGCCATGAC ACCCTCTATA CCGCATTAGA CGGTACTACT
GGTAAGGCTG ACACTAACAG ATTTGACGAC GGTGCAGCAT CTGGTCAAGA ATACTGTATC
TTACAAGGAG ATATTCTCCA AGTAGAATAC ACTGATCCAG CTGACGCATC TGGTGATGCA
AATACTGTTA CTGATTCTGC AACATTTGAC CTAAGAAACG GTGTATTACA ATCTGACAAA
TCCGTATACA TTATCGGTTC AGACATGATC TTAACACTCA TTGAGCCAGA CTTTGATCTT
GACAATGACA GTGCTGAGAC CTATGACTTG GACTTGATCG AATGGGACTC TGATGCCGCC
ACCACTACCA TGGGTAACAA AGGTGTAACC GGCGCAGCAG CTGCATTTGA CCCAGAACCA
ACTGACTTTA GAGAAACAGG TGACTCTACT GGTATCTTCC AGATCGTCAT CGAAATTCCA
GAATCACTTT CTAATGACAA ATTAGAAAGA GGTGAGGAAA TCATCCTAGA GTATACTGAC
TGGGGTCCAT CCGGATCTGA TTATGTAGGA GATGAAGATG AAGATGTCAA CTTGACAATC
TACACTTCAA ACTTCGGAGC AACTGTAGAA CTTGACCAAA AAGTATACTC TTGGACTGAC
AAAGTATACA TCACTATTGT CGCACCAGAT CACAACTTTG ACAGTGACCT AGTTGACGAA
ATCGGAGAAA CTGACAGTGA CCCAATTAAG GTCTCTACCA GAGGATTTGA TCTTGACAAC
TACAAACTCG TCGAGACTGG TACTGACACA GGTATCTTTA CTGGTGAAGT AATCCTCACA
GGATTTACTG CCCATGATGC TGATGGTGAT GGAAATACTG GCGATGCAAC CGGTACCACT
TCTGGTAGCG GTCCAACAGA TGGTCTCTTG GCCACTGACG ATGATGACGG ACTTACTGTC
TCCTTCGAAT TCTCTGAAGA TGAGACAATT GTAGGTTCTG CCCTCATTAG ATGGAACATC
GGTGAAGTCC AATGGCTTGA GGCAAGCTAT CCAGCTAGCG GAACAGGTGT TGTAAGAGTA
ATTGATCCAG ACATGAACTT AGATCCAGAA GCAGTCGACA ACTTCGAAGT CGACGTATGG
TCTGACTCCG ATGCCGGAGG TATTGATCTT ACTGTAACTG AGACTAATGA GGCAACCGGA
ATCTTTGAGG GAACTGTGTT CTTCACAACC CTTGATGAAT CATCTGGTCA CAGACTCAGA
GTTTCAGAAG GTGACACAGT CACTGCAGAA TATGAGGACA ATACACTACC TGATCCATAC
ACAACTGCAG ATGAACTTGA TATTACTGCC ACTTCACTAA TTGGCACTGT AGTACCACCT
CTCGAGAGAG CACCAGCTGC TAACTTGAGA ACCGTTGACG CATTCGGTAA CAGCTTAGAT
TCTGTTTCCG TTGACCAACA GGTACAAATC AGCGCTGACT TAGCAAATGG TCAGGATAGA
GAGCAATCAT TTGCATACTT GGTACAGATT CAGGATGCAA ACGGTGTTAC AGTCTCACTA
GCATGGATTA CAGGTTCACT ATCTAGCGGT CAATCATTCA GCCCAGCTTT ATCATGGATT
CCAACTGAAG CAGGAACATA CACTGCTACT GCATTCGTTT GGGAGTCTGT TGATAATCCT
ACGGCATTAT CACCACCAGT TAGTACAACT GTCAACGTAA GTTAG
 
Protein sequence
MNNEIGRKIT SLTLMTIMVA GGLTFAIPGV MPEAMAANAN LFVSAENSQF DNYMSGPQVI 
EVVVIDSDIN DTDEAKGEPD VTVNGKVLRM VQAVDGNWYG YFADRDQAQI ADSTATTADS
GLDFGVFCAS SSGTAALGFS TTETDGIAIP ITIANATATG NGTQTGSSSG GAITTTCAAN
TLDASTANGT INVVREAKDP VAASGSVSVG QIGLKNGTAN SGPNWPFIQL YELNPTGNVV
VQYNKGGGVQ STTLTFDTVD QFAELSLDRT VFPRVSQVHA TITDLWLNID PTDEDSWTFA
TNTKNTTSSF NVDTFYQVFD ENGASGGSAL TLRTTLSSLM CEDNCVLTLD VDAQSSGTPV
VTIQDNGDSI LTQLNASSNT NANNASAFGI STETAKLGTG SIPVTITEQG PNSGVFGTYD
ESDKSVLKIT DNAKRGTSAS LDYNETPQTI LVGFSFASID IQPVTDEWTS GQEIPVVIVD
ADQNKNSRAD EDLDLNNPDV TLIPALRTGD PFTIDEGGTP SLIFTNGTNG DDSIFDTGAI
NNTSAGQVGN FTLNINVTRF SSATNITSTE SIDTFSKRLI SAQTANSSAN FDVDFAIIDL
GSATLETLKE TVVDEDNTAV GFNFFNYDVR SLGADTVSIA LLNTTGNILP WVNNDTRNVD
KNNAILLVSN STNSQAYVDL TNAVSDAVYG STNTDSNVNI GFAMYFTGVG DLAAKEVIVM
DFFSFGFTDD GVQSSERFAN QIIRIEAEET GDNTSTFEGS LEYVMVNQIN IQDAGTFSGI
TPIADDPSFI VIEDLTDEDA PRVNYNDLGA DGVTTPVSDQ EEAPSHSGVV SLNADSYKIA
DTVVITVEDL DLNVDSDLID IFTVVSDNSK ATDDAVGSAT TQSLSFGELG RLLDVTFDDV
IWSTPDGANN TATGNDSDTC STELSNAGIT DTGLGATGFT LVETGAATGV FVGDFQIPSF
WCRVSDTTTT PYTYAGDEET TTGLDIEVNY VDFRDASGEI VEVGDSAGVR ANTGSVSLDR
TVYPVPFGTI ADSSKAANAA PNGRSVFPIH ATGITSTIDS TEELPTGDLT IHVRINDPDF
DENPAGEDAM DQDNALKISV IRGSDSVVLG YAGASERTGK IDVGGNNGTI SNIRSFGEMD
EIAPDAGIFE LDVNIKFTDG PASAQCNSHD TLYTALDGTT GKADTNRFDD GAASGQEYCI
LQGDILQVEY TDPADASGDA NTVTDSATFD LRNGVLQSDK SVYIIGSDMI LTLIEPDFDL
DNDSAETYDL DLIEWDSDAA TTTMGNKGVT GAAAAFDPEP TDFRETGDST GIFQIVIEIP
ESLSNDKLER GEEIILEYTD WGPSGSDYVG DEDEDVNLTI YTSNFGATVE LDQKVYSWTD
KVYITIVAPD HNFDSDLVDE IGETDSDPIK VSTRGFDLDN YKLVETGTDT GIFTGEVILT
GFTAHDADGD GNTGDATGTT SGSGPTDGLL ATDDDDGLTV SFEFSEDETI VGSALIRWNI
GEVQWLEASY PASGTGVVRV IDPDMNLDPE AVDNFEVDVW SDSDAGGIDL TVTETNEATG
IFEGTVFFTT LDESSGHRLR VSEGDTVTAE YEDNTLPDPY TTADELDITA TSLIGTVVPP
LERAPAANLR TVDAFGNSLD SVSVDQQVQI SADLANGQDR EQSFAYLVQI QDANGVTVSL
AWITGSLSSG QSFSPALSWI PTEAGTYTAT AFVWESVDNP TALSPPVSTT VNVS
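
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The sketch below shows one way to strip the whitespace, compute GC content, and translate the CDS. It is not the method used by the database: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAATAACG AAATAGGACG TAAAATAACT AGTCTTACAT TAATGACAAT TATGGTTGCA"
print(translate(clean(first_line)))  # MNNEIGRKITSLTLMTIMVA
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` and `translate` allows the same comparison against the GC content and Protein Length fields.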