Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1547 |
Symbol | |
ID | 5774529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1411775 |
End bp | 1416979 |
Gene Length | 5205 bp |
Protein Length | 1734 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641317199 |
Product | hypothetical protein |
Protein accession | YP_001582881 |
Protein GI | 161529055 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.114504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAACG AAATAGGACG TAAAATAACT AGTCTTACAT TAATGACAAT TATGGTTGCA GGTGGACTTA CCTTTGCAAT TCCAGGTGTA ATGCCTGAAG CAATGGCCGC CAACGCCAAC CTATTTGTTT CCGCAGAAAA CTCACAGTTT GATAACTACA TGTCAGGACC TCAAGTAATT GAGGTCGTCG TAATTGATAG TGACATCAAT GATACAGATG AGGCAAAAGG TGAACCAGAC GTAACCGTTA ACGGTAAGGT CCTGAGAATG GTCCAAGCAG TTGATGGTAA CTGGTATGGT TACTTTGCAG ACAGAGACCA AGCCCAAATT GCAGACTCTA CTGCTACAAC AGCAGATTCT GGATTGGACT TTGGTGTCTT TTGTGCATCA TCATCTGGTA CAGCAGCCTT AGGATTCTCA ACAACTGAAA CAGATGGTAT CGCAATTCCA ATTACTATTG CAAATGCAAC AGCAACTGGT AACGGTACAC AAACCGGTAG CAGCAGTGGT GGTGCAATTA CTACCACATG TGCAGCAAAT ACACTTGATG CATCAACTGC TAATGGTACA ATTAACGTTG TAAGAGAAGC CAAAGATCCT GTAGCAGCAT CTGGTAGTGT CAGTGTAGGC CAAATCGGTC TAAAAAATGG TACAGCTAAT AGCGGTCCAA ACTGGCCTTT CATTCAGCTC TATGAATTAA ACCCAACAGG TAACGTTGTC GTACAGTACA ACAAAGGTGG TGGTGTTCAA TCAACAACAC TTACTTTTGA TACTGTTGAT CAATTTGCCG AATTGAGTTT AGATAGAACT GTATTCCCAA GAGTATCACA AGTTCACGCA ACAATTACTG ACTTATGGTT AAACATTGAC CCAACTGATG AAGACTCTTG GACATTTGCT ACCAACACGA AGAATACTAC ATCATCCTTC AACGTTGATA CATTTTACCA AGTATTCGAT GAAAACGGTG CTTCCGGTGG AAGCGCATTA ACCCTGAGAA CAACATTGTC CAGCTTGATG TGTGAGGACA ATTGTGTATT AACATTAGAT GTCGATGCAC AAAGCTCTGG CACACCAGTT GTGACAATTC AGGATAACGG CGATTCAATC CTTACCCAAC TTAATGCCTC TTCAAATACT AATGCAAATA ATGCATCTGC ATTTGGTATT TCAACAGAGA CAGCCAAGTT AGGTACAGGT TCCATCCCAG TTACCATCAC CGAACAAGGT CCAAACAGTG GTGTCTTTGG TACTTATGAC GAGTCTGACA AGTCAGTACT AAAGATTACA GATAATGCAA AGAGGGGAAC CTCTGCATCA CTCGATTACA ATGAAACACC TCAAACTATC CTTGTAGGAT TCTCCTTTGC TAGTATTGAT ATTCAGCCAG TGACTGATGA ATGGACCTCT GGACAAGAGA TTCCAGTAGT CATCGTTGAT GCTGATCAAA ACAAAAACAG CAGAGCAGAT GAAGACTTAG ACTTAAACAA CCCAGACGTA ACCCTGATTC CAGCACTCCG TACTGGTGAT CCATTTACTA TCGATGAGGG TGGAACCCCT AGCTTGATCT TTACCAATGG TACAAATGGT GATGATAGCA TCTTTGATAC AGGTGCAATA AACAACACCT CAGCAGGTCA AGTAGGTAAC TTTACACTCA ACATCAATGT AACTAGATTT TCCAGCGCAA CCAACATCAC TTCAACTGAA TCTATTGATA CATTCAGTAA GAGATTAATC TCTGCACAGA CTGCCAATAG TTCAGCAAAC TTTGATGTTG ACTTTGCAAT CATTGATCTC GGTAGTGCAA CATTGGAAAC CCTAAAAGAA ACTGTAGTTG ATGAAGATAA CACCGCAGTC GGTTTTAACT TCTTTAACTA TGATGTTAGA TCATTAGGTG CAGATACAGT AAGTATCGCA TTGCTTAACA CCACAGGAAA TATTCTCCCA TGGGTTAACA ACGATACAAG AAATGTTGAC AAAAACAATG CAATATTGTT AGTCAGCAAT TCAACTAATT CACAGGCTTA CGTTGATTTG ACCAATGCAG TATCTGATGC AGTTTACGGA TCTACCAATA CTGATAGTAA CGTAAACATC GGATTTGCAA TGTACTTCAC AGGTGTCGGC GACCTCGCCG CCAAAGAAGT AATCGTCATG GACTTCTTCT CATTTGGTTT CACTGATGAT GGTGTGCAAT CTTCTGAAAG ATTTGCAAAC CAAATAATCA GAATTGAAGC TGAAGAAACA GGTGATAACA CAAGTACCTT TGAAGGTTCA CTTGAGTATG TCATGGTTAA CCAAATTAAC ATACAGGATG CTGGTACCTT TAGTGGTATC ACACCAATCG CAGATGATCC ATCATTCATT GTAATTGAGG ATCTTACTGA CGAAGATGCA CCAAGAGTCA ACTATAATGA CTTAGGTGCA GATGGTGTAA CAACTCCTGT ATCTGACCAA GAAGAGGCTC CAAGCCACTC TGGTGTTGTA TCTCTAAACG CTGATTCATA CAAGATTGCT GACACTGTAG TAATAACTGT AGAAGACTTA GATCTTAACG TAGATTCTGA TCTTATTGAC ATCTTTACTG TTGTTTCTGA TAATTCAAAA GCAACAGATG ACGCCGTTGG TTCTGCCACA ACTCAATCTT TGAGCTTTGG TGAACTCGGT AGATTATTAG ATGTTACATT TGATGATGTT ATCTGGTCAA CTCCTGACGG TGCAAACAAT ACTGCAACTG GTAATGACAG TGACACATGT TCCACTGAAC TTAGCAATGC AGGAATTACT GATACCGGAC TTGGAGCAAC TGGATTCACT CTAGTTGAAA CCGGCGCAGC AACTGGTGTA TTTGTTGGTG ATTTCCAAAT CCCATCATTT TGGTGTAGAG TCTCTGACAC TACAACAACA CCATATACCT ACGCAGGTGA CGAAGAAACA ACAACCGGAC TCGATATCGA AGTTAACTAT GTTGACTTCA GAGATGCATC TGGTGAAATC GTCGAAGTCG GCGACTCAGC AGGTGTTAGA GCAAACACCG GTTCCGTTAG CCTTGATAGA ACTGTCTATC CAGTACCATT TGGTACAATA GCAGATTCTT CAAAAGCCGC TAACGCAGCA CCAAATGGAA GATCAGTATT CCCAATTCAC GCAACTGGAA TCACTAGTAC TATTGATTCC ACTGAAGAAT TACCTACAGG AGATCTAACT ATCCACGTCA GAATTAACGA TCCAGACTTT GATGAAAACC CAGCTGGTGA AGATGCAATG GACCAAGATA ATGCACTCAA AATCTCTGTT ATCAGAGGTT CTGATAGTGT AGTTCTCGGC TATGCAGGCG CTTCTGAAAG AACCGGAAAG ATTGATGTTG GTGGTAACAA TGGAACCATC TCAAACATCA GAAGCTTCGG TGAAATGGAC GAAATCGCAC CAGATGCAGG TATTTTCGAA CTGGATGTAA ACATCAAATT CACTGACGGT CCAGCATCAG CACAATGTAA CAGCCATGAC ACCCTCTATA CCGCATTAGA CGGTACTACT GGTAAGGCTG ACACTAACAG ATTTGACGAC GGTGCAGCAT CTGGTCAAGA ATACTGTATC TTACAAGGAG ATATTCTCCA AGTAGAATAC ACTGATCCAG CTGACGCATC TGGTGATGCA AATACTGTTA CTGATTCTGC AACATTTGAC CTAAGAAACG GTGTATTACA ATCTGACAAA TCCGTATACA TTATCGGTTC AGACATGATC TTAACACTCA TTGAGCCAGA CTTTGATCTT GACAATGACA GTGCTGAGAC CTATGACTTG GACTTGATCG AATGGGACTC TGATGCCGCC ACCACTACCA TGGGTAACAA AGGTGTAACC GGCGCAGCAG CTGCATTTGA CCCAGAACCA ACTGACTTTA GAGAAACAGG TGACTCTACT GGTATCTTCC AGATCGTCAT CGAAATTCCA GAATCACTTT CTAATGACAA ATTAGAAAGA GGTGAGGAAA TCATCCTAGA GTATACTGAC TGGGGTCCAT CCGGATCTGA TTATGTAGGA GATGAAGATG AAGATGTCAA CTTGACAATC TACACTTCAA ACTTCGGAGC AACTGTAGAA CTTGACCAAA AAGTATACTC TTGGACTGAC AAAGTATACA TCACTATTGT CGCACCAGAT CACAACTTTG ACAGTGACCT AGTTGACGAA ATCGGAGAAA CTGACAGTGA CCCAATTAAG GTCTCTACCA GAGGATTTGA TCTTGACAAC TACAAACTCG TCGAGACTGG TACTGACACA GGTATCTTTA CTGGTGAAGT AATCCTCACA GGATTTACTG CCCATGATGC TGATGGTGAT GGAAATACTG GCGATGCAAC CGGTACCACT TCTGGTAGCG GTCCAACAGA TGGTCTCTTG GCCACTGACG ATGATGACGG ACTTACTGTC TCCTTCGAAT TCTCTGAAGA TGAGACAATT GTAGGTTCTG CCCTCATTAG ATGGAACATC GGTGAAGTCC AATGGCTTGA GGCAAGCTAT CCAGCTAGCG GAACAGGTGT TGTAAGAGTA ATTGATCCAG ACATGAACTT AGATCCAGAA GCAGTCGACA ACTTCGAAGT CGACGTATGG TCTGACTCCG ATGCCGGAGG TATTGATCTT ACTGTAACTG AGACTAATGA GGCAACCGGA ATCTTTGAGG GAACTGTGTT CTTCACAACC CTTGATGAAT CATCTGGTCA CAGACTCAGA GTTTCAGAAG GTGACACAGT CACTGCAGAA TATGAGGACA ATACACTACC TGATCCATAC ACAACTGCAG ATGAACTTGA TATTACTGCC ACTTCACTAA TTGGCACTGT AGTACCACCT CTCGAGAGAG CACCAGCTGC TAACTTGAGA ACCGTTGACG CATTCGGTAA CAGCTTAGAT TCTGTTTCCG TTGACCAACA GGTACAAATC AGCGCTGACT TAGCAAATGG TCAGGATAGA GAGCAATCAT TTGCATACTT GGTACAGATT CAGGATGCAA ACGGTGTTAC AGTCTCACTA GCATGGATTA CAGGTTCACT ATCTAGCGGT CAATCATTCA GCCCAGCTTT ATCATGGATT CCAACTGAAG CAGGAACATA CACTGCTACT GCATTCGTTT GGGAGTCTGT TGATAATCCT ACGGCATTAT CACCACCAGT TAGTACAACT GTCAACGTAA GTTAG
|
Protein sequence | MNNEIGRKIT SLTLMTIMVA GGLTFAIPGV MPEAMAANAN LFVSAENSQF DNYMSGPQVI EVVVIDSDIN DTDEAKGEPD VTVNGKVLRM VQAVDGNWYG YFADRDQAQI ADSTATTADS GLDFGVFCAS SSGTAALGFS TTETDGIAIP ITIANATATG NGTQTGSSSG GAITTTCAAN TLDASTANGT INVVREAKDP VAASGSVSVG QIGLKNGTAN SGPNWPFIQL YELNPTGNVV VQYNKGGGVQ STTLTFDTVD QFAELSLDRT VFPRVSQVHA TITDLWLNID PTDEDSWTFA TNTKNTTSSF NVDTFYQVFD ENGASGGSAL TLRTTLSSLM CEDNCVLTLD VDAQSSGTPV VTIQDNGDSI LTQLNASSNT NANNASAFGI STETAKLGTG SIPVTITEQG PNSGVFGTYD ESDKSVLKIT DNAKRGTSAS LDYNETPQTI LVGFSFASID IQPVTDEWTS GQEIPVVIVD ADQNKNSRAD EDLDLNNPDV TLIPALRTGD PFTIDEGGTP SLIFTNGTNG DDSIFDTGAI NNTSAGQVGN FTLNINVTRF SSATNITSTE SIDTFSKRLI SAQTANSSAN FDVDFAIIDL GSATLETLKE TVVDEDNTAV GFNFFNYDVR SLGADTVSIA LLNTTGNILP WVNNDTRNVD KNNAILLVSN STNSQAYVDL TNAVSDAVYG STNTDSNVNI GFAMYFTGVG DLAAKEVIVM DFFSFGFTDD GVQSSERFAN QIIRIEAEET GDNTSTFEGS LEYVMVNQIN IQDAGTFSGI TPIADDPSFI VIEDLTDEDA PRVNYNDLGA DGVTTPVSDQ EEAPSHSGVV SLNADSYKIA DTVVITVEDL DLNVDSDLID IFTVVSDNSK ATDDAVGSAT TQSLSFGELG RLLDVTFDDV IWSTPDGANN TATGNDSDTC STELSNAGIT DTGLGATGFT LVETGAATGV FVGDFQIPSF WCRVSDTTTT PYTYAGDEET TTGLDIEVNY VDFRDASGEI VEVGDSAGVR ANTGSVSLDR TVYPVPFGTI ADSSKAANAA PNGRSVFPIH ATGITSTIDS TEELPTGDLT IHVRINDPDF DENPAGEDAM DQDNALKISV IRGSDSVVLG YAGASERTGK IDVGGNNGTI SNIRSFGEMD EIAPDAGIFE LDVNIKFTDG PASAQCNSHD TLYTALDGTT GKADTNRFDD GAASGQEYCI LQGDILQVEY TDPADASGDA NTVTDSATFD LRNGVLQSDK SVYIIGSDMI LTLIEPDFDL DNDSAETYDL DLIEWDSDAA TTTMGNKGVT GAAAAFDPEP TDFRETGDST GIFQIVIEIP ESLSNDKLER GEEIILEYTD WGPSGSDYVG DEDEDVNLTI YTSNFGATVE LDQKVYSWTD KVYITIVAPD HNFDSDLVDE IGETDSDPIK VSTRGFDLDN YKLVETGTDT GIFTGEVILT GFTAHDADGD GNTGDATGTT SGSGPTDGLL ATDDDDGLTV SFEFSEDETI VGSALIRWNI GEVQWLEASY PASGTGVVRV IDPDMNLDPE AVDNFEVDVW SDSDAGGIDL TVTETNEATG IFEGTVFFTT LDESSGHRLR VSEGDTVTAE YEDNTLPDPY TTADELDITA TSLIGTVVPP LERAPAANLR TVDAFGNSLD SVSVDQQVQI SADLANGQDR EQSFAYLVQI QDANGVTVSL AWITGSLSSG QSFSPALSWI PTEAGTYTAT AFVWESVDNP TALSPPVSTT VNVS
|
| |