Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1545 |
Symbol | |
ID | 5772968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1406818 |
End bp | 1410627 |
Gene Length | 3810 bp |
Protein Length | 1269 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641317197 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001582879 |
Protein GI | 161529053 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00227645 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCAAGG CGGCTATTTT CTTTTCAATC CTTCTTTTAT CTAGTTTTTT GGCAAATTCG TATGCATTTA CTGGTGATGA TGTATACCTA CAAGATAATT CCCAAACGAT TGAATTTACT TCTCAAATAA TTGATGTTGA TTCAAATTTT TTTGTAGAAA ATCATTTTAA ACGTTATTTG ATTTTTGGAA CAAACACTCA AAATTATGAT TACTTAGAAA ATAATTCAAT ATATGGAATA AAATCTGATC ATGGATTTTT TTATGTTTCA CTTTTATCTG AAAAATCAGC TTCATTATTG GCAAGCCAAG GAATTCACAT AATTGAAGAT TCTAAATTAG ATTTTCATTC ATCTGATAAT GAAATAGTTG ATGCTACACG AATTGGCGAA ATCACTGGAT CAAAAAATGC ACAACTCAAT TACAATGCAT CAGGAAATGG AACTGTAATT GCAATTATAG ACACTGGAGT TGATTTTTCA AACCCCGACA TACAACATTC TCTTGCTAGA GATGAACTTA ATCATCCTCT TATGCTTGAT CCTGATGGAC AAGGAATTAT TCTAACCAAT TCAACTTTTT TTGCATATAT TGACGAAAAT GAAATTATAC GAAATTATAC AAAACCCATT CCAGAACATA TGATTTCTTC TGCATATGTT ACTAGGGAAG GAATATTTCT TGATCTTTCT CAAGGAGGAA AAGGAAGTGA TATTCAAATT TACAATTCCT TTTTTCCACA AATTGGTTCA TCTCCAATTT TTAAAGGAAC TTTAGATAAA GATATGAAAA TAGGACAAGA TAATAGAAAT TACATAAAAT CAAAAAGTGG AGAATACCGT CTTGGTGTAA TTTATCAAGG AGGATTAAGT GGACCTCTTG CCAAAGTTCA AGTCGTACCT GTTCTTGTAG TTGATTCTTT TATTCCTGGA GTATATGATA CAATTATTCC TGATATGAGT ACTTCTTGGG AAGACTATAC TCGATTTAGC TTACCTTCTG GGCAACAACC AAATTATGAT TTTGATTTTA CTGATGAAAA ACCAATTGTT TTGGGAAGTG GTAACGAATT TCTTGTTTAT GATTCAAACA ACGATGGAAA AACTGATTAC AGTGCTGGAA CAATTGGTGC ACAAGTTTTA GATGTTTATG GTGTTATTAG AAACAATTCC ACTGAAATTG ATGATAAACT AAATGCAATT AATGGCACTC TTTTACCTGC ATTAGATCCT AATGGTGAAT TTTTTGGTAT AATGACTGAT TTTATGGGGC ATGGTACTTC AAGTACTGCA TCAATTGTTT CTCGCGGACA AGAAACATAC GATATTTACA ATGACACTAA AAAATATTCA ATAACTGGTG TTGCACCTGG TGCAAAAATT TTGCCTGTAA AGGCATTATG GTTCGGTGAT ACTGTTTATG CATGGTTATG GTCTGCAGGA TTTGAAAATG GAGATAATAT TTGGGAATTT TCTGGAAAAC CTAGAGTAGA TATAATTTCC AACAGCTGGG GCATTTCAAA TTTCCCATCA TTCAAATCTG CACCGGGAAT GGATGTCTTA TCATTAATCC AAAGTATTCT TTCAACCCCT CATTCACTTG ATGATGATTA TCCTGGAGTT GTAATGGTAT CTAGTGCTGG GAACTCTGGT CATGGATATG GAACAATTGG ACTACCTAAT GCATCTCCCT TTGGAATAAC TGTTGGGGCT ACTACTAACA ATGTTTTTGT TGGATATGGT CCATTCAAAG ACCAACCAAG ATTTGGTAAT TCAACTAATC ACTATAATCA TGTAGTTGAT TTCTCAAGTC GAGGACCAAC TGCAATTGGT GATCCAAAAC CTGATGTGAT GAGTCTTGGT GCTCATGGAT TTGTTCCCTC AAATATGATA AAAACTACAA AAGATTCCAA GGATGAATCC TTTTCATTAT TTGGAGGGAC TAGCATGGCT GCCCCATTAG TTTCTGGAAG TGCTGCCATT TTGATTGAAG AGATGACAAA ACAATCTCAA GACTATGATT CATTTATGAT AAAAAATATC TTAATGTCTA CTGCAGTAGA TATGAATAAT GACCCTTTTA CCCAAGGTTC CGGTTTGACA AATGTTAACT CTGCTTTAGA TTATGTTCAT GGAAAAAATG GTGTGTTTAT GGTAACTAAT GAAAATTCTT ATGAAAACCT CAAAAAAATT CTTGAGCCTT CGATTGAAAA CTTTAATCAC ACTGCAATAG GTTTTCAGCA ATTCAAACTT CCTTCACACT CTCTTCCTAT GGCTAGTTGG TTTGGAGGAC AATTGATTGC CGGGGATAGA ACTACTGCAA CTTTTACTAT CTCAAATCCT ACAGAAAATG AAATTATTGT TGATGTAAAA CCAGAAACAT TGGCTCTGAT TAAACATACT CAATTTAACG GAACAACAAC TCCTCGTCAA CACGATTCCA TTTTGAATAA AACTGATACA TTTATTCCAA ATTATGTAAA ATTATCTGAT GTAAAACCTC ATGAAGACCT AGAGGATTTC TTTGATGAAA AAAATCCCAT CCCTGATGAA TCTTCTTTAA TGATCTTAAA CGTAAATTTT CCTTTTGATC AATTTATGAA TAGTACTTCT GATGTTTATG CTGATGATCT TAAAATTTCT TCATTATACT TGTATGATTG GCTTGATAAC AATAATGATA CAAAAATTAC CAGTGATGAA ATATCTCTAG TTAATAGAGG CGGATCTTGG GGAACTGTTC AAGAAATTCG AGTGTCAGAA CCAAATGAAA AATTTGATGG TGTTCCATTA GTTGGTGTTT ATCCAGTTCC TACACGATAC TCTTACTGGT TAGGAAACAC AAATCAAAAT TCAACATCAA TGGATTATAC CATATCTGCA AGTTATTATC AAAATACTAA ATGGTCTATG TTGTGGCCTA AATCTGAAAC AATAACTGTT CCTTCTAATG GCGTTTCAAC AGTAGATGTA ACTCTGGTTA CTCCAACTGA TTTAGAAACA GGTGTGTATC AGGGATTTTT GAATTTTCAA AGTAAAATGC ATGAAGTAAA CACACCAGTA TCATTTGTAA TTAAAGAACC AATAACCGAA AATGATTCAA CGATAATAAT TCATGGAAAA CAAAATGATG ACGTCCTTTA TGGAAATGGA TATACCAAAG GAGCATTTGA TATGACAAAT CGTTACATGG CAGGTGATTG GCGACAATAT TACTTTGATG TACAAAATGA ATTTGTAAAT TCTGCAGCTA TAGAACTTTC CTGGACTAGT GATGATACAA ACCTATCTGT ATTTGTTATG GATCCTCTAG GACAAATCAT TCAAACTAAT GTTCCTTCTG GAGTGTTTGG ACATTTTCTT GGATGGCCAT CACTTGATTG GTTGGGAAAT TCTCTATTTA GTCAGGGTGG TGGATTTTTC CCAGTAAAAA ACAAGGATGA TACATCAACT GTATTGTATG TGCCAATTAA TCAAACTGGT ACTTACACAT TGTTGACACA CTCTACATTG TTTGGAGGAA ATTCTACAAC TGAACCTATT ACATTAGCTG CAAAATTTAC AAACATCTCT ACTGAATTAG TATCTCAGAA TTCAGAAATC ATAATTGAAG ATGAAACGTC TTCAGACTTG GAGAAAACCA CTAAAAATGA AACTATTTCC ATTGAAAAAG AAACTATTGT AAAAGAAACC ATAATTTCAT CTAATGACTC TGATTCTTTG CTTTTAGTAG GGATTGGAAT AGGAATAGCT ATTGGAATTG CGATCGGAAT TGTTTCTATT ATTGTAATTA GACAAAAACC TGCAAAGTAA
|
Protein sequence | MPKAAIFFSI LLLSSFLANS YAFTGDDVYL QDNSQTIEFT SQIIDVDSNF FVENHFKRYL IFGTNTQNYD YLENNSIYGI KSDHGFFYVS LLSEKSASLL ASQGIHIIED SKLDFHSSDN EIVDATRIGE ITGSKNAQLN YNASGNGTVI AIIDTGVDFS NPDIQHSLAR DELNHPLMLD PDGQGIILTN STFFAYIDEN EIIRNYTKPI PEHMISSAYV TREGIFLDLS QGGKGSDIQI YNSFFPQIGS SPIFKGTLDK DMKIGQDNRN YIKSKSGEYR LGVIYQGGLS GPLAKVQVVP VLVVDSFIPG VYDTIIPDMS TSWEDYTRFS LPSGQQPNYD FDFTDEKPIV LGSGNEFLVY DSNNDGKTDY SAGTIGAQVL DVYGVIRNNS TEIDDKLNAI NGTLLPALDP NGEFFGIMTD FMGHGTSSTA SIVSRGQETY DIYNDTKKYS ITGVAPGAKI LPVKALWFGD TVYAWLWSAG FENGDNIWEF SGKPRVDIIS NSWGISNFPS FKSAPGMDVL SLIQSILSTP HSLDDDYPGV VMVSSAGNSG HGYGTIGLPN ASPFGITVGA TTNNVFVGYG PFKDQPRFGN STNHYNHVVD FSSRGPTAIG DPKPDVMSLG AHGFVPSNMI KTTKDSKDES FSLFGGTSMA APLVSGSAAI LIEEMTKQSQ DYDSFMIKNI LMSTAVDMNN DPFTQGSGLT NVNSALDYVH GKNGVFMVTN ENSYENLKKI LEPSIENFNH TAIGFQQFKL PSHSLPMASW FGGQLIAGDR TTATFTISNP TENEIIVDVK PETLALIKHT QFNGTTTPRQ HDSILNKTDT FIPNYVKLSD VKPHEDLEDF FDEKNPIPDE SSLMILNVNF PFDQFMNSTS DVYADDLKIS SLYLYDWLDN NNDTKITSDE ISLVNRGGSW GTVQEIRVSE PNEKFDGVPL VGVYPVPTRY SYWLGNTNQN STSMDYTISA SYYQNTKWSM LWPKSETITV PSNGVSTVDV TLVTPTDLET GVYQGFLNFQ SKMHEVNTPV SFVIKEPITE NDSTIIIHGK QNDDVLYGNG YTKGAFDMTN RYMAGDWRQY YFDVQNEFVN SAAIELSWTS DDTNLSVFVM DPLGQIIQTN VPSGVFGHFL GWPSLDWLGN SLFSQGGGFF PVKNKDDTST VLYVPINQTG TYTLLTHSTL FGGNSTTEPI TLAAKFTNIS TELVSQNSEI IIEDETSSDL EKTTKNETIS IEKETIVKET IISSNDSDSL LLVGIGIGIA IGIAIGIVSI IVIRQKPAK
|
| |