Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0393 |
Symbol | |
ID | 5774266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 353850 |
End bp | 355832 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316022 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001581727 |
Protein GI | 161527901 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.13824 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTAA TTTTAACTCA AGCAACTGAT TCTAAAAATC AAATTCATAC ATACTTGGAA AGAAGTGTCC CATTTGTTGG AACAGATATT CCAAGAATAG ATGGAATAGA TGGAACAGGA ATCAAAATTG CGATTATTGA TACTGGTGTA GACTTTAATC ATCCAGATTT GTTTGGTTGG GGTCATGATG GAAAAGTTGT TGGAGGATAC AATTTCATTC AAGAGGGTCA ACCCCCTATG GACAATAATG GACATGGCAC ACAAGTTGCA GGGGTCATAG CTGCTGATGG GCAAGCAACA GGTGTTGCAC CCAAGGCAAA AATCCTTGCC TACAAGGTAT CTGAAGATGG AGAGGGAGTA TCATCAGATT TAATCATTAG AGCAATTGAA AAAGCAATTG AAGATGATGC AGACATTATC AATATCAGTT TAGGAGTAAA CAAAACAAAC ACAAAGATTG AAAGAGCAGT AAATCAAGCA CTAGAAAAAG AAATCTTTGT AGTAACTGCA GCTGGAAATG ACGGTCCAGA ATTAGGAAGT ATCGGAAGTC CTGGAAGAAA CTTTGGGTCA GTTACAGTTG GTGCAACTTA CAACAATCTT ACATCAAGTT TGGTTGCAAC ATTAGAAGTT AATGACAAGC CATATACAGT AATTCCAATG GCAGGAAATA AAAAAACGGA AGACGCAATT TCTGGAAAAC TAGTATTTGG AGGATTTGGA AAAGTAGATG AATTAAAAGA TCTCGATGTC AAAGATTCAA TATTGATTGT TGAAAGAGGC AGTGATGTAA AAGAGGAATT GTTGTATTTT TCAATAAAGG AAACTAATGC AGCAGACGCA GGAGCAAAAG CAATGATTGT TTACAATAAC ATTCCAGGAA TTTTTCTTGG AGAATTAATT CATGAGTTTA TTGAGCCAAA TTATTCGCCT AGAATTCCAG TTGTATCAAT TGATAGAGAA GAAGGATTAG AGATTATTGA ATCGATTAAT GGAGATGAAG CTACATTGAA TTTGTTCTTT AATCCAGATT ATCTTGCACA TTTTAGTTCA AGAGGACCAG TATCTCCATT TTATATCAAA CCAGAAATTG TGGCGCCAGG AGCATACATC AATACCACAC AAAACAATGC AGGATACAAT TTTACTAGTG GAACCAGTTA TGCGGCTCCC CACGTAAGTG GTGCAGCAGC ACTATTACTA CAAAAAAATC CAGAATTGCA CCATCATGAG ATAAAGTCAT TATTACTGAC CACAGTTGAG CCTGTATCTG ATGCCTACGG AGATGAATTC TCCATACAAG AAACAGGCGC AGGCAGGCTG AACATCGCAA AAGCATTTGA TGCAAAATTG ATCATAGAGC CTCCAAACTT TGTAGGACTA GCATCATCAG ATAACAAAGT CGTTGAAAAA CAATTCCAAC TAAAATCGCT TGATGGGACA TGGGATGGAT TTGACATGAC ATTTGATGGG CCAGAGTTCA TAAAATTTGG AGGAGAGTTA GTTGATGAAA ATATTTTGAA TGTAAAAATG AGGGTACTAG AAGACAAGTT TGGAGAACAT GAAGGAAAGA TAAAGATAAG TCATGAAGGG ATACAATATG TCATTCCATT TGTGTTACAC TATACACAAG GGTCAATTTC AATAGTTCAA CAAAACGAAA AACTATTCTT TGAAATCCAT CACCCAGAAG AATGGAGTTT TGCAAAGATT TCAGTTACAA ACAGCAAAGA CGGAAGAATT GATACAACAA CTGTAACACC AGACAAAGCT GCATCAATTG ATGTCTATGA GAATGCAGAA TATTGGGTGG ATGCAAAAAT TAGAATTAAC GGAAACACAT CAAGTGCATA CAACACAATA GAAATCAAAA CGCTTGAAGA AAGAAAAAGA CTAGACTTTG ACATTCCAGA AAGACAGATG GTAATAATTG CCAGTGCAGT AATTGTAATT GGAATAATAG GTATTGTTCT AAAAAGAAGA TAG
|
Protein sequence | MGLILTQATD SKNQIHTYLE RSVPFVGTDI PRIDGIDGTG IKIAIIDTGV DFNHPDLFGW GHDGKVVGGY NFIQEGQPPM DNNGHGTQVA GVIAADGQAT GVAPKAKILA YKVSEDGEGV SSDLIIRAIE KAIEDDADII NISLGVNKTN TKIERAVNQA LEKEIFVVTA AGNDGPELGS IGSPGRNFGS VTVGATYNNL TSSLVATLEV NDKPYTVIPM AGNKKTEDAI SGKLVFGGFG KVDELKDLDV KDSILIVERG SDVKEELLYF SIKETNAADA GAKAMIVYNN IPGIFLGELI HEFIEPNYSP RIPVVSIDRE EGLEIIESIN GDEATLNLFF NPDYLAHFSS RGPVSPFYIK PEIVAPGAYI NTTQNNAGYN FTSGTSYAAP HVSGAAALLL QKNPELHHHE IKSLLLTTVE PVSDAYGDEF SIQETGAGRL NIAKAFDAKL IIEPPNFVGL ASSDNKVVEK QFQLKSLDGT WDGFDMTFDG PEFIKFGGEL VDENILNVKM RVLEDKFGEH EGKIKISHEG IQYVIPFVLH YTQGSISIVQ QNEKLFFEIH HPEEWSFAKI SVTNSKDGRI DTTTVTPDKA ASIDVYENAE YWVDAKIRIN GNTSSAYNTI EIKTLEERKR LDFDIPERQM VIIASAVIVI GIIGIVLKRR
|
| |