Gene Nmar_1545 details


Gene Information

Locus tag: Nmar_1545
Symbol:
ID: 5772968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1406818
End bp: 1410627
Gene Length: 3810 bp
Protein Length: 1269 aa
Translation table: 11
GC content: 33%
IMG OID: 641317197
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001582879
Protein GI: 161529053
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
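
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the record does not state either convention. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1406818
END_BP = 1410627
GENE_LENGTH_BP = 3810
PROTEIN_LENGTH_AA = 1269


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is assumed to be a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions the three fields agree: 3810 bp is 1270 codons, of which one is the stop.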


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00227645
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCAAGG CGGCTATTTT CTTTTCAATC CTTCTTTTAT CTAGTTTTTT GGCAAATTCG 
TATGCATTTA CTGGTGATGA TGTATACCTA CAAGATAATT CCCAAACGAT TGAATTTACT
TCTCAAATAA TTGATGTTGA TTCAAATTTT TTTGTAGAAA ATCATTTTAA ACGTTATTTG
ATTTTTGGAA CAAACACTCA AAATTATGAT TACTTAGAAA ATAATTCAAT ATATGGAATA
AAATCTGATC ATGGATTTTT TTATGTTTCA CTTTTATCTG AAAAATCAGC TTCATTATTG
GCAAGCCAAG GAATTCACAT AATTGAAGAT TCTAAATTAG ATTTTCATTC ATCTGATAAT
GAAATAGTTG ATGCTACACG AATTGGCGAA ATCACTGGAT CAAAAAATGC ACAACTCAAT
TACAATGCAT CAGGAAATGG AACTGTAATT GCAATTATAG ACACTGGAGT TGATTTTTCA
AACCCCGACA TACAACATTC TCTTGCTAGA GATGAACTTA ATCATCCTCT TATGCTTGAT
CCTGATGGAC AAGGAATTAT TCTAACCAAT TCAACTTTTT TTGCATATAT TGACGAAAAT
GAAATTATAC GAAATTATAC AAAACCCATT CCAGAACATA TGATTTCTTC TGCATATGTT
ACTAGGGAAG GAATATTTCT TGATCTTTCT CAAGGAGGAA AAGGAAGTGA TATTCAAATT
TACAATTCCT TTTTTCCACA AATTGGTTCA TCTCCAATTT TTAAAGGAAC TTTAGATAAA
GATATGAAAA TAGGACAAGA TAATAGAAAT TACATAAAAT CAAAAAGTGG AGAATACCGT
CTTGGTGTAA TTTATCAAGG AGGATTAAGT GGACCTCTTG CCAAAGTTCA AGTCGTACCT
GTTCTTGTAG TTGATTCTTT TATTCCTGGA GTATATGATA CAATTATTCC TGATATGAGT
ACTTCTTGGG AAGACTATAC TCGATTTAGC TTACCTTCTG GGCAACAACC AAATTATGAT
TTTGATTTTA CTGATGAAAA ACCAATTGTT TTGGGAAGTG GTAACGAATT TCTTGTTTAT
GATTCAAACA ACGATGGAAA AACTGATTAC AGTGCTGGAA CAATTGGTGC ACAAGTTTTA
GATGTTTATG GTGTTATTAG AAACAATTCC ACTGAAATTG ATGATAAACT AAATGCAATT
AATGGCACTC TTTTACCTGC ATTAGATCCT AATGGTGAAT TTTTTGGTAT AATGACTGAT
TTTATGGGGC ATGGTACTTC AAGTACTGCA TCAATTGTTT CTCGCGGACA AGAAACATAC
GATATTTACA ATGACACTAA AAAATATTCA ATAACTGGTG TTGCACCTGG TGCAAAAATT
TTGCCTGTAA AGGCATTATG GTTCGGTGAT ACTGTTTATG CATGGTTATG GTCTGCAGGA
TTTGAAAATG GAGATAATAT TTGGGAATTT TCTGGAAAAC CTAGAGTAGA TATAATTTCC
AACAGCTGGG GCATTTCAAA TTTCCCATCA TTCAAATCTG CACCGGGAAT GGATGTCTTA
TCATTAATCC AAAGTATTCT TTCAACCCCT CATTCACTTG ATGATGATTA TCCTGGAGTT
GTAATGGTAT CTAGTGCTGG GAACTCTGGT CATGGATATG GAACAATTGG ACTACCTAAT
GCATCTCCCT TTGGAATAAC TGTTGGGGCT ACTACTAACA ATGTTTTTGT TGGATATGGT
CCATTCAAAG ACCAACCAAG ATTTGGTAAT TCAACTAATC ACTATAATCA TGTAGTTGAT
TTCTCAAGTC GAGGACCAAC TGCAATTGGT GATCCAAAAC CTGATGTGAT GAGTCTTGGT
GCTCATGGAT TTGTTCCCTC AAATATGATA AAAACTACAA AAGATTCCAA GGATGAATCC
TTTTCATTAT TTGGAGGGAC TAGCATGGCT GCCCCATTAG TTTCTGGAAG TGCTGCCATT
TTGATTGAAG AGATGACAAA ACAATCTCAA GACTATGATT CATTTATGAT AAAAAATATC
TTAATGTCTA CTGCAGTAGA TATGAATAAT GACCCTTTTA CCCAAGGTTC CGGTTTGACA
AATGTTAACT CTGCTTTAGA TTATGTTCAT GGAAAAAATG GTGTGTTTAT GGTAACTAAT
GAAAATTCTT ATGAAAACCT CAAAAAAATT CTTGAGCCTT CGATTGAAAA CTTTAATCAC
ACTGCAATAG GTTTTCAGCA ATTCAAACTT CCTTCACACT CTCTTCCTAT GGCTAGTTGG
TTTGGAGGAC AATTGATTGC CGGGGATAGA ACTACTGCAA CTTTTACTAT CTCAAATCCT
ACAGAAAATG AAATTATTGT TGATGTAAAA CCAGAAACAT TGGCTCTGAT TAAACATACT
CAATTTAACG GAACAACAAC TCCTCGTCAA CACGATTCCA TTTTGAATAA AACTGATACA
TTTATTCCAA ATTATGTAAA ATTATCTGAT GTAAAACCTC ATGAAGACCT AGAGGATTTC
TTTGATGAAA AAAATCCCAT CCCTGATGAA TCTTCTTTAA TGATCTTAAA CGTAAATTTT
CCTTTTGATC AATTTATGAA TAGTACTTCT GATGTTTATG CTGATGATCT TAAAATTTCT
TCATTATACT TGTATGATTG GCTTGATAAC AATAATGATA CAAAAATTAC CAGTGATGAA
ATATCTCTAG TTAATAGAGG CGGATCTTGG GGAACTGTTC AAGAAATTCG AGTGTCAGAA
CCAAATGAAA AATTTGATGG TGTTCCATTA GTTGGTGTTT ATCCAGTTCC TACACGATAC
TCTTACTGGT TAGGAAACAC AAATCAAAAT TCAACATCAA TGGATTATAC CATATCTGCA
AGTTATTATC AAAATACTAA ATGGTCTATG TTGTGGCCTA AATCTGAAAC AATAACTGTT
CCTTCTAATG GCGTTTCAAC AGTAGATGTA ACTCTGGTTA CTCCAACTGA TTTAGAAACA
GGTGTGTATC AGGGATTTTT GAATTTTCAA AGTAAAATGC ATGAAGTAAA CACACCAGTA
TCATTTGTAA TTAAAGAACC AATAACCGAA AATGATTCAA CGATAATAAT TCATGGAAAA
CAAAATGATG ACGTCCTTTA TGGAAATGGA TATACCAAAG GAGCATTTGA TATGACAAAT
CGTTACATGG CAGGTGATTG GCGACAATAT TACTTTGATG TACAAAATGA ATTTGTAAAT
TCTGCAGCTA TAGAACTTTC CTGGACTAGT GATGATACAA ACCTATCTGT ATTTGTTATG
GATCCTCTAG GACAAATCAT TCAAACTAAT GTTCCTTCTG GAGTGTTTGG ACATTTTCTT
GGATGGCCAT CACTTGATTG GTTGGGAAAT TCTCTATTTA GTCAGGGTGG TGGATTTTTC
CCAGTAAAAA ACAAGGATGA TACATCAACT GTATTGTATG TGCCAATTAA TCAAACTGGT
ACTTACACAT TGTTGACACA CTCTACATTG TTTGGAGGAA ATTCTACAAC TGAACCTATT
ACATTAGCTG CAAAATTTAC AAACATCTCT ACTGAATTAG TATCTCAGAA TTCAGAAATC
ATAATTGAAG ATGAAACGTC TTCAGACTTG GAGAAAACCA CTAAAAATGA AACTATTTCC
ATTGAAAAAG AAACTATTGT AAAAGAAACC ATAATTTCAT CTAATGACTC TGATTCTTTG
CTTTTAGTAG GGATTGGAAT AGGAATAGCT ATTGGAATTG CGATCGGAAT TGTTTCTATT
ATTGTAATTA GACAAAAACC TGCAAAGTAA
 
Protein sequence
MPKAAIFFSI LLLSSFLANS YAFTGDDVYL QDNSQTIEFT SQIIDVDSNF FVENHFKRYL 
IFGTNTQNYD YLENNSIYGI KSDHGFFYVS LLSEKSASLL ASQGIHIIED SKLDFHSSDN
EIVDATRIGE ITGSKNAQLN YNASGNGTVI AIIDTGVDFS NPDIQHSLAR DELNHPLMLD
PDGQGIILTN STFFAYIDEN EIIRNYTKPI PEHMISSAYV TREGIFLDLS QGGKGSDIQI
YNSFFPQIGS SPIFKGTLDK DMKIGQDNRN YIKSKSGEYR LGVIYQGGLS GPLAKVQVVP
VLVVDSFIPG VYDTIIPDMS TSWEDYTRFS LPSGQQPNYD FDFTDEKPIV LGSGNEFLVY
DSNNDGKTDY SAGTIGAQVL DVYGVIRNNS TEIDDKLNAI NGTLLPALDP NGEFFGIMTD
FMGHGTSSTA SIVSRGQETY DIYNDTKKYS ITGVAPGAKI LPVKALWFGD TVYAWLWSAG
FENGDNIWEF SGKPRVDIIS NSWGISNFPS FKSAPGMDVL SLIQSILSTP HSLDDDYPGV
VMVSSAGNSG HGYGTIGLPN ASPFGITVGA TTNNVFVGYG PFKDQPRFGN STNHYNHVVD
FSSRGPTAIG DPKPDVMSLG AHGFVPSNMI KTTKDSKDES FSLFGGTSMA APLVSGSAAI
LIEEMTKQSQ DYDSFMIKNI LMSTAVDMNN DPFTQGSGLT NVNSALDYVH GKNGVFMVTN
ENSYENLKKI LEPSIENFNH TAIGFQQFKL PSHSLPMASW FGGQLIAGDR TTATFTISNP
TENEIIVDVK PETLALIKHT QFNGTTTPRQ HDSILNKTDT FIPNYVKLSD VKPHEDLEDF
FDEKNPIPDE SSLMILNVNF PFDQFMNSTS DVYADDLKIS SLYLYDWLDN NNDTKITSDE
ISLVNRGGSW GTVQEIRVSE PNEKFDGVPL VGVYPVPTRY SYWLGNTNQN STSMDYTISA
SYYQNTKWSM LWPKSETITV PSNGVSTVDV TLVTPTDLET GVYQGFLNFQ SKMHEVNTPV
SFVIKEPITE NDSTIIIHGK QNDDVLYGNG YTKGAFDMTN RYMAGDWRQY YFDVQNEFVN
SAAIELSWTS DDTNLSVFVM DPLGQIIQTN VPSGVFGHFL GWPSLDWLGN SLFSQGGGFF
PVKNKDDTST VLYVPINQTG TYTLLTHSTL FGGNSTTEPI TLAAKFTNIS TELVSQNSEI
IIEDETSSDL EKTTKNETIS IEKETIVKET IISSNDSDSL LLVGIGIGIA IGIAIGIVSI
IVIRQKPAK
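
As a rough consistency check between the two sequences, the gene can be translated and compared with the protein. The following is a minimal sketch, not the database's own method: it builds a codon table from the standard NCBI ordering of bases (`TCAG`), whose amino-acid assignments are shared by translation table 11 (table 11 differs from the standard table only in its permitted start codons). The helper names `clean` and `translate` are hypothetical. The example inputs are the first 30 and last 30 nucleotides of the gene sequence listed in this record.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_30 = clean("ATGCCCAAGG CGGCTATTTT CTTTTCAATC")
last_30 = clean("ATTGTAATTA GACAAAAACC TGCAAAGTAA")

print(translate(first_30))  # MPKAAIFFSI
print(translate(last_30))   # IVIRQKPAK
```

Both outputs match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` would allow the same comparison over all 1269 residues.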