Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0663 |
Symbol | |
ID | 5773410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 604071 |
End bp | 605186 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641316299 |
Product | sulfatase |
Protein accession | YP_001581997 |
Protein GI | 161528171 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.65595 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGCCTA ACATTATTTT TGTATTATTA GACGGTGCAC GGTGGGATCG TATAGAAAAT TCATTTGAAT TTTCAGATTT AAGAAAAGAT GGAATTTTTA TAAATAATGT TTCAACTGTT TTTCCATACA CTTCTGGTTC ACTAAATGTA ATTTTTTCAG GACAATTTGG TAAAGAGAAT GGTGTAGATG GCTATTACAA AGTATTGAAG TTAAAAAATT CCATACAAAT TTTGCCTGAG ATTCTTCAAA ATTATGGATA TTTTACTGCT CGGGGGTTAC TTAATGATAA ATTATTGTCT CCTAGAGGAT ATGATCTTCG AACTGTTCAT AATGAATTTG AAGATGATCT AAATGACATT CATCCAAAAT TAATTAATGA TGTATTTCAA AAGGCAAATG GAAAACCTGT TTTTCTTTTT TTACATTTTA CTCGTATCCA TACATTTACG GTTTCTGAAA TTTTAGATAA ATATGACTGG AACGATAAGA CCTTTTATGA TTTAGTAAAT TCCAATCTTA AGAAATATGA TCAAACAATT GATGAAGCTG GGCATTATGC CAAAAAAATC ACCAATACAA TCAAGATTCT TGGAGAAGAG GAAAACACAA TAATTATTTT CTTTTCAGAT CATGGTACTG GTGTTGGAGA AAGATATGGT GAACGAAGCT ATGGTTCATT TACGTATGAA GAAACAATTA GAACTTTTTA TCTTTTCACA GGGTCACTTC TTCTAAAAAA TAAATTTTCT GACAAATTAA GAAGTACTTT GGATATAATG CCTACTATCC TTGAATTGTG TAAAATAGAT GAAAATCTGA ATCTCTTTGG AAAAAGCTTT GCAGGATTTC TCACTGGAGA AACAACTCAT CTTAGTGAAA ACCCATTTAC TTTTTCTGAA ACTGGTGCAT TAGAAGGTCC ATTCCCTTCT CCTGAGCATT CTAATGTTTT TTGTATTAAA ACATCTAATT ATAAATTAAT TTATTATGTT TCTAACAATT CTTGGGAATT ATTTGATTTG CAAAAAGATT TCCATGAAAA AAATAATTTG ATAGGAACTT TACCATTAAT TGAAGATGAT TTAAAACAAA AACTCTTGAA TTGGATAAAT CGTTAA
|
Protein sequence | MKPNIIFVLL DGARWDRIEN SFEFSDLRKD GIFINNVSTV FPYTSGSLNV IFSGQFGKEN GVDGYYKVLK LKNSIQILPE ILQNYGYFTA RGLLNDKLLS PRGYDLRTVH NEFEDDLNDI HPKLINDVFQ KANGKPVFLF LHFTRIHTFT VSEILDKYDW NDKTFYDLVN SNLKKYDQTI DEAGHYAKKI TNTIKILGEE ENTIIIFFSD HGTGVGERYG ERSYGSFTYE ETIRTFYLFT GSLLLKNKFS DKLRSTLDIM PTILELCKID ENLNLFGKSF AGFLTGETTH LSENPFTFSE TGALEGPFPS PEHSNVFCIK TSNYKLIYYV SNNSWELFDL QKDFHEKNNL IGTLPLIEDD LKQKLLNWIN R
|
| |