Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0156 |
Symbol | |
ID | 5774259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 145547 |
End bp | 146671 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 641315774 |
Product | sulfatase |
Protein accession | YP_001581492 |
Protein GI | 161527666 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000195612 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAAAG AAAATTTGAT CATCATAATG ATTGATGGTG GAAGATTTGA TTATGGTTTA AATTCAAAAG TTTTTCAGGA AGTAGAAAAA ACTTCAGTAT TTTTTTCAAA TTCTATAACT TATGGGCCCC ATACAATTGC TGCAATGCAT GCAGTATTTA GTGGATGTTA TGGAACTAGA ACAGGAACAA ACAGTTATTG GTCTACATAT GATTTTAAAA AAGAAAGTTT TGTCACACTC ACTGAATATT TGTCTTCTAA TGGATATTTT ACATCTGCAG ATCTGATTAA TGAGTTAGTA GTTCCTAAAC AAGGATTTGA TGAATATATC GTACATGATG AAATTAATGA CGATTTAACT TTAAGACACA AAAAAATCTT ATCAAAAATT CAAACTAAAA ATCAAAAGGG TCAACCATCT TTTCTTTATT TACATTATAG TAAAATTCAC ACAGGTATAA TGAATGAGGT CTTAAAAAAA TATGATAATT TTAGTGACGA ATTCTTTGAT AATCCTGATC AAAACAAAAA TAGATATGAA AAATTATTTA TTTCTGCTGA AAATTATTTA AAAACAATTT TAGAAGAAAT TAAAAAACTA GGATTGGATG ATAATTCTCT TATTCTAATT ATGTCTGATC ATGGTGTAAG TGTAGGTGAA AAATTTGGTG AACGAGCTTA TGGAGCATTC TGTTATGATT ACACGTTAAA AACAATCACC CATTTCATTT CAAAAAAATT TCAATCAAAA AGAATTACAC AGCAAGTACG CACAATAGAT TTCATGCCTA CAATTTTACA ATTTTTAAAG ATCCCATTAG ATAATACCAA AGAACCATTA GACGGGGTTT CTTTGATGCC CTTGATCAAT AACAAAAAAA TTGATGAACA ATTTGCCTAT TCTGAAACAG GTAATCCTCT AAAAGAAAAA CAACCTCCAA AAATTCCAAA TGTTATGTCT ATTCGTAACT CAAATTGGAA ACTAATATAC AATTTACACA ATGATTCCAA AGAAATGTAC AATTTGCTTG AAGATCCGTT AGAATTAAAA AATTTGATTG GAACAAATAA CGAAATTGAA TCCATGCTTT GGAATGAATT ACTCAAAATC CAACAATCTA ACTAA
|
Protein sequence | MAKENLIIIM IDGGRFDYGL NSKVFQEVEK TSVFFSNSIT YGPHTIAAMH AVFSGCYGTR TGTNSYWSTY DFKKESFVTL TEYLSSNGYF TSADLINELV VPKQGFDEYI VHDEINDDLT LRHKKILSKI QTKNQKGQPS FLYLHYSKIH TGIMNEVLKK YDNFSDEFFD NPDQNKNRYE KLFISAENYL KTILEEIKKL GLDDNSLILI MSDHGVSVGE KFGERAYGAF CYDYTLKTIT HFISKKFQSK RITQQVRTID FMPTILQFLK IPLDNTKEPL DGVSLMPLIN NKKIDEQFAY SETGNPLKEK QPPKIPNVMS IRNSNWKLIY NLHNDSKEMY NLLEDPLELK NLIGTNNEIE SMLWNELLKI QQSN
|
| |