Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0154 |
Symbol | |
ID | 5774241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 142688 |
End bp | 144052 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 641315772 |
Product | sulfatase |
Protein accession | YP_001581490 |
Protein GI | 161527664 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000000668791 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACCAA ATATACTGTT TTTAGTTGTA GACTCTTTAC GTTCGGATAA ATTTTATGGA GAATCTAAAA CATCAATTAC TCCGAACCTT GATTCTCTTT TAGAAAACGG AGTTTATTTT TCTCAGGCAA TTAGTTCTGT ACCTTCAACA TCTCCATCAA TGGGAAGTAT ATTCACTGGA CTGTTTCCAA TAAAAATTGG AATGGGACCT GAATCCTATG AAAAATTAAA TCCAAATGTT TCTACTTTTA TAGAACATTT TAAAAAAAAT GGATATTCAA CTTTTGCAAC TACTTCAGAG ATCAATTCAT TTTTGGGATT AACCGAAGAT TTTGATCTGA CTCTTCAACA GACTTCTCAT AATAATTATT TCAGCCTATT TTCTGGATTG GGGGAAAAAA TTACTGAAAA ATTACGTACT ATGAAAAAAG AACCTTGGTT CTTTTATATT CATATTAATG ATTTACATCA ACCCGTTATT GTACCTGAAA AATACTCTGA AGAAAAATTT GGAATAACAG ATTACGAAAA AATGCTTTCA GCAATAGATT TTTGGATTGG AAAATTTTTT GAACAAATAG ATTTTTCAAA AACTCTAGTA GTTTTAACTG CAGATCATGG TGAATATGTT CGCTCACTCC AAATTGATGG AAAAATGATC AATCTTGAAT CAAGCTCATC TGAAAAGACC TTGTGGAAGT TGGGGAATAA AATTCCCAAT TTTCTTTATG GGCCAAAAAG AAAATTATCT TCAATATTAC AAAAAACTAG AGATAAAAAT CGTCAAAAGA AAATTGAAGA ACTTGATCTT TCTGAATATG AAAAAAGAGT ATTATCAATG TCTAGAATGA GTTCAGGTTC TCATGTTTTT GATGATGTGT TAAAAGTTCC ATTAGTTTTC AAAGGATTCC CGATAAAAAA CCCAAAACTA ATTTCCCAAC AAGTTGGCTT GTTGGACATC TTTCCCACTA TTACAGATCT TATTGAAATT CCAAAGATTA ATGCAAAAAT TGATGGTAAT AGTTTGTATC CATTGATTCA AAATGAAAAA ATTGATGAAA AACCATTATT TATTCAAAGT ATGCCGTCAA TATCTGATGA TAATCTAATT CTTGTTGGAA TTAGAACAAA CTCTTTCAAA TATTTTCGTG AAAAGAACAA CAAAAAGAAA AACAAACTTT TTGATTTGGC AAATGATCCC TTAGAAGAAA AAGATATCTC TTCTCAAAAA CCAGAGATTG TTTTAAAAAT GGAAAAAATT CTCCAAGAAT ATCTGATCAC TGAAAATAAT TTTTCTCCAG ACTCTTTACA GAATGATGAA AGAAAGAAAG TTGAAGACGA ATTAAAAAAA CTGGGATATC TTTAA
|
Protein sequence | MKPNILFLVV DSLRSDKFYG ESKTSITPNL DSLLENGVYF SQAISSVPST SPSMGSIFTG LFPIKIGMGP ESYEKLNPNV STFIEHFKKN GYSTFATTSE INSFLGLTED FDLTLQQTSH NNYFSLFSGL GEKITEKLRT MKKEPWFFYI HINDLHQPVI VPEKYSEEKF GITDYEKMLS AIDFWIGKFF EQIDFSKTLV VLTADHGEYV RSLQIDGKMI NLESSSSEKT LWKLGNKIPN FLYGPKRKLS SILQKTRDKN RQKKIEELDL SEYEKRVLSM SRMSSGSHVF DDVLKVPLVF KGFPIKNPKL ISQQVGLLDI FPTITDLIEI PKINAKIDGN SLYPLIQNEK IDEKPLFIQS MPSISDDNLI LVGIRTNSFK YFREKNNKKK NKLFDLANDP LEEKDISSQK PEIVLKMEKI LQEYLITENN FSPDSLQNDE RKKVEDELKK LGYL
|
| |