Gene Nmar_0154 details


Gene Information

Locus tag: Nmar_0154
Symbol:
ID: 5774241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 142688
End bp: 144052
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 29%
IMG OID: 641315772
Product: sulfatase
Protein accession: YP_001581490
Protein GI: 161527664
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
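
The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 142688
END_BP = 144052
GENE_LENGTH_BP = 1365
PROTEIN_LENGTH_AA = 454


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the reported ones.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the reported gene length and protein length.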


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000668791
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACCAA ATATACTGTT TTTAGTTGTA GACTCTTTAC GTTCGGATAA ATTTTATGGA 
GAATCTAAAA CATCAATTAC TCCGAACCTT GATTCTCTTT TAGAAAACGG AGTTTATTTT
TCTCAGGCAA TTAGTTCTGT ACCTTCAACA TCTCCATCAA TGGGAAGTAT ATTCACTGGA
CTGTTTCCAA TAAAAATTGG AATGGGACCT GAATCCTATG AAAAATTAAA TCCAAATGTT
TCTACTTTTA TAGAACATTT TAAAAAAAAT GGATATTCAA CTTTTGCAAC TACTTCAGAG
ATCAATTCAT TTTTGGGATT AACCGAAGAT TTTGATCTGA CTCTTCAACA GACTTCTCAT
AATAATTATT TCAGCCTATT TTCTGGATTG GGGGAAAAAA TTACTGAAAA ATTACGTACT
ATGAAAAAAG AACCTTGGTT CTTTTATATT CATATTAATG ATTTACATCA ACCCGTTATT
GTACCTGAAA AATACTCTGA AGAAAAATTT GGAATAACAG ATTACGAAAA AATGCTTTCA
GCAATAGATT TTTGGATTGG AAAATTTTTT GAACAAATAG ATTTTTCAAA AACTCTAGTA
GTTTTAACTG CAGATCATGG TGAATATGTT CGCTCACTCC AAATTGATGG AAAAATGATC
AATCTTGAAT CAAGCTCATC TGAAAAGACC TTGTGGAAGT TGGGGAATAA AATTCCCAAT
TTTCTTTATG GGCCAAAAAG AAAATTATCT TCAATATTAC AAAAAACTAG AGATAAAAAT
CGTCAAAAGA AAATTGAAGA ACTTGATCTT TCTGAATATG AAAAAAGAGT ATTATCAATG
TCTAGAATGA GTTCAGGTTC TCATGTTTTT GATGATGTGT TAAAAGTTCC ATTAGTTTTC
AAAGGATTCC CGATAAAAAA CCCAAAACTA ATTTCCCAAC AAGTTGGCTT GTTGGACATC
TTTCCCACTA TTACAGATCT TATTGAAATT CCAAAGATTA ATGCAAAAAT TGATGGTAAT
AGTTTGTATC CATTGATTCA AAATGAAAAA ATTGATGAAA AACCATTATT TATTCAAAGT
ATGCCGTCAA TATCTGATGA TAATCTAATT CTTGTTGGAA TTAGAACAAA CTCTTTCAAA
TATTTTCGTG AAAAGAACAA CAAAAAGAAA AACAAACTTT TTGATTTGGC AAATGATCCC
TTAGAAGAAA AAGATATCTC TTCTCAAAAA CCAGAGATTG TTTTAAAAAT GGAAAAAATT
CTCCAAGAAT ATCTGATCAC TGAAAATAAT TTTTCTCCAG ACTCTTTACA GAATGATGAA
AGAAAGAAAG TTGAAGACGA ATTAAAAAAA CTGGGATATC TTTAA
 
Protein sequence
MKPNILFLVV DSLRSDKFYG ESKTSITPNL DSLLENGVYF SQAISSVPST SPSMGSIFTG 
LFPIKIGMGP ESYEKLNPNV STFIEHFKKN GYSTFATTSE INSFLGLTED FDLTLQQTSH
NNYFSLFSGL GEKITEKLRT MKKEPWFFYI HINDLHQPVI VPEKYSEEKF GITDYEKMLS
AIDFWIGKFF EQIDFSKTLV VLTADHGEYV RSLQIDGKMI NLESSSSEKT LWKLGNKIPN
FLYGPKRKLS SILQKTRDKN RQKKIEELDL SEYEKRVLSM SRMSSGSHVF DDVLKVPLVF
KGFPIKNPKL ISQQVGLLDI FPTITDLIEI PKINAKIDGN SLYPLIQNEK IDEKPLFIQS
MPSISDDNLI LVGIRTNSFK YFREKNNKKK NKLFDLANDP LEEKDISSQK PEIVLKMEKI
LQEYLITENN FSPDSLQNDE RKKVEDELKK LGYL
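
The GC content field can in principle be recomputed from the gene sequence in this record. The sketch below assumes the sequence is pasted as plain text in the same layout (blocks of 10 bases separated by spaces and line breaks); `gc_content` is a hypothetical helper name, and the sample string is only the first block of the gene sequence, not the full gene.

```python
def gc_content(sequence_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored so that block-formatted
    sequence text can be passed in unchanged.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence, used here only as a small sample.
sample = "ATGAAACCAA"
print(gc_content(sample))
```

To check the reported figure, pass the complete gene sequence text to `gc_content` instead of the sample.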