Gene Nmar_0663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0663 
Symbol 
ID5773410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp604071 
End bp605186 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content29% 
IMG OID641316299 
Productsulfatase 
Protein accessionYP_001581997 
Protein GI161528171 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.65595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGCCTA ACATTATTTT TGTATTATTA GACGGTGCAC GGTGGGATCG TATAGAAAAT 
TCATTTGAAT TTTCAGATTT AAGAAAAGAT GGAATTTTTA TAAATAATGT TTCAACTGTT
TTTCCATACA CTTCTGGTTC ACTAAATGTA ATTTTTTCAG GACAATTTGG TAAAGAGAAT
GGTGTAGATG GCTATTACAA AGTATTGAAG TTAAAAAATT CCATACAAAT TTTGCCTGAG
ATTCTTCAAA ATTATGGATA TTTTACTGCT CGGGGGTTAC TTAATGATAA ATTATTGTCT
CCTAGAGGAT ATGATCTTCG AACTGTTCAT AATGAATTTG AAGATGATCT AAATGACATT
CATCCAAAAT TAATTAATGA TGTATTTCAA AAGGCAAATG GAAAACCTGT TTTTCTTTTT
TTACATTTTA CTCGTATCCA TACATTTACG GTTTCTGAAA TTTTAGATAA ATATGACTGG
AACGATAAGA CCTTTTATGA TTTAGTAAAT TCCAATCTTA AGAAATATGA TCAAACAATT
GATGAAGCTG GGCATTATGC CAAAAAAATC ACCAATACAA TCAAGATTCT TGGAGAAGAG
GAAAACACAA TAATTATTTT CTTTTCAGAT CATGGTACTG GTGTTGGAGA AAGATATGGT
GAACGAAGCT ATGGTTCATT TACGTATGAA GAAACAATTA GAACTTTTTA TCTTTTCACA
GGGTCACTTC TTCTAAAAAA TAAATTTTCT GACAAATTAA GAAGTACTTT GGATATAATG
CCTACTATCC TTGAATTGTG TAAAATAGAT GAAAATCTGA ATCTCTTTGG AAAAAGCTTT
GCAGGATTTC TCACTGGAGA AACAACTCAT CTTAGTGAAA ACCCATTTAC TTTTTCTGAA
ACTGGTGCAT TAGAAGGTCC ATTCCCTTCT CCTGAGCATT CTAATGTTTT TTGTATTAAA
ACATCTAATT ATAAATTAAT TTATTATGTT TCTAACAATT CTTGGGAATT ATTTGATTTG
CAAAAAGATT TCCATGAAAA AAATAATTTG ATAGGAACTT TACCATTAAT TGAAGATGAT
TTAAAACAAA AACTCTTGAA TTGGATAAAT CGTTAA
 
Protein sequence
MKPNIIFVLL DGARWDRIEN SFEFSDLRKD GIFINNVSTV FPYTSGSLNV IFSGQFGKEN 
GVDGYYKVLK LKNSIQILPE ILQNYGYFTA RGLLNDKLLS PRGYDLRTVH NEFEDDLNDI
HPKLINDVFQ KANGKPVFLF LHFTRIHTFT VSEILDKYDW NDKTFYDLVN SNLKKYDQTI
DEAGHYAKKI TNTIKILGEE ENTIIIFFSD HGTGVGERYG ERSYGSFTYE ETIRTFYLFT
GSLLLKNKFS DKLRSTLDIM PTILELCKID ENLNLFGKSF AGFLTGETTH LSENPFTFSE
TGALEGPFPS PEHSNVFCIK TSNYKLIYYV SNNSWELFDL QKDFHEKNNL IGTLPLIEDD
LKQKLLNWIN R