Gene Nmar_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0156 
Symbol 
ID5774259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp145547 
End bp146671 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content28% 
IMG OID641315774 
Productsulfatase 
Protein accessionYP_001581492 
Protein GI161527666 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000195612 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAAAG AAAATTTGAT CATCATAATG ATTGATGGTG GAAGATTTGA TTATGGTTTA 
AATTCAAAAG TTTTTCAGGA AGTAGAAAAA ACTTCAGTAT TTTTTTCAAA TTCTATAACT
TATGGGCCCC ATACAATTGC TGCAATGCAT GCAGTATTTA GTGGATGTTA TGGAACTAGA
ACAGGAACAA ACAGTTATTG GTCTACATAT GATTTTAAAA AAGAAAGTTT TGTCACACTC
ACTGAATATT TGTCTTCTAA TGGATATTTT ACATCTGCAG ATCTGATTAA TGAGTTAGTA
GTTCCTAAAC AAGGATTTGA TGAATATATC GTACATGATG AAATTAATGA CGATTTAACT
TTAAGACACA AAAAAATCTT ATCAAAAATT CAAACTAAAA ATCAAAAGGG TCAACCATCT
TTTCTTTATT TACATTATAG TAAAATTCAC ACAGGTATAA TGAATGAGGT CTTAAAAAAA
TATGATAATT TTAGTGACGA ATTCTTTGAT AATCCTGATC AAAACAAAAA TAGATATGAA
AAATTATTTA TTTCTGCTGA AAATTATTTA AAAACAATTT TAGAAGAAAT TAAAAAACTA
GGATTGGATG ATAATTCTCT TATTCTAATT ATGTCTGATC ATGGTGTAAG TGTAGGTGAA
AAATTTGGTG AACGAGCTTA TGGAGCATTC TGTTATGATT ACACGTTAAA AACAATCACC
CATTTCATTT CAAAAAAATT TCAATCAAAA AGAATTACAC AGCAAGTACG CACAATAGAT
TTCATGCCTA CAATTTTACA ATTTTTAAAG ATCCCATTAG ATAATACCAA AGAACCATTA
GACGGGGTTT CTTTGATGCC CTTGATCAAT AACAAAAAAA TTGATGAACA ATTTGCCTAT
TCTGAAACAG GTAATCCTCT AAAAGAAAAA CAACCTCCAA AAATTCCAAA TGTTATGTCT
ATTCGTAACT CAAATTGGAA ACTAATATAC AATTTACACA ATGATTCCAA AGAAATGTAC
AATTTGCTTG AAGATCCGTT AGAATTAAAA AATTTGATTG GAACAAATAA CGAAATTGAA
TCCATGCTTT GGAATGAATT ACTCAAAATC CAACAATCTA ACTAA
 
Protein sequence
MAKENLIIIM IDGGRFDYGL NSKVFQEVEK TSVFFSNSIT YGPHTIAAMH AVFSGCYGTR 
TGTNSYWSTY DFKKESFVTL TEYLSSNGYF TSADLINELV VPKQGFDEYI VHDEINDDLT
LRHKKILSKI QTKNQKGQPS FLYLHYSKIH TGIMNEVLKK YDNFSDEFFD NPDQNKNRYE
KLFISAENYL KTILEEIKKL GLDDNSLILI MSDHGVSVGE KFGERAYGAF CYDYTLKTIT
HFISKKFQSK RITQQVRTID FMPTILQFLK IPLDNTKEPL DGVSLMPLIN NKKIDEQFAY
SETGNPLKEK QPPKIPNVMS IRNSNWKLIY NLHNDSKEMY NLLEDPLELK NLIGTNNEIE
SMLWNELLKI QQSN