Gene Nmar_1158 details

Gene Information

Locus tag: Nmar_1158
Symbol:
ID: 5773209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1059089
End bp: 1060267
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 35%
IMG OID: 641316801
Product: hypothetical protein
Protein accession: YP_001582492
Protein GI: 161528666
COG category: [R] General function prediction only
COG ID: [COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1
TIGRFAM ID:
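
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1059089
end_bp = 1060267
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed
# to be a stop codon that contributes no amino acid.
codons, remainder = divmod(span, 3)

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.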


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAAT TCAAAATTTA TACTCCTGAC TTGAGCCAAT ACCTTTTCTA CATTACACAT 
ATTGACAATA TTCCGTCCAT GCTGCAAAAT GGAATTCTCT CTCATGATCA AATTGTAAAA
CAAAAACTCG AATACACTTC AATTTATGAC AGCGATATTG TTTCCAGTAG AAGAGAAAAA
ACCGCTAATG GGAAAAGTCT TTGGTATTTT GCTAATCTGT ATTTCCAACC TCGAAATCCT
ATGCTGTACC GTGTAACTAT GGAAAAATCT CCTGATTGCA TAGCTGTTGT CGCTGTTGAT
AAAAAAATCC TTACTACTCC TAACACAATT ATCACTGACG GAAATGCTGC AAATGGTCCT
ACACATTTCT ATCCTAATAC TGAATTCAAA ACTATTGAAC GACAAATTAA CAGAATTACT
AGTTTACAGT GGTGGACAGA TAGTAATGCT ACTAAAAGAC AAATTATGGC TGAATGTCTA
GTTCCAGAAC GAATTCCACC TGAATTCATT CGTGCAATTT ATGTTAGTAA TCATACTCTT
GCAGACGAAA TACGAAATTC TATGTCTTCT AGCATTTCAA TAATTCCTGA ACCTTCTATG
TTCTTCCAAC CTGTAAGACA AATTCCACTC ACTTCAAATC TTTCATTGGT AGAAGGAGAT
TTGTTTTTCT CAAAAATGCA AACCCTTACT GTAAGTGTAA ATTGTATTGG AGTAATGGGA
AAAGGGTTGG CCTCTAGAGC AAAATATCAA TTCCCAGATG TTTATGTACA CTATCAAGAT
CAGTGTAAAA GAAAAACACT TCGAATGGGC AAGCCTGTAT TGTATCAACG TGAAGCCCCA
TATCATGAAC AAATTGCAGA TGATCCTAGC TCTTTAGGAA ATAAAAAAGA CACATGGTTT
TTGCTTTTTG CTACAAAACA ACACTGGCGA GATAATAGCG ATATTGAGGG TATAGAGCAG
GGATTAAAAT GGCTTTTAGA TAACTATGAA CAAAAAGGAA TAGAATCACT TGCAATTCCT
GCACTAGGTT GTGGTTTAGG ACGTCTAAGT TGGGAAGATG TTGGACCCAT ACTTTGCAAG
TATCTATCTC AAATGAACAT ACCCGTGTGG ATATATTTGC CTGCAGAAAA ACAACTTTCT
AATAATTTAC TTACAAAGGA GTTTCTATTG GATAGTTAA
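
The GC content field can be recomputed from a gene sequence laid out like the one above. This is a minimal sketch: `gc_content` is a hypothetical helper, not a function of the source database, and it assumes the sequence contains only A, C, G and T, with whitespace used purely for layout.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq.

    Whitespace (the 10-base block separators and line breaks used
    in the listing) is ignored; case is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGATGAAT TCAAAATTTA TACTCCTGAC TTGAGCCAAT ACCTTTTCTA CATTACACAT"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1179-base sequence instead of `first_line` gives the value to compare with the GC content field.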
 
Protein sequence
MDEFKIYTPD LSQYLFYITH IDNIPSMLQN GILSHDQIVK QKLEYTSIYD SDIVSSRREK 
TANGKSLWYF ANLYFQPRNP MLYRVTMEKS PDCIAVVAVD KKILTTPNTI ITDGNAANGP
THFYPNTEFK TIERQINRIT SLQWWTDSNA TKRQIMAECL VPERIPPEFI RAIYVSNHTL
ADEIRNSMSS SISIIPEPSM FFQPVRQIPL TSNLSLVEGD LFFSKMQTLT VSVNCIGVMG
KGLASRAKYQ FPDVYVHYQD QCKRKTLRMG KPVLYQREAP YHEQIADDPS SLGNKKDTWF
LLFATKQHWR DNSDIEGIEQ GLKWLLDNYE QKGIESLAIP ALGCGLGRLS WEDVGPILCK
YLSQMNIPVW IYLPAEKQLS NNLLTKEFLL DS
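
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGATGAAT TCAAAATTTA TACTCCTGAC"))  # MDEFKIYTPD
```

The output matches the first ten residues of the protein sequence; translating the last line of the gene sequence in the same way yields the final residues NNLLTKEFLLDS followed by the TAA stop codon.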