Gene Nmar_1191 details


Gene Information

Locus tag: Nmar_1191
Symbol:
ID: 5773787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1086413
End bp: 1088440
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 34%
IMG OID: 641316835
Product: hypothetical protein
Protein accession: YP_001582525
Protein GI: 161528699
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1331] Highly conserved protein containing a thioredoxin domain
TIGRFAM ID:
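
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1086413, 1088440)
print(length)                     # 2028
print(protein_length_aa(length))  # 675
```

Under these assumptions the computed values agree with the Gene Length (2028 bp) and Protein Length (675 aa) fields.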


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGAAA ATAATCTTAT TCATGAAACA AGTCCTTATC TTCTTCAACA TGCTCATAAT 
CCAGTTGACT GGTATGGTTG GAATGATGAA GCATTAAAAA AAGCAAAAGA TGAAAACAAA
CCGATCTTTC TTAGTATTGG TTACAGTTCT TGTCATTGGT GCCATGTTAT GGCACATGAA
TCATTTGAAA ATGAAGAAGT TGCAAAATTC ATGAATGAAA ATTTTGTAAA TATCAAAGTA
GATAGAGAAG AACGCCCTGA CATTGATGAC ATTTATCAAA AAGCTTGTCA GATAGCTACT
GGTCAAGGAG GATGGCCTTT GAGTATTTTC TTAACTCCTG ATCAAAAACC ATTCTATGTT
GGAACTTATT TTCCAATTCT GGATTCTTAC GGTCGTCCGG GATTTGGGAG TATATGCAGA
CAACTATCTC AAGCTTGGAA AGAAAAACCT AAAGACATTG AAAAATCTGC AGATAATTTT
CTTGATGCAC TAAATAAAAC TGAAAAAGTT TCTATTTCTT CAAAATTAGA AAGAACCATT
CTTGATGAGG CAGCAATGAA TCTTTTCCAA CTGGGTGATT CTGCTTATGG TGGATTTGGT
TCTGCTCCAA AATTTCCAAA CGCTGCTAAT GTTTCCTTTT TGTTTCGTTA TGCAAAGATA
TCTGGGTTGT CAAAATTCAC AGAATTTGGG CTCAAAACTC TCAAAAAAAT GGCAAATGGT
GGAATATTTG ATCAAATTGG TGGTGGATTT CATCGATATT CTACAGATGC AAAATGGCTT
GTACCTCACT TTGAAAAAAT GCTCTATGAT AACGCACTAA TTCCTGTAAA TTATGCTGAG
GCATTTCAGA TAACAAAGGA TCCTTTCTAT CTAGATGTCT TGAAAAAAAC CCTTGATTTT
GTTTTGCGTG AAATGACTTC TCCTGAAGGT GGTTTCTATT CTGCATATGA TGCAGACTCT
GAAGGTGTAG AGGGAAAATT CTATGTCTGG AAGAAAAGCG AGATTAAAGA AATTCTTGGT
GATGATGCTG ACATCTTTTG CTTATTTTAT GATGCCACTG ATGGTGGAAA CTGGGAAGGA
AACAACATTT TGTGTAATAA CTTGAATATC TCTACAGTTG CCTTTAATTT TGGAACTACT
GAAGAAAAGG TTAGAGAAAT TCTTCAGGCC TGTTCTAAAA AGTTACTTGA TGTTCGTTCC
AAGAGAGTTG CCCCTGGACT GGATGATAAA ATTCTAGTTT CGTGGAATTC TTTAATGATT
ACTGCCTTTG CTAAGGGTTA TCGTGTAACA AATGAATCTA GATATCTTGA TGCTGCAAAA
GATTGTATCT CCTTTATTGA AAATAATTTG TTTTCAGGAG ACAAGTTACT ACGAACTTAT
AAAAACAAAA CTGCAAAAAT TGATGGCTAT CTAGAAGACT ATTCTTATTT TGTAAATTGC
TTGTTAGATG TATTTGAAAT TGAACCTGAT CCAAAATATC TAAAACTTGC ACTAAAACTA
GGCCATCACT TGGTGGAACA TTTCTGGGAT TCAGAAAACA ATAGTTTCTT TATGACTTCA
GACAATCATG AAAAACTGAT TATACGACCC AAAAGCAATT ATGATTTGTC TTTGCCTTCT
GGAAACTCTG TTTCTGCATT TGTCATGCTC AGACTATTCC ATTTCTCTCA AGAACAACAA
TTCTTAGATA TTGCTACAAA AATCATGGAA TCTCAGGCAC AAATGGCTGC TGAAAATCCA
TTTGGATTTG GATATCTGCT AAACACAATT TCAATTTATT TGGAAAAACC TGTTGAAATC
ACAATCATAA ACACTGAAAA TTCTCAACTT TGTGACTCAA TTCTTTTGGA ATATTTACCA
AACTCAATTG TTGTCACTAT TCAAAATTCT ACTCAGTTGT CGGCTCTATC TGAATATCCT
TTCTTTGCTG GAAAATCTTT TGAAGAAAAA ACATCTGCAT TTGTTTGTAA AAACTTTACT
TGTTCATTAC CTTTGCATAC TATTGATGAA ATAAACTCAC ATCTTTAG
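
A GC percentage like the one reported in the record can be recomputed from the gene sequence. This is a minimal sketch: the function name is hypothetical, whitespace is stripped because the page prints the sequence in blocks of ten bases, and how the database rounds its reported value is not stated here.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is ignored so the blocked page layout
    can be pasted in directly.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First two ten-base blocks of the gene sequence above.
print(gc_content("GTGACAGAAA ATAATCTTAT"))  # 25.0
```

Passing the full gene sequence instead of the two-block excerpt gives the value to compare against the GC content field.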
 
Protein sequence
MTENNLIHET SPYLLQHAHN PVDWYGWNDE ALKKAKDENK PIFLSIGYSS CHWCHVMAHE 
SFENEEVAKF MNENFVNIKV DREERPDIDD IYQKACQIAT GQGGWPLSIF LTPDQKPFYV
GTYFPILDSY GRPGFGSICR QLSQAWKEKP KDIEKSADNF LDALNKTEKV SISSKLERTI
LDEAAMNLFQ LGDSAYGGFG SAPKFPNAAN VSFLFRYAKI SGLSKFTEFG LKTLKKMANG
GIFDQIGGGF HRYSTDAKWL VPHFEKMLYD NALIPVNYAE AFQITKDPFY LDVLKKTLDF
VLREMTSPEG GFYSAYDADS EGVEGKFYVW KKSEIKEILG DDADIFCLFY DATDGGNWEG
NNILCNNLNI STVAFNFGTT EEKVREILQA CSKKLLDVRS KRVAPGLDDK ILVSWNSLMI
TAFAKGYRVT NESRYLDAAK DCISFIENNL FSGDKLLRTY KNKTAKIDGY LEDYSYFVNC
LLDVFEIEPD PKYLKLALKL GHHLVEHFWD SENNSFFMTS DNHEKLIIRP KSNYDLSLPS
GNSVSAFVML RLFHFSQEQQ FLDIATKIME SQAQMAAENP FGFGYLLNTI SIYLEKPVEI
TIINTENSQL CDSILLEYLP NSIVVTIQNS TQLSALSEYP FFAGKSFEEK TSAFVCKNFT
CSLPLHTIDE INSHL
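
The protein sequence can be reproduced from the gene sequence with NCBI translation table 11, the table named in the record. The sketch below builds the codon table from the standard NCBI amino-acid string in T, C, A, G base order. Table 11 allows alternative start codons such as GTG, which are read as methionine at the first position; that is why this CDS starts with GTG while the protein starts with M. The function name and the choice to stop at the first stop codon are illustrative assumptions.

```python
from itertools import product

BASES = "TCAG"
# NCBI translation table 11 amino acids, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons permitted by table 11; each is translated as M when it opens a CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(seq: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        codon = cleaned[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases of the gene sequence.
print(translate_table11("GTGACAGAAA ATAATCTTAT TCATGAAACA"))  # MTENNLIHET
```

The ten residues produced from the first thirty bases match the start of the protein sequence above.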