Gene Nmar_0842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0842 
Symbol 
ID5774182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp743238 
End bp744203 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content32% 
IMG OID641316480 
Productluciferase family protein 
Protein accessionYP_001582176 
Protein GI161528350 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000207214 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTATTG CATGTAGTCT AGGCTCAATG TTATCCGTAA ATGAGGTTCT AAATTGTGCC 
GAAATTATAT CTAAAACCAC TGCAGACGCA ATCTGGATGC CTGAAACATG GGGTATGGAG
AATTTTTCAA TGTTAAGCGC AGTATCAAGC AAAACTTCTA CTCAAAAAAT AGGCTCATCA
ATCATCAACA TCTATTCTCG TAGTCCTGCA GCAATTGCAA TGGGGGCAGT CACAGTAGAT
ACAATATCTA AAGGAAGGGT AATTCTAGGT CTCGGAACTA GTAGTTTGCC AATCGTAGAG
ACTTTTCACG GATATAATTT TGAAAAGCCT TTGCAAAGAA TGAAAGAATA TGTTGAGATA
ATCAAGATGA TAACATCTGG AAAACCAATA AACTATTCAG GAAAAATTTT CAATTTGAAA
AATTTTACAT TATTGATCAA ACCACAAAGA GAATCAATTC CAATATACAT TGCAGCAGTT
AATGAAAAAA TGGTAAATTT AACATGGGAT CTTGGAGATG GTGTGATTTT TTATCTTAGA
CCTAAAAATG AAATGAAAGA AACGATTCAA AAAATGCAAT CAAAAAGAAA GATAGACGTC
ACATGTCAAA TAATTACATG CGTATCAAAT AACGCAGAAG AAGCAATAGA ACGTGCAAAA
AAGACATTAG CATTCTACGT TTCCGTTGGT AAAATCTATA GAGAATTTTT GGCAAAAAAT
GGATTTGAAA AAGAAACATC AAACATATTT GAAGAATTTA AAAAATCAGG ATTTTCATCA
AATCATGAAC TAGTCCCAGA TTCAATGTTA AAAGAACTTA CAATATCAGG AACTCCTGAA
GAATGTAAAA AACAACTTGA TGTTTTCAGA CAAACAGGAA TTGATTTGCC AATAATACAA
TTCAATCCAG TAGGTGACAC AATGGAATCG TTTAGATTAT TACAAAAAAC ATTTTTGGAT
GAATGA
 
Protein sequence
MRIACSLGSM LSVNEVLNCA EIISKTTADA IWMPETWGME NFSMLSAVSS KTSTQKIGSS 
IINIYSRSPA AIAMGAVTVD TISKGRVILG LGTSSLPIVE TFHGYNFEKP LQRMKEYVEI
IKMITSGKPI NYSGKIFNLK NFTLLIKPQR ESIPIYIAAV NEKMVNLTWD LGDGVIFYLR
PKNEMKETIQ KMQSKRKIDV TCQIITCVSN NAEEAIERAK KTLAFYVSVG KIYREFLAKN
GFEKETSNIF EEFKKSGFSS NHELVPDSML KELTISGTPE ECKKQLDVFR QTGIDLPIIQ
FNPVGDTMES FRLLQKTFLD E