Gene Nmar_0592 details

Gene Information

Locus tag: Nmar_0592
Symbol:
ID: 5773619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 528167
End bp: 529426
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 36%
IMG OID: 641316226
Product: radical SAM domain-containing protein
Protein accession: YP_001581926
Protein GI: 161528100
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
            [TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit
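
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(528167, 529426)
print(length_bp)                  # 1260
print(protein_length(length_bp))  # 419
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.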


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTAA ACATAGACTC ACTTTTTAAA AATGCAGATC CAGTTGTATC AGAAATTCTA 
AACCGGGCAC TATCAGAAAA AGAGATTACA TCAAAAGAAG GATTAGAGTT GTACGATACA
TCAGGTATTG ATTTTCACCT AGTAGGATTA GTTGCAGATG AGCTAAGAAA GAGAAGGGTA
GGAGATGTTG TATCTTACGT AGTAAACAGG AACATCAATT TTACTAATGT CTGCATAAAA
CAATGTGGAT TTTGTGCATT TAGCAGAGAC TTTAGAGAAG AAGAAGGGTA TTTTCTTCCA
ACAGAAGAAA TCGTACGCAG AGCAAAGGAA GCACATCAAC TAGGAGCAAC AGAAGTTTGC
GTCCAAGCAG GTCTTCCACC AGACATGGAA GGAGATGTTT ATGAAAACAT TTGCAGAGAG
ATCAAAAAAG AGGTTCCAGA TATCCACATT CATGGATTCT CACCTGAAGA GATTCTCTAT
GGCGCAACAC GTTCAGGAGT AGAAATAGAA GAATTTCTAA AACGAATGAA GGAAGCAGGA
GTAGATACAC TCCCAGGAAC ATCTGCAGAG ATTTTAGACC AAGAACTCAG AGACAAGATT
TCACCAGGCA GAATAAGTGT AAAAGATTGG GAGAGAGTCA TCAAAAATGC CCACAAAATG
GGAATAAACA CAACATCAAC TATGATGTTT GGACATTTGG AATCACAAGA AGACAGAGTC
AAACACATTG AGAAATTAAG AGATATTCAA AAAGAGACAG GAGGGTTTAC AGAATTTGTT
CCACTTAACT TTATTCATAC AGAAGCACCA ATGTACAAAC ATCAATTACA TGAAGGAATC
AAACAAGGAG GTAGTGGAAA TGATGTCTTA CTCACTCATG CAATTGCAAG AATAATGCTG
AATAATCACA TCAATAACAT ACAAATGTCT TGGGTTAAAG AGGGACAAAA AATGTCTCAA
TTATTGTTAA TGTGGGGAGC TAATGACTTT GGAGGAACTT TGATTAACGA GAGTATTTCG
ACTTCTGCAG GTTCAGAGCA CGGACAATTG TTAAAACCAA AAGAAATCAG ACGCATGATA
AGAGAAATTA AAAGAATTCC AGCAGAAAGG AACACAAAAT ATGAAATATT GCGAAAATTT
GAAGATGACA CAGAGCACGA AGACGAATTA GACAAGATTT CAAACACATC ACAGTTTGGA
TCATATACTG AATTGATTAA GATAAACAAA TTCAATTATG AAAACCCAAG GAGAAAATAA
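
The GC content reported for this gene could be recomputed from the sequence block above. A minimal sketch, assuming the sequence is pasted as text in the 10-base block layout shown here; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGACACTAA ACATAGACTC ACTTTTTAAA AATGCAGATC CAGTTGTATC AGAAATTCTA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_percent` gives a figure that can be compared with the GC content field.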
 
Protein sequence
MTLNIDSLFK NADPVVSEIL NRALSEKEIT SKEGLELYDT SGIDFHLVGL VADELRKRRV 
GDVVSYVVNR NINFTNVCIK QCGFCAFSRD FREEEGYFLP TEEIVRRAKE AHQLGATEVC
VQAGLPPDME GDVYENICRE IKKEVPDIHI HGFSPEEILY GATRSGVEIE EFLKRMKEAG
VDTLPGTSAE ILDQELRDKI SPGRISVKDW ERVIKNAHKM GINTTSTMMF GHLESQEDRV
KHIEKLRDIQ KETGGFTEFV PLNFIHTEAP MYKHQLHEGI KQGGSGNDVL LTHAIARIML
NNHINNIQMS WVKEGQKMSQ LLLMWGANDF GGTLINESIS TSAGSEHGQL LKPKEIRRMI
REIKRIPAER NTKYEILRKF EDDTEHEDEL DKISNTSQFG SYTELIKINK FNYENPRRK
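
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch, assuming translation table 11 as listed in the record; that table assigns the same amino acids as the standard genetic code and differs only in its start codons. The sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. All names in the code are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACACTAA ACATAGACTC ACTTTTTAAA"))  # MTLNIDSLFK
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows a residue-by-residue comparison.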