Gene Nmar_0593 details


Gene Information

Locus tag: Nmar_0593
Symbol:
ID: 5773624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 529423
End bp: 530595
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 32%
IMG OID: 641316227
Product: radical SAM domain-containing protein
Protein accession: YP_001581927
Protein GI: 161528101
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit
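
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 529423
end_bp = 530595

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1173
print(remainder)       # 0
print(protein_length)  # 390
```

Under these assumptions the computed values agree with the listed Gene Length (1173 bp) and Protein Length (390 aa).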


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTGAGCA AACTTGTATT AAATTCTGAG AGTTTAAACA ATGTTTTAGA CAATAAAGAA 
ATCTCTCGTC ACAGTATTTT AGAAATTTAT CATAATGCAA TAAAAAACCC AAATGAACTT
TTTTCAACAG CACAAGTTCT AAGAAAAAAA CACAAAGGGA ATTCTGTAAC ATTTTCAAAA
AAAGCATTCT TCAACATTGT TAATCTCTGT AAAGACAGTT GTTCATACTG TACATACAAA
GCAGAGCCCG GAGAAGAGAA ACTATCATTA ATGTCAAAAC AACAAATTAC AGAATTACTA
GATCTTGCAA AAAAATACAG ATGTGTTGAA GCATTGTTTG TAACAGGGGA ACAACCAGAA
CAAAGATACC AAGAAGCAAA AGACTGGCTA AAAGAAAATG GATTCACATC TACATCAGAA
TATCTAATTC ATGCATCAGA AATTGCATTA GAAAAAGGAC TTTTCCCGCA CACAAATGCA
GGAAATTTGA ATTTTGAAGA GATGAAAGAA CTAAAGAAAA CAAATGTTTC AATGGGCCTA
ATGCTTGAAA ACATTAGTGA AAGATTAACA GAAAAAGGAA TGCCACATTA TTTGGCAGCT
AGTAAAAGAC CAAAAGCAAG ATTAGAAATT TTAGAAAACT CTGGAAAATT ACAAATTCCA
ATGACTACAG GAATTCTTGT AGGAATAGGA GAAACAATAG AAGAAATCAT TGATTCATTA
TTAACAATCA AAGAGTTACA CCAGAAATAT GGAAACATTC AAGAGGTAAT TTTACAAAAT
TTCCAACCAA AACAAGACAC AAGAATGAAA GATGAACCAT CTGCAGATGA AAAATATTTC
AAAACAATTG TTGCCCTATC TAGAATAATT ATGCCAGAGA TGAACATACA GATTCCACCA
AATCTATCTC CAAAGTCTTA TCAGAGTTTT TTGTCTGTAG GAATTAATGA TTGGGGAGGA
ATTTCACCTT TGACTCCTGA TTTTGTGAAC CCCGAATTTT CTTGGCCAGA AATTAACAAA
GTCGATGAAA ATTCAAAAAG TGCAGGATTT GATTTGAAAT GCAGATTCCC AATATACCCG
GAATTCTTTT CTTTTATTAG TAAAGAACTA CAAGAGAAGA TGAAAGAGAT TCAAAATGAG
GAAGGGTTGG TAAAAGAGGA GTATTGGAGA TGA
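
The GC content field in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the function name `gc_content` is hypothetical, and the example call uses only the first line of the sequence, so its result will not necessarily match the whole-gene figure.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in the record)
    is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example: first line of the gene sequence only (not the whole gene).
first_line = "TTGTTGAGCA AACTTGTATT AAATTCTGAG AGTTTAAACA ATGTTTTAGA CAATAAAGAA"
print(round(gc_content(first_line), 1))
```

Pasting the full gene sequence into the call should give a value comparable to the rounded percentage listed in the record.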
 
Protein sequence
MLSKLVLNSE SLNNVLDNKE ISRHSILEIY HNAIKNPNEL FSTAQVLRKK HKGNSVTFSK 
KAFFNIVNLC KDSCSYCTYK AEPGEEKLSL MSKQQITELL DLAKKYRCVE ALFVTGEQPE
QRYQEAKDWL KENGFTSTSE YLIHASEIAL EKGLFPHTNA GNLNFEEMKE LKKTNVSMGL
MLENISERLT EKGMPHYLAA SKRPKARLEI LENSGKLQIP MTTGILVGIG ETIEEIIDSL
LTIKELHQKY GNIQEVILQN FQPKQDTRMK DEPSADEKYF KTIVALSRII MPEMNIQIPP
NLSPKSYQSF LSVGINDWGG ISPLTPDFVN PEFSWPEINK VDENSKSAGF DLKCRFPIYP
EFFSFISKEL QEKMKEIQNE EGLVKEEYWR
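
The gene sequence starts with TTG while the protein sequence starts with M, which is consistent with translation table 11, where TTG is one of the permitted start codons. The following is a minimal sketch of that translation, assuming the codon assignments and start codons of NCBI translation table 11; the function name `translate_cds` is hypothetical, and the code uses only the Python standard library rather than any particular bioinformatics package.

```python
from itertools import product

# Codon assignments for translation table 11, listed in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; as initiators they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("TTGTTGAGCA AACTTGTATT AAATTCTGAG"))  # MLSKLVLNSE
```

The output matches the first ten residues of the protein sequence above. Note that only the first codon is treated as an initiator; the second TTG is read as L.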