Gene Nmar_1622 details

Gene Information

Locus tag: Nmar_1622
Symbol:
ID: 5773044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1477386
End bp: 1478462
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 40%
IMG OID: 641317275
Product: alcohol dehydrogenase
Protein accession: YP_001582956
Protein GI: 161529130
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
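
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1477386
END_BP = 1478462


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1077 358
```

Under these assumptions the results agree with the listed Gene Length (1077 bp) and Protein Length (358 aa).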


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAG TCGTATACAA TGAATATGCA CCAGATGATA ATTACGCTAA GATCCTTAAA 
GTCCAGGATA TAGACGAACC AAAACCAAAA GCAGATGAGG TAATTTTTAC CAATAAAGCA
TCTGCCCTAA ATTATAATGA TATTTGGGGG ATGAGAGGAG TTCCAGTAGC AGTTCCTCTT
CCACATGTTT CAGGTTCTGA TGTAGCTGGA GATGTTATCG CCGTAGGCGA AGATGTTAAA
AATTTCAAAG TAGGTGACAG AGTTGTCTCT CACTCAAATC TTGCATGCAG AGTTTGTAGT
GCATGTACTG ATGGAAGAGA ATTTGACTGT ACCCGAAGAC AAGTTTGGGG TTTCCAAACT
GGACCACTAT GGGGTGCATA CTCTGAACAA ATACACTTAC CAGAAGTCAA TGTTTCAAAA
ATTCCTGATG GAGTTTCATA TGAAGATGCA GCAGCAGCTT CAATGACAAT TCTTACCTCC
TGGCACATGT TAGTTGGTAG AGCAAAGATT ACTCCAGGAC AAACAGTACT CGTAATGGGT
GGTGGTTCTG GTGTCGGAAG CTTTGCAATT CAAATTGCTA AACTATACAA CTGTGATGTC
ATTGCAACTG CAAGTCCTGA CAAATTAGAC AAATGTAAGG AACTTGGAGC AGATTATGCA
GTAGACCACA GAAAAGACGA CTGGAGTAAA GAAGTCTTCA AAATTTCAAA AGAAATTGCA
AAAACAAAAG GTGAAGCACC TGGAATTGAT CTTGCATTTG ATCACATTGG TCAAACTCAC
TTCAACAAGC AACTAACATT GCTCAAGTAT GGTGCAACAC TAGTTTCATG TGGTGCAACA
ACAGGTTATG ACGCACAAAT AGATCTTAGA CACATCTTCT TCAAAGGAAT CAATGTCTTA
GGTTCAACAC AAGGAACTAA AGCTGAATTA GATCAAGGTC TATACTGGAT GGGTCAAGGA
AAGATAAAAT CAATTGTTGA CTCTGTCTTT ACCTTCGAAC AAGCAGCAGA GGCTCATACA
AAGATGCTAA AGGGTGACTT CTTTGGCAAA ATCATTATGA AGCCTGAAGG CGCTTAG
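
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch of how a GC percentage such as the one in the GC content field could be computed from this layout is shown below. The helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed value describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed (sample input only).
first_row = "ATGAAAGCAG TCGTATACAA TGAATATGCA CCAGATGATA ATTACGCTAA GATCCTTAAA"
print(len(clean_sequence(first_row)))   # number of bases in the row
print(round(gc_percent(first_row), 1))  # GC percentage of this row only
```

Passing the complete gene sequence to `gc_percent` instead of `first_row` gives the whole-gene figure.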
 
Protein sequence
MKAVVYNEYA PDDNYAKILK VQDIDEPKPK ADEVIFTNKA SALNYNDIWG MRGVPVAVPL 
PHVSGSDVAG DVIAVGEDVK NFKVGDRVVS HSNLACRVCS ACTDGREFDC TRRQVWGFQT
GPLWGAYSEQ IHLPEVNVSK IPDGVSYEDA AAASMTILTS WHMLVGRAKI TPGQTVLVMG
GGSGVGSFAI QIAKLYNCDV IATASPDKLD KCKELGADYA VDHRKDDWSK EVFKISKEIA
KTKGEAPGID LAFDHIGQTH FNKQLTLLKY GATLVSCGAT TGYDAQIDLR HIFFKGINVL
GSTQGTKAEL DQGLYWMGQG KIKSIVDSVF TFEQAAEAHT KMLKGDFFGK IIMKPEGA
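
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below is a plain-Python illustration rather than the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as starts), and it does not handle alternative start codons such as GTG or TTG. The function and variable names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; stop codons are shown as '*'."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last row of the gene sequence, copied as displayed.
first_30 = "ATGAAAGCAG TCGTATACAA TGAATATGCA"
last_row = "AAGATGCTAA AGGGTGACTT CTTTGGCAAA ATCATTATGA AGCCTGAAGG CGCTTAG"
print(translate(first_30))  # prints: MKAVVYNEYA
print(translate(last_row))  # prints: KMLKGDFFGKIIMKPEGA*
```

Both outputs match the corresponding ends of the protein sequence above, with the trailing `*` marking the TAG stop codon.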