Gene Nmar_1565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1565 
Symbol 
ID5774126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1433386 
End bp1434471 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content37% 
IMG OID641317218 
Productalcohol dehydrogenase 
Protein accessionYP_001582899 
Protein GI161529073 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TCATGAAGGC TCTAGTTTAT GAAGAATATA CTACTGATGA TGATTTTTCT 
AAAATTTTAA AAATTAAAGA TCTGCCAATT CCTGAACCAA AATCAAACGA AGTAGTTTTC
AAAGTAAAGG CAGCCGCATT AAATTATGAT GATATTTGGG GAATGAGAGG CAAACCTCTT
GCAATTCCTT TACCTCATAT TTCTGGAACT GATGCCGCAG GTGAAGTAAC TGCAGTAGGT
GAAGATGTAA AAAATTTCAA AGTAGGTGAT AGAGTGGTTT CACATGGAAA CATGTCTTGT
AGGGTGTGTA AGAGATGTAC ATCCGGACGC GAATATGATT GTAAAAAACG AACCATTTGG
GGATTTGAAA CAGGTCCTCT TTGGGGAGGA TACTGTGAAT ATACTCATCT TCCAGAAGTC
AATGTTGTAA AAATCCCTGA AGGAATATCA TATGAAGAAG CAGCAGCTGC ATCTATGACC
ATGTTAACTT CTTGGCATAT GTTAGTTGGC AGAGCAAAAA TTCAACCTGG ACAATTAGTT
TTGATCATGG GCGGAGGTTC TGGTGTTGGA AATTATGGAA TTCAGATTGC AAAACTTTTT
GGTTGTACTG TAATTGCAAC TGCTAGTCCT GATAAATTAG ATCAACTACT TGAACTTGGA
GCAGACTATG CAATTGATCA TAGAAAAGAA GACTGGCATA AAGAAGTAAG AGCAATTGCA
AAAAAACTTC CAAAACCATT TGGGGAGGTT CCTGGTGTAG ATGTAATTTT TGAACATATT
GGAGGCTCTC ATTGGAACAA AGAACTCACT CTTCTAAACT ATGGAGGCAC TGTGATTACT
ACTGGTGCGA CTACTGGTTA TATGGCAAAA ACTGATCTTA GACATATTTT CTTTAAAGGA
CTAAACATTT TGGGTTCAAC TCAGGGAACA AGAGCTGAGC TTGAAGAGGG ATTTTATTGG
ATGTCTAAAG GAAAAATAAA ATCCATAATT GATTCTGAAT ATACGCTTGA GCAAGCTGCT
GAAGCCCATA CAAAGATGCT AAAAGGTAAA GGACTTTTTG GAAAAATCAT TATGAAACCA
AACTGA
 
Protein sequence
MKKIMKALVY EEYTTDDDFS KILKIKDLPI PEPKSNEVVF KVKAAALNYD DIWGMRGKPL 
AIPLPHISGT DAAGEVTAVG EDVKNFKVGD RVVSHGNMSC RVCKRCTSGR EYDCKKRTIW
GFETGPLWGG YCEYTHLPEV NVVKIPEGIS YEEAAAASMT MLTSWHMLVG RAKIQPGQLV
LIMGGGSGVG NYGIQIAKLF GCTVIATASP DKLDQLLELG ADYAIDHRKE DWHKEVRAIA
KKLPKPFGEV PGVDVIFEHI GGSHWNKELT LLNYGGTVIT TGATTGYMAK TDLRHIFFKG
LNILGSTQGT RAELEEGFYW MSKGKIKSII DSEYTLEQAA EAHTKMLKGK GLFGKIIMKP
N