Gene Nmar_1079 details

Gene Information

Locus tag: Nmar_1079
Symbol:
ID: 5774107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 983428
End bp: 984456
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 36%
IMG OID: 641316721
Product: alcohol dehydrogenase
Protein accession: YP_001582413
Protein GI: 161528587
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
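
The coordinate and length fields above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 983428
END_BP = 984456

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = END_BP - START_BP + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1029 bp
print(protein_length, "aa")   # 342 aa
```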


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACTG CATCTGTTAA AGAACCATCA GTTATCTCTG TAAGTGAAAC AGAAAATCCT 
TCTTTGGAGT CTGGTGAAAT TTTAGTTCAG ATGCATGCAT GTGGAATATG TGGCTCTGAT
TTGGAAAAAG TATTTGGACA ATATGGACAA CCATCAATGC GTCTAGGCCA TGAACCTGCT
GGTATTGTTT TAGATGTTGG TTCTGGTGTA ACTGAATTCA AAAAAGGTGA CAGAATATTT
ACTCACCATC ATGTTCCTTG TTATGATTGT CATTTTTGTA ATCATGGAAA CGAGACAATG
TGCAAAAAAT ACTATGAAAC TAATCTGTCT CCCTGTGGTT TATCTGAGCA ATATGTTGTT
CCTGCATGGA ATGTTTCTCA TGGTGGAGTT TTGAAAATAT CTGATTCTCT TAGTTTTGAA
GAAGCTGCGA TGATTGAACC ACTTGCATGT TGTGTTCGGG CCTGGACAAA ATACCATTAT
CAGGAAGGAG ACAGTGCTGC AATCTTTGGA GTTGGTCCTA CTGGAATGAT GCATGTGATG
CTTGCTCAGG CAAAGAAATT CTCCAAAATC TTCTGTTTTG ATGTTAATGA TTTTAGATTG
GACTTTGCAA AAAAATTCAA CATTACAGAA TCCATCAACT CTATGGATGA AACTAAGAAA
CAGAAAATCT TAGAGCATAC TGACAACCAA GGAGTTGATG TTGCTATTGT TGCAACTAGC
AGTCTCAAAG CTCTTGATGA TGCAATTGAT ATGGTTAGAA AAGGCGGTGC TATAATGATG
TTTGGAGTTC CTTCAAAAGG TGCAAAAATG GATTTAGACA TGAGTAAAAT CTATTCAAAA
GAAATCACTC TTGTTACTAG TTATGCTGCA TCTGATAATG ATACAAAAGA AGCATTGAAT
CTAATCGAAT CATTACAAAT TGATGTCAAA CAGTTAATCA CACACACTTA TCCAATTGAT
GATACTCAAA AGGCATTTGA TCATGCACGA AGTGGTGACA ATGCAATGAA AATAATCATT
ACAAAATAA
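
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch: `gc_fraction` is a hypothetical helper, not a function of the source database, and it assumes the sequence is pasted as plain text with the spaces and line breaks shown above.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage on the first 10-base block of the gene sequence.
first_block = "ATGAAAACTG"
print(f"{gc_fraction(first_block):.0%}")  # 30%
```

Applying the same function to the full 1029 bp sequence gives the whole-gene value to compare with the GC content field.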
 
Protein sequence
MKTASVKEPS VISVSETENP SLESGEILVQ MHACGICGSD LEKVFGQYGQ PSMRLGHEPA 
GIVLDVGSGV TEFKKGDRIF THHHVPCYDC HFCNHGNETM CKKYYETNLS PCGLSEQYVV
PAWNVSHGGV LKISDSLSFE EAAMIEPLAC CVRAWTKYHY QEGDSAAIFG VGPTGMMHVM
LAQAKKFSKI FCFDVNDFRL DFAKKFNITE SINSMDETKK QKILEHTDNQ GVDVAIVATS
SLKALDDAID MVRKGGAIMM FGVPSKGAKM DLDMSKIYSK EITLVTSYAA SDNDTKEALN
LIESLQIDVK QLITHTYPID DTQKAFDHAR SGDNAMKIII TK
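
The protein sequence is the translation of the gene sequence under translation table 11. As a rough illustration, the sketch below builds the codon table from the standard NCBI base ordering (table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons) and translates until the first stop codon. `CODON_TABLE` and `translate` are illustrative names, and the sketch does not handle ambiguous bases or alternative start codons.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, and its final nine bases.
print(translate("ATGAAAACTG CATCTGTTAA AGAACCATCA"))  # MKTASVKEPS
print(translate("ACAAAATAA"))                         # TK
```

The two printed fragments match the first ten and last two residues of the protein sequence listed above.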