Gene Nmar_1098 details


Gene Information

Locus tag: Nmar_1098
Symbol:
ID: 5773962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1000832
End bp: 1001974
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 33%
IMG OID: 641316740
Product: phosphoesterase DHHA1
Protein accession: YP_001582432
Protein GI: 161528606
COG category: [R] General function prediction only
COG ID: [COG2404] Predicted phosphohydrolase (DHH superfamily)
TIGRFAM ID:
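
The start, end, gene length, and protein length fields in this record are mutually consistent. The minimal sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1000832
end_bp = 1001974

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1143
print(protein_length)  # 380
```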


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGCCG CAACAAAGAA ACCTGCTACA AAGAAATCAA CAACTAAAAA GACAAAAACA 
ACTACTAAAA AAACTACAAC CAAAAAAGTT GTTAAAAAAA CAACTAAAAA AACTGCAACC
AAAAAAGTTG CTAAAAAAAC AACTACTAAA AAAGTAACAA AATCAAATCG TACCAAAGTA
ATTTGCATAT CTCACAAAGA AGATGCTGAT GGTATTAGTT CTGCAGCATT AATTCGACAA
GCATTTGGCG GTGATGCAAT TTTAGTTGAC TATCCTGGAC AAATGGAAGC ACTCCAACAA
GTTGTTACTG ATGAAAAATT AAAGTCATTA TACATTTGTG ATTTAGGTCT AAGCAAAAAA
ACTCAAGATG AATTTATTGA TATTATGAAA ACCCTGAGAA AAAATAAAGT TTCAATTACA
TACATTGATC ATCATGACAT TGATCCTAAT GTTGTCAAAT CATTAGAGAA ACTCAAAGTC
AAAGTAATTC ACGACATTAA CGAATGTACT ACTGTTCAAG TTTACAATGC CTATAAATCA
AAATTAAACG AACATGCTAC TTTTGTTGCA ACATGTGCTG CAATCACTGA TTACATGGAG
GATAGACCAC TTGGTTCTAA ACTACTTCAA ATGTATGATA GACAATTTGC ACTCATCAGT
GCTACTGTTC TAACCTACAA CATTGTAGGT CATCAAAAAG AACCTGACTA TCTCTTGTAC
TTGGTTGAAG AACTAGCAGA ATCCAAATTC CCTCATGATA TTCCAAACAC CTTTGAATTT
GCACAAATTC AAGTTGAAAA ACTCTCACAA ATGATTGCCA AAGTAAAAGC CGGTATGAAA
ACAATGAAAA ATCTAGGACA CATGGAGATT CTTGATTCTG GTGCTAGTGG TGCTGTGAAT
TTTGTAATGG GTTTGTCTGG TAAGGATGTT GGTGTTGCAT ACAAAGAAAG AGTAGACCAT
GGAATTTACG CTGTATCTGT TAGAGGTTCT AAGAATTGTA AAGTACATTT AGGTAAAATT
GTTAATTTGT TGGCAACTGA TCTTGGTGGT TCTGGTGGTG GACATGATAG AGCATGCGGC
GCTGTAATTC CAAAACCGAA AATAAAAAAA TTCATTACAG AATTAAATAA AAAAATAAAG
TAA
 
Protein sequence
MLAATKKPAT KKSTTKKTKT TTKKTTTKKV VKKTTKKTAT KKVAKKTTTK KVTKSNRTKV 
ICISHKEDAD GISSAALIRQ AFGGDAILVD YPGQMEALQQ VVTDEKLKSL YICDLGLSKK
TQDEFIDIMK TLRKNKVSIT YIDHHDIDPN VVKSLEKLKV KVIHDINECT TVQVYNAYKS
KLNEHATFVA TCAAITDYME DRPLGSKLLQ MYDRQFALIS ATVLTYNIVG HQKEPDYLLY
LVEELAESKF PHDIPNTFEF AQIQVEKLSQ MIAKVKAGMK TMKNLGHMEI LDSGASGAVN
FVMGLSGKDV GVAYKERVDH GIYAVSVRGS KNCKVHLGKI VNLLATDLGG SGGGHDRACG
AVIPKPKIKK FITELNKKIK
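
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction, which is presumably how a figure like the record's GC content is derived. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C characters in seq (0.0 if empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTGGCCG CAACAAAGAA ACCTGCTACA AAGAAATCAA CAACTAAAAA GACAAAAACA"
cleaned = clean_sequence(first_line)

print(len(cleaned))  # 60
print(gc_fraction(cleaned))
```

To reproduce a whole-gene value, pass the full gene sequence (all lines joined) to `clean_sequence` before calling `gc_fraction`.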