Gene Nmar_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1794 
Symbol 
ID5773349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1638948 
End bp1642325 
Gene Length3378 bp 
Protein Length1125 aa 
Translation table11 
GC content37% 
IMG OID641317450 
ProductDNA polymerase II large subunit 
Protein accessionYP_001583128 
Protein GI161529302 
COG category[L] Replication, recombination and repair 
COG ID[COG1933] Archaeal DNA polymerase II, large subunit 
TIGRFAM ID[TIGR00354] DNA polymerase, archaeal type II, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA ACGACGCAAT TTCTCGTATT AGTGGCATTA AGATGCCTGA TTACTATTCT 
GATTACTACT CTAGTCTATC TACAGAAACA TACAACATTT TTGAGACTGC TGCATCTGCA
AAATCAAGCC TAGTTGACTC TTCCGGCATA ATAGAGCCGA AAATCGCATT TGATCTGGCT
GATCGTGTAG CAAAAATGCA CGAAATTGAT ATCGCTGAAC CACTTCGAGA ACTTCTAAAA
ACTAACGGAA AAGAACTCTC TGCATTAATT CTATCAAAAG AGATTGCACA GGGAAAATAC
TCACTTCCTG ATTCTACTTT GGAAGAAAAA CTTGATCTGG CAGTACGTGT TGGATTGGCA
ATTGTTACAG AAGGAGTCAC AATTGCTCCG TTGCAAGGAA TTAGTGAGGT AAAGATTAAA
AAAAACAAGG ATGGTTCTGA ATATCTTTCA GTTTCCATTG CAGGTCCTAT GCGTTCTGCT
GGAGGAACAG AATCTGCCGT AACTATGTTG ATTGCAGATC ATGTCAGAAA GACAGCAGGA
CTATCAAAAT TCCAGGCCAA TTCATTTGAT GATGAAACAG GTCGATTTGT TGAGGAGTTA
AGAATCTATG AAAGAGAGGC TAGTAGTTTT CAATTCCATA TCCTAGATGA AGATATTGAA
CATGTGATTT CCAATCTTCC AGTAGAGCTA GACGGTGTTG ATACTGACCC TTATGAGGTT
GTAAATCACA AATCTATGGC GCGAATCAAA ACTGATAGAG TAAGGGGAGG TGCCTTACGT
GTCCTAAATG ATGGTCTTAT TGGAAGATCC AAGAAACTAC TCAAAAGAAT TGAAATGTAC
AATCTTGATG GTTGGGAGTG GCTTAATGAC CTAAAGGGTG CTGTTCAGAC TGGTGAAAAA
CAGGAGGATG CAGCTACCAA AAGAATGCAT GAGGTAATTA CAGGCAGATC AGTCCTTTCT
ATGCCTAACA GACTAGGAGG ATTCCGACTA AGATATGGAC GAGCATGCAA TACTGGTTTT
GCCGCAGTTG GACTTCATCC TGTAGTTGCA GAAATACTTG ATCACACAAT TGCAGTTGGA
ACTCAAATTA AGATTGACAT TCCAGGAAAA GGCTCAACAG TAGCATTTGT TGATTCGATT
GAAACTCCTG TAGTTCGTTT GAAAGATGGC AGTGTTGTAA AAATTAGAGA TGTAAAACAT
GGAATAGAAA TCAAAAATGA CATTGAAAAA ATACTTCACC TTGGAGATAT TCTGATATCT
TTTGGTGATT TCCTAGAAAA TAACGCACAA CTTGTTCCAT CAGGATATGT AGAAGAATAT
TGGATTGAGG AATTAAAACA AAAAATTGCG AAATACGAGC CTGAAGACCA TTATCTGACT
CGATTTCTCA ATAGAATTCC TACAATAGAT GAGGCATTAA AAATCTCGAT TGATTTTGAT
ATGACTCTTC ATCCTCATTA CCTGTATTAT TGGGATAAGC TATCTGCTGA AGAACTTGGA
CTACTTTTGA ATCCCAAAAC AATCAATGAA ACATCAGTAG AGTACTCTCC ACAAACAAAG
AAAATTCTTG AAAAATTAGG AGTTCCACAT ATCGTCAAAA ATGACAGTAT TATTCTTGAA
AATACTGAAG CAAAAATTTT CTTTAATTTG TTGTTTAGAG AAGAACCAAC AATTGATGAT
TCTTCTGTTC CTCAAATTAT CTCAAAATCT TCTGGAATCA AAATTAGAAA CAAATTCTCT
ACATCTATTG GAGTACGAAT TGGAAGACCT GAGAAATCTG CCCCTAGACA GATGAAACCA
CCAACACACG TATTATTCCC AATTAGTGAC AAAGGAGGAC CTACACGAGA TCTTCTAAAG
GCATCTAGAA ATGAGCACTT TTTCACCAGT ATTTACAATA GGCATTGTAA TCAATGTAAT
ATCCCGTCAA TTGGGATTAA ATGTTCAAAA TGTGGCACAA AGACCACTGT CACCTATAGG
TGCCCACATT GCAGAGATTC ACTCGAAGAA TCTTTTTGTG AAAAATGCAA ACGAAACGCT
CTAGCCTATT CTCATAAGGA ATTTCCCCTA AAATCAAAAC TTCTAGAAGC TCAAGAAAAA
ATTCGCCTAC GTGCCCAAGA ACCCTTCAAA GGAGTCAAAG AATTAATCAG TCAAGATAAG
ATTGCTGAAC CTCTGGAAAA AGGATTGGTT CGTCAAAACT TTGGACTGAC AGTCTTTAAG
GATGGAACTG TTAGATTTGA TGCAACAAAT TCTCCTCTAA CTCAATTCAA ACCCTCTTGG
ATTGGAACCT CTGTTGAGAA ACTCAAAGAA TTGGGATACT CTCATGATGT TGATGGAATT
CCACTTGAAG ATCCAGAGCA GATAGTTGAA CTTCGAATGC AAGATGTGAT AATTCCATAT
GAAAGTGGCA AGTATCTTGT TTCGATTTGC AAATACATTG ATACTCTTTT AGAAAAATTC
TATGGAAAAA CCTCATTTTA CAATGTAACC AACTCTGAGG AATTGATTGG ACATCTGATA
ATTGGTCTTG CTCCTCATAC ATCAGTTGGA ATTGTAGGTA GAATCATTGG ATATTCTGAA
ACCCATGTAT GTTTTGCAAC TCCCAATTGG CATTCTGCAA AAAGAAGAGA CGCAGACGGT
GATGCTGATT CTATAATGCT GTTGATGGAT AGTCTTTTGA ATTTCTCAAG ACAGTTCCTT
TCAGATGCGA TTGGTGGATT GATGGATGCA CCACTACTTG TTCAACCTCT TGTTTTACCA
CATGAATCCC AGCCACAAGC CCATAATCTT GAGGTAACAA AATCATTGCC TTTAGAATTT
TTTGAATCAA CACTTCAACA AGCCAAAGCA CCGGACATTT CATCTGTTGA AATCATCAAA
TCTAGACTTG AGACAGAAAG ACAGTTTTAT GATTATCATT TTACACATAC TACCTCATCG
CTTACAACTT CAAAATCACG TAGTGCATAT TCCACTCTTG GCTCAATGCT TGACAAATTC
GATATGCAAG TTAGAAATGC TGAACTGATT GATGCTGTAA ATACTTCAGA AATTGTTTCA
GATGTTATCT CAACTCATCT AGTACCAGAC ATTATGGGAA ATCTGAGAGC ATATGCAAGA
CAAAACTTTA GGTGTACTGG ATGTGGCAAA TCCTATCGTC GTATGCCATT AATCCAAACT
TGTGTTTGTG GGCATAAACT AATTCCAACA ATAACTCGTG GTTCTGTAGA AAAATATCTA
AAACTTGCAA AAAGACTTGT TGACAAGTAT GATGTTAGTG AATATCAAAG AGGACGTATT
CATGCACTTT CTGATGAAAT TGAACTAGTA TTTGGAAAAA GCCCAGGTGA CCAGTCACTT
CTTACTGATT ATGCCTGA
 
Protein sequence
MSENDAISRI SGIKMPDYYS DYYSSLSTET YNIFETAASA KSSLVDSSGI IEPKIAFDLA 
DRVAKMHEID IAEPLRELLK TNGKELSALI LSKEIAQGKY SLPDSTLEEK LDLAVRVGLA
IVTEGVTIAP LQGISEVKIK KNKDGSEYLS VSIAGPMRSA GGTESAVTML IADHVRKTAG
LSKFQANSFD DETGRFVEEL RIYEREASSF QFHILDEDIE HVISNLPVEL DGVDTDPYEV
VNHKSMARIK TDRVRGGALR VLNDGLIGRS KKLLKRIEMY NLDGWEWLND LKGAVQTGEK
QEDAATKRMH EVITGRSVLS MPNRLGGFRL RYGRACNTGF AAVGLHPVVA EILDHTIAVG
TQIKIDIPGK GSTVAFVDSI ETPVVRLKDG SVVKIRDVKH GIEIKNDIEK ILHLGDILIS
FGDFLENNAQ LVPSGYVEEY WIEELKQKIA KYEPEDHYLT RFLNRIPTID EALKISIDFD
MTLHPHYLYY WDKLSAEELG LLLNPKTINE TSVEYSPQTK KILEKLGVPH IVKNDSIILE
NTEAKIFFNL LFREEPTIDD SSVPQIISKS SGIKIRNKFS TSIGVRIGRP EKSAPRQMKP
PTHVLFPISD KGGPTRDLLK ASRNEHFFTS IYNRHCNQCN IPSIGIKCSK CGTKTTVTYR
CPHCRDSLEE SFCEKCKRNA LAYSHKEFPL KSKLLEAQEK IRLRAQEPFK GVKELISQDK
IAEPLEKGLV RQNFGLTVFK DGTVRFDATN SPLTQFKPSW IGTSVEKLKE LGYSHDVDGI
PLEDPEQIVE LRMQDVIIPY ESGKYLVSIC KYIDTLLEKF YGKTSFYNVT NSEELIGHLI
IGLAPHTSVG IVGRIIGYSE THVCFATPNW HSAKRRDADG DADSIMLLMD SLLNFSRQFL
SDAIGGLMDA PLLVQPLVLP HESQPQAHNL EVTKSLPLEF FESTLQQAKA PDISSVEIIK
SRLETERQFY DYHFTHTTSS LTTSKSRSAY STLGSMLDKF DMQVRNAELI DAVNTSEIVS
DVISTHLVPD IMGNLRAYAR QNFRCTGCGK SYRRMPLIQT CVCGHKLIPT ITRGSVEKYL
KLAKRLVDKY DVSEYQRGRI HALSDEIELV FGKSPGDQSL LTDYA