Gene Information |
Locus tag | Nmar_1794 |
Symbol | |
ID | 5773349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 1638948 |
End bp | 1642325 |
Gene Length | 3378 bp |
Protein Length | 1125 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641317450 |
Product | DNA polymerase II large subunit |
Protein accession | YP_001583128 |
Protein GI | 161529302 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1933] Archaeal DNA polymerase II, large subunit |
TIGRFAM ID | [TIGR00354] DNA polymerase, archaeal type II, large subunit |
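The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal
    stop codon that is excluded from the protein length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length = gene_length_bp(1638948, 1642325)
print(length, protein_length_aa(length))  # 3378 1125
```

The results match the Gene Length (3378 bp) and Protein Length (1125 aa) fields reported in the table.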
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTGAAA ACGACGCAAT TTCTCGTATT AGTGGCATTA AGATGCCTGA TTACTATTCT GATTACTACT CTAGTCTATC TACAGAAACA TACAACATTT TTGAGACTGC TGCATCTGCA AAATCAAGCC TAGTTGACTC TTCCGGCATA ATAGAGCCGA AAATCGCATT TGATCTGGCT GATCGTGTAG CAAAAATGCA CGAAATTGAT ATCGCTGAAC CACTTCGAGA ACTTCTAAAA ACTAACGGAA AAGAACTCTC TGCATTAATT CTATCAAAAG AGATTGCACA GGGAAAATAC TCACTTCCTG ATTCTACTTT GGAAGAAAAA CTTGATCTGG CAGTACGTGT TGGATTGGCA ATTGTTACAG AAGGAGTCAC AATTGCTCCG TTGCAAGGAA TTAGTGAGGT AAAGATTAAA AAAAACAAGG ATGGTTCTGA ATATCTTTCA GTTTCCATTG CAGGTCCTAT GCGTTCTGCT GGAGGAACAG AATCTGCCGT AACTATGTTG ATTGCAGATC ATGTCAGAAA GACAGCAGGA CTATCAAAAT TCCAGGCCAA TTCATTTGAT GATGAAACAG GTCGATTTGT TGAGGAGTTA AGAATCTATG AAAGAGAGGC TAGTAGTTTT CAATTCCATA TCCTAGATGA AGATATTGAA CATGTGATTT CCAATCTTCC AGTAGAGCTA GACGGTGTTG ATACTGACCC TTATGAGGTT GTAAATCACA AATCTATGGC GCGAATCAAA ACTGATAGAG TAAGGGGAGG TGCCTTACGT GTCCTAAATG ATGGTCTTAT TGGAAGATCC AAGAAACTAC TCAAAAGAAT TGAAATGTAC AATCTTGATG GTTGGGAGTG GCTTAATGAC CTAAAGGGTG CTGTTCAGAC TGGTGAAAAA CAGGAGGATG CAGCTACCAA AAGAATGCAT GAGGTAATTA CAGGCAGATC AGTCCTTTCT ATGCCTAACA GACTAGGAGG ATTCCGACTA AGATATGGAC GAGCATGCAA TACTGGTTTT GCCGCAGTTG GACTTCATCC TGTAGTTGCA GAAATACTTG ATCACACAAT TGCAGTTGGA ACTCAAATTA AGATTGACAT TCCAGGAAAA GGCTCAACAG TAGCATTTGT TGATTCGATT GAAACTCCTG TAGTTCGTTT GAAAGATGGC AGTGTTGTAA AAATTAGAGA TGTAAAACAT GGAATAGAAA TCAAAAATGA CATTGAAAAA ATACTTCACC TTGGAGATAT TCTGATATCT TTTGGTGATT TCCTAGAAAA TAACGCACAA CTTGTTCCAT CAGGATATGT AGAAGAATAT TGGATTGAGG AATTAAAACA AAAAATTGCG AAATACGAGC CTGAAGACCA TTATCTGACT CGATTTCTCA ATAGAATTCC TACAATAGAT GAGGCATTAA AAATCTCGAT TGATTTTGAT ATGACTCTTC ATCCTCATTA CCTGTATTAT TGGGATAAGC TATCTGCTGA AGAACTTGGA CTACTTTTGA ATCCCAAAAC AATCAATGAA ACATCAGTAG AGTACTCTCC ACAAACAAAG AAAATTCTTG AAAAATTAGG AGTTCCACAT ATCGTCAAAA ATGACAGTAT TATTCTTGAA AATACTGAAG CAAAAATTTT CTTTAATTTG TTGTTTAGAG AAGAACCAAC AATTGATGAT TCTTCTGTTC CTCAAATTAT CTCAAAATCT TCTGGAATCA AAATTAGAAA CAAATTCTCT ACATCTATTG GAGTACGAAT TGGAAGACCT GAGAAATCTG CCCCTAGACA GATGAAACCA 
CCAACACACG TATTATTCCC AATTAGTGAC AAAGGAGGAC CTACACGAGA TCTTCTAAAG GCATCTAGAA ATGAGCACTT TTTCACCAGT ATTTACAATA GGCATTGTAA TCAATGTAAT ATCCCGTCAA TTGGGATTAA ATGTTCAAAA TGTGGCACAA AGACCACTGT CACCTATAGG TGCCCACATT GCAGAGATTC ACTCGAAGAA TCTTTTTGTG AAAAATGCAA ACGAAACGCT CTAGCCTATT CTCATAAGGA ATTTCCCCTA AAATCAAAAC TTCTAGAAGC TCAAGAAAAA ATTCGCCTAC GTGCCCAAGA ACCCTTCAAA GGAGTCAAAG AATTAATCAG TCAAGATAAG ATTGCTGAAC CTCTGGAAAA AGGATTGGTT CGTCAAAACT TTGGACTGAC AGTCTTTAAG GATGGAACTG TTAGATTTGA TGCAACAAAT TCTCCTCTAA CTCAATTCAA ACCCTCTTGG ATTGGAACCT CTGTTGAGAA ACTCAAAGAA TTGGGATACT CTCATGATGT TGATGGAATT CCACTTGAAG ATCCAGAGCA GATAGTTGAA CTTCGAATGC AAGATGTGAT AATTCCATAT GAAAGTGGCA AGTATCTTGT TTCGATTTGC AAATACATTG ATACTCTTTT AGAAAAATTC TATGGAAAAA CCTCATTTTA CAATGTAACC AACTCTGAGG AATTGATTGG ACATCTGATA ATTGGTCTTG CTCCTCATAC ATCAGTTGGA ATTGTAGGTA GAATCATTGG ATATTCTGAA ACCCATGTAT GTTTTGCAAC TCCCAATTGG CATTCTGCAA AAAGAAGAGA CGCAGACGGT GATGCTGATT CTATAATGCT GTTGATGGAT AGTCTTTTGA ATTTCTCAAG ACAGTTCCTT TCAGATGCGA TTGGTGGATT GATGGATGCA CCACTACTTG TTCAACCTCT TGTTTTACCA CATGAATCCC AGCCACAAGC CCATAATCTT GAGGTAACAA AATCATTGCC TTTAGAATTT TTTGAATCAA CACTTCAACA AGCCAAAGCA CCGGACATTT CATCTGTTGA AATCATCAAA TCTAGACTTG AGACAGAAAG ACAGTTTTAT GATTATCATT TTACACATAC TACCTCATCG CTTACAACTT CAAAATCACG TAGTGCATAT TCCACTCTTG GCTCAATGCT TGACAAATTC GATATGCAAG TTAGAAATGC TGAACTGATT GATGCTGTAA ATACTTCAGA AATTGTTTCA GATGTTATCT CAACTCATCT AGTACCAGAC ATTATGGGAA ATCTGAGAGC ATATGCAAGA CAAAACTTTA GGTGTACTGG ATGTGGCAAA TCCTATCGTC GTATGCCATT AATCCAAACT TGTGTTTGTG GGCATAAACT AATTCCAACA ATAACTCGTG GTTCTGTAGA AAAATATCTA AAACTTGCAA AAAGACTTGT TGACAAGTAT GATGTTAGTG AATATCAAAG AGGACGTATT CATGCACTTT CTGATGAAAT TGAACTAGTA TTTGGAAAAA GCCCAGGTGA CCAGTCACTT CTTACTGATT ATGCCTGA
Protein sequence | MSENDAISRI SGIKMPDYYS DYYSSLSTET YNIFETAASA KSSLVDSSGI IEPKIAFDLA DRVAKMHEID IAEPLRELLK TNGKELSALI LSKEIAQGKY SLPDSTLEEK LDLAVRVGLA IVTEGVTIAP LQGISEVKIK KNKDGSEYLS VSIAGPMRSA GGTESAVTML IADHVRKTAG LSKFQANSFD DETGRFVEEL RIYEREASSF QFHILDEDIE HVISNLPVEL DGVDTDPYEV VNHKSMARIK TDRVRGGALR VLNDGLIGRS KKLLKRIEMY NLDGWEWLND LKGAVQTGEK QEDAATKRMH EVITGRSVLS MPNRLGGFRL RYGRACNTGF AAVGLHPVVA EILDHTIAVG TQIKIDIPGK GSTVAFVDSI ETPVVRLKDG SVVKIRDVKH GIEIKNDIEK ILHLGDILIS FGDFLENNAQ LVPSGYVEEY WIEELKQKIA KYEPEDHYLT RFLNRIPTID EALKISIDFD MTLHPHYLYY WDKLSAEELG LLLNPKTINE TSVEYSPQTK KILEKLGVPH IVKNDSIILE NTEAKIFFNL LFREEPTIDD SSVPQIISKS SGIKIRNKFS TSIGVRIGRP EKSAPRQMKP PTHVLFPISD KGGPTRDLLK ASRNEHFFTS IYNRHCNQCN IPSIGIKCSK CGTKTTVTYR CPHCRDSLEE SFCEKCKRNA LAYSHKEFPL KSKLLEAQEK IRLRAQEPFK GVKELISQDK IAEPLEKGLV RQNFGLTVFK DGTVRFDATN SPLTQFKPSW IGTSVEKLKE LGYSHDVDGI PLEDPEQIVE LRMQDVIIPY ESGKYLVSIC KYIDTLLEKF YGKTSFYNVT NSEELIGHLI IGLAPHTSVG IVGRIIGYSE THVCFATPNW HSAKRRDADG DADSIMLLMD SLLNFSRQFL SDAIGGLMDA PLLVQPLVLP HESQPQAHNL EVTKSLPLEF FESTLQQAKA PDISSVEIIK SRLETERQFY DYHFTHTTSS LTTSKSRSAY STLGSMLDKF DMQVRNAELI DAVNTSEIVS DVISTHLVPD IMGNLRAYAR QNFRCTGCGK SYRRMPLIQT CVCGHKLIPT ITRGSVEKYL KLAKRLVDKY DVSEYQRGRI HALSDEIELV FGKSPGDQSL LTDYA