Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_0610 |
Symbol | |
ID | 5774659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 548173 |
End bp | 550365 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641316245 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001581944 |
Protein GI | 161528118 |
COG category | [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCAGTA TACACCAACA AACTGAATCA GAAATACGCA AAAAGTATAT TTCTAAACTG AGTATTGAAT ATGGGTTTGA CGAAAAAAAC ATTCGACTCG ATGTTCCTGT ACAATTAGGG CTAAAACGAT TCATCGCTGA TGCCATTATT TATCAAGATG ATCGACCATT TGTCTTGGTT GAAATCAAAA CTGATCTGAC CCTAAATTTT GAACAGGCAA AAAACACACT GGAACAAATG TCTTCTGTTT TGGGAAATAA TTTTGTAATT CTTACAAATG GTTATTATGA CATATGCTAC GAAATAGAAA AATCCTCAAA ACAAATCAAA TTTAATGAGA TTCCTGATAT TCCTGCACTA TCAAAAGATA AACGAACTAA GAAAAATCCT CCTCAATTGG AATTAAATGA ACCTGAACCA TTTTTCTATG GTATTTTTGA ATCTCTGCAA AATGATCCAA GATTGCTTCA TGTTACTGAA AAAGAAATTG TTGATGATCT TCATAAAATA ATTGGCGCAA AAATTTTTGA TGAAAAAAAT TCTGGTACCT CATTATTTTT TGCAAGAAAA AACGACTCTT ATCTTAAAAT TGTTTCTAGA CTTGAAGAAC TCCTTCATCA AATCAATTCA AAAGAAAATA ATTTTATCTT TGATCTTGAT TTTAAATTAC CTCATGATTT GATTTCCAAA ATTGTCTTTA AACTTCAAAA ATATTCTTTA ACAAAATCTC AATTAAAAAA TATGCCATTA GGATTTTCAC AAGGCATTTT GTCTAAATCG ACTGGTGCTT ATTTGACTCC TGATGCTATT TCTGAGTTTA TGAGCCATCT GTTTAAAATT AATTCTAAAA TGAAAGTTTT AGATTTGGCT TGTGGCAGTG GGGCGTTTCT TGTTAATGCA GGAAAATTTG GTGCTTCAGT TGTTGGTGTT GATGCCAACC GACAAATAGC CAATATCGCT AAAATAAACT GCTATTTGAA TGGCATCAAA AATGCTTCTG TTATCTGTGC TGACAGTCTT GGACCTCTTG AGAATTTGGC AAAAATGTCT TCTGGCAACA TTCAAACAAA TTCCTTTGAT CTTGTTTTAA CCCACCCACC TTTCGGATTA CGTTTAACCA AGGATTATGC CAATTTTTCA ATGCTAACTA TTTCTGGGAA TAGGTTCATG GAATCGTTGT TCATTGAAAG AAGTTGGGAA TTGTTAAAAG AAGGTGGAAA ATTAATTATT ATTCTACCTG AGGGTATTAC TTCCAACAAA TCTACTCGAA AAATAAGAGA ATTTATCACT ACAAATTTCA AAGTTCTAGG AATTATCAGC TTGCCTGATT ATGCCTTTTT TCCTTATTCC TCCATAAAAA CAACTATTCT GGTTCTAGAA AAATTAGGGC CAAAAATCAT CTCCAATTCT TATATGATAT TTACCGCCTA TGCAAAAAAT TTGGGTTATG ACAAACAAGG AATTCTTGCT AAAGAAAGTG ATTTCTCAAA AATTTTAGAA GATTTTAACA AATTTTTACA AACAAACAAA GGTTCTAAAT TTGATCAAAA TCTAAATTCT GATTTACGTC TAGATGAAAA TTATTTCCAG AATACTAGTG ATTTAGGAAA CCAAACAAAC ATGTGTATGT TAAAAGATAT TGCAGACATT ACTATTGGTG TTAAGAACAG TAAACTCAAA AAAGATACAA AATACCTTTT GGTCAAAGGT CAACAAATCA AAGACTTTGA AGTAGATTTA TCAAATGCAT CTGAAGTTGG GGTTGAATTT TCTATTGAAA AATATTTGTT ACAAAAAGGT GATATTGCAA TAACTCGTTC TGGTACTGTT GGTAATGTCG GATTATGTAA TAAGGATGCT AATGTAATCT TTAGTGATAA TATAATACGT ATAAGAATTA ATTCAGATAA AATAATTTCT CAATATCTTG CATCATTTCT ATATTCTGAA TTAGGCCAAA GACAAATTAG ACAATGCACC ACCGGAAGTA CAATTCGTGG CATCAGTCTA AGTAATCTTG AAAAAATACA AATCCCATTA ATTTCAATTT CTAAACAACA CAAAATTGCA AATGATTTGA AAAAAATTCT TGATGCTAAA TCTGAACTTA ACCATCTAAT AAAGAACTTG GAAAATTCAA AAACATCTCT ATCAAAAAAT GTAGAAAAAA CTATTGAGGA GTTACTAAAT TGA
|
Protein sequence | MGSIHQQTES EIRKKYISKL SIEYGFDEKN IRLDVPVQLG LKRFIADAII YQDDRPFVLV EIKTDLTLNF EQAKNTLEQM SSVLGNNFVI LTNGYYDICY EIEKSSKQIK FNEIPDIPAL SKDKRTKKNP PQLELNEPEP FFYGIFESLQ NDPRLLHVTE KEIVDDLHKI IGAKIFDEKN SGTSLFFARK NDSYLKIVSR LEELLHQINS KENNFIFDLD FKLPHDLISK IVFKLQKYSL TKSQLKNMPL GFSQGILSKS TGAYLTPDAI SEFMSHLFKI NSKMKVLDLA CGSGAFLVNA GKFGASVVGV DANRQIANIA KINCYLNGIK NASVICADSL GPLENLAKMS SGNIQTNSFD LVLTHPPFGL RLTKDYANFS MLTISGNRFM ESLFIERSWE LLKEGGKLII ILPEGITSNK STRKIREFIT TNFKVLGIIS LPDYAFFPYS SIKTTILVLE KLGPKIISNS YMIFTAYAKN LGYDKQGILA KESDFSKILE DFNKFLQTNK GSKFDQNLNS DLRLDENYFQ NTSDLGNQTN MCMLKDIADI TIGVKNSKLK KDTKYLLVKG QQIKDFEVDL SNASEVGVEF SIEKYLLQKG DIAITRSGTV GNVGLCNKDA NVIFSDNIIR IRINSDKIIS QYLASFLYSE LGQRQIRQCT TGSTIRGISL SNLEKIQIPL ISISKQHKIA NDLKKILDAK SELNHLIKNL ENSKTSLSKN VEKTIEELLN
|
| |