Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3082 |
Symbol | |
ID | 9246938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3686800 |
End bp | 3690057 |
Gene Length | 3258 bp |
Protein Length | 1085 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003680997 |
Protein GI | 297562023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGT ACGCCGATGA CGGCATCCCC ATCTACCCGC GCGATCCGGA CGAGGCCCAG TGGGAGGCGT GGGCTCTGGA ATGGCTCGCC GAACCCTGCG GATGGGAACC GGTTCGCGGG CAGGATCTCG CCCCGCGCAA GGACGGCAGC GGCGAGCGCA GAGCCTGGGA CGACCTTCTG CTCACCGAAC GACTGAGATC CGCCCTCACC CGCATCAACC CCCAACTGCC CGACGACGCG GTGGACGAGG TGATCGAGGA GCTCGGCCGC CGTGAGAGCT CGGACCCCTT CCATGAGTTC CACCGTCTGC ACACCCTGCT CACCCAAGGC GTCAAGGTGG AGGTGACCGA CCCGGACACC GGGCAGACGG TCACCGAGAC CGCGTGGCCG ATCGACTTCA ACGACCCGCA CTCCAACGCC TTCGTCGCCG CCAACCAGGT CACCGTCAAG GACCTGGCTG GCTCCGCCTC ACGTACCCGG CCCCGGCGGC TGGACATCGT AGGCTACGTC AACGGCATCC CCTTGGCCGT CTTCGAGCTG AAGGCAGCGG GCTCCGAGGA CGGCAGCCGC GAAGCCCACG CGCAGCTGCT GACCTACCGC ACCGAGTACG GTGCCACCGC GCTCGCCCCT GTGGCCTTCG CCATCGCCTC CGACGGCATC ACCGCCCGTG TCGGGACTCC GCACACGCCC TGGGAGCACA TGGCCCCCTG GGAGGTGAAC GACGCTGGTG ACCCGCTCGA ACTCGCCGCA GGGCAGGACG ACCACGAGCA CCTCGTCGCC TTGGAAAGCC TCGTCAGCGG AGTGTTCGGC CCGGTCCGAT TCCTGGACCT GCTGGAGAAC TTCCTCGCCT TCTCTCGGGA CGAGGGGAGC ACGGTCGACA CTGTACGGCT GGCCAAAGCC CACCAGTACA TCGCCGTGAA CAAGGCGATC GACCGCACGA TCACCGCTGT GTCCAGCAAC GGCAAGATCG GCGTGGTCTG GCACACCCAG GGCTCCGGCA AGAGCAAGGA GATGGAGTTC TACGCGGCCA AGGCGCTCAA GCATCCCGCG TTGCGCAACC CGACCATCGT CGTACTCACA GACCGGCTTG ACCTGGACAG TCAGCTGTAC GCCACCTTCG CAGCCTCCGC CCTCCTCCCC GAGGAACCCA AGCAGGCCGC GTTGTCCGAG CACCTCTCCC ACCTGCTTGA GCGCCCCTCG GGCGGCATCG TCTTCACCAC CCTGCAAAAG TTCCGGATCA CCAAGGACGA GAAGGAAGCC GGTCTCCAAC ACCGGGTGCT CAGCGCACGG CGCAACGTCA TCGTCATCGT CGACGAGGCC CACCGCAGCC ACTACGGCTT GTTGGAGGGT TACGCGAAGA ACCTGCGGGA CGCGCTTCCC AACGCCGCCT ACATCGCGTT CACCGGCACC CCGATCGCCG CGGCCGACCG CGACACCCGT GCCGTCTTCG GCGACGACAT CGACGTCTAC GACCTGACGC GGGCCGTGCG GGACGGTGCG ACCGTCCGGG TCTTCTATGA GAACCGGCAC ATCCCGGTCT CACTCCCCCA GGACATCGAT CCCGAACTGC TCGACGGCCG AGCCGAGGAC CTGACGGCGG AACTGGACGA GGAGGAGCGC AAACGCGCCA ACCGGGCCCT TGCGGCCTAT GAGGACGTGG TCGGTTCCCC CGAGCGGATC CGCAAGCTGG CCGCCGACAT CGTCGACCAT TGGAAGCAGC GCCGAGAGGA GATGGGCAAG CTCCTCGTCA CCACGGGGGA GAACCCGCGT CCGTCCCCGG GAAAGGCGAT GATCGTCGGG CTGAGCCGGA AGGTCTGCGC CGACCTGTAC GCCGCCATCA TCCAGCTGGA GCCGACCTGG CACTCCGATG ACGACGCCGA CGGTGTCATC AAGTGCGTCT ACACTGGTCA GGCCTCCGAT CCTGAGCCGA CCAGGACGCA CGTTCGCACA CCCACCCGGA TCAAGGCGAT CCAGCGCAGG GCCACCGATC CCGAGGACAA GCTGGAACTG GTCATCGTCC AGTCCCTGTG GCTCACCGGA TTCGACTCTC CTCCACTGCA CACCCTGTAC CTGGACAAAC CGATGCGCGG GGCAGCGCTC ATGCAGGCCA TCGCACGGGT GAACCGGCGG TTCGGGGAGA AGCCATCCGG GCTCGTCGTC GACTTCCTGG GCATCGCGGA CAAACTCACT CAGGCCCTGA TGGAGTACAC CCTCACCGAC CAGGACGAGC GTCCCGTCGG CAGAGAGGTC TCCGAAGCCG TCGCCATCGT GCAGGAACAG CACCACATCC TCGACGGAAT CCTGCACGGG CTCAACTGGA GGCTCACACG GGACTCGGGT CGGCCCAAAG CGTTCGTGAA CGCTGTCCTG GACGCGGTGG AGTTCCTTCA ACAACCGGAG CCTGACCTCG AGGAGGGCAG GCCCACTCTG GTTCGGCGCT TCACCAAGCA TGCCCGTGAC CTCGTCCGCG CGTTCGCGCT CTGCCCGACC GAGCCGGAGC TGGATCCGAT CCGTCCTGAT CTGAAGTTCT TCGACTCCGT CCGGCACTAT GTCGTCAAGT TGGACACAGA GGCAAAAGCC GCGCGTGGTC TGGCGACGGC CGCGGACGTC GAACTCGCCA TCCGTCAGCT CACGGCGAGT GCGGTGGCCG CCGACGAGGT CGTGGACATC TACGAGGCCG CCGGCCTGCA AAAACCCGAC CTCTCGCATC TGGACGAGGA GTTCGTTCGC AGGCTCAGGG AGAGTGAGCG CCCCCACCTG GCGATCGAGG CACTGCGCCG GTCGATCGAG CGGGAGGTGA AGGCCGTCTA TCCGCACAAC GTCGTCAAGC AGGAGAAGTT CGTCGAGAAA CTGCTCCACA CGATGAACCG GTACCGAAAC GGGGCACTCA CCTCCGCTGC GGTCATCGCC CAGATGGTGG AACTCGCCAA GGAGGTGTCA GCGGATCGGG GGCGCGCCGC GGAACTGGGG TTGTCCGAGG ATGAGTTGGC GTTCTACGAC GCCGTCGCAA AGAACGAGGC AGCGGTCAAG GTCATGGGCA CGGGAAAACT CGCGGCGATT GCCCGTGACC TGGTCACACA GGTGCAGTCC AGCGCCTCCA TCGACTGGTC CCAGCGCGAC GAGGTGAAGT CCTACATGAT GAGCCGCATC AAGAGACTGC TGCGTCGGCA CGGGTATCCA CCGGACGCTC AGCCCTCGGC TGTCGAGGAG GTCCTCAAAC AGGCTCGCGG ATACGGCGAG TACTGGAGTA ACCGGTGA
|
Protein sequence | MSQYADDGIP IYPRDPDEAQ WEAWALEWLA EPCGWEPVRG QDLAPRKDGS GERRAWDDLL LTERLRSALT RINPQLPDDA VDEVIEELGR RESSDPFHEF HRLHTLLTQG VKVEVTDPDT GQTVTETAWP IDFNDPHSNA FVAANQVTVK DLAGSASRTR PRRLDIVGYV NGIPLAVFEL KAAGSEDGSR EAHAQLLTYR TEYGATALAP VAFAIASDGI TARVGTPHTP WEHMAPWEVN DAGDPLELAA GQDDHEHLVA LESLVSGVFG PVRFLDLLEN FLAFSRDEGS TVDTVRLAKA HQYIAVNKAI DRTITAVSSN GKIGVVWHTQ GSGKSKEMEF YAAKALKHPA LRNPTIVVLT DRLDLDSQLY ATFAASALLP EEPKQAALSE HLSHLLERPS GGIVFTTLQK FRITKDEKEA GLQHRVLSAR RNVIVIVDEA HRSHYGLLEG YAKNLRDALP NAAYIAFTGT PIAAADRDTR AVFGDDIDVY DLTRAVRDGA TVRVFYENRH IPVSLPQDID PELLDGRAED LTAELDEEER KRANRALAAY EDVVGSPERI RKLAADIVDH WKQRREEMGK LLVTTGENPR PSPGKAMIVG LSRKVCADLY AAIIQLEPTW HSDDDADGVI KCVYTGQASD PEPTRTHVRT PTRIKAIQRR ATDPEDKLEL VIVQSLWLTG FDSPPLHTLY LDKPMRGAAL MQAIARVNRR FGEKPSGLVV DFLGIADKLT QALMEYTLTD QDERPVGREV SEAVAIVQEQ HHILDGILHG LNWRLTRDSG RPKAFVNAVL DAVEFLQQPE PDLEEGRPTL VRRFTKHARD LVRAFALCPT EPELDPIRPD LKFFDSVRHY VVKLDTEAKA ARGLATAADV ELAIRQLTAS AVAADEVVDI YEAAGLQKPD LSHLDEEFVR RLRESERPHL AIEALRRSIE REVKAVYPHN VVKQEKFVEK LLHTMNRYRN GALTSAAVIA QMVELAKEVS ADRGRAAELG LSEDELAFYD AVAKNEAAVK VMGTGKLAAI ARDLVTQVQS SASIDWSQRD EVKSYMMSRI KRLLRRHGYP PDAQPSAVEE VLKQARGYGE YWSNR
|
| |