Gene Ndas_3082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3082 
Symbol 
ID9246938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3686800 
End bp3690057 
Gene Length3258 bp 
Protein Length1085 aa 
Translation table11 
GC content66% 
IMG OID 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_003680997 
Protein GI297562023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGT ACGCCGATGA CGGCATCCCC ATCTACCCGC GCGATCCGGA CGAGGCCCAG 
TGGGAGGCGT GGGCTCTGGA ATGGCTCGCC GAACCCTGCG GATGGGAACC GGTTCGCGGG
CAGGATCTCG CCCCGCGCAA GGACGGCAGC GGCGAGCGCA GAGCCTGGGA CGACCTTCTG
CTCACCGAAC GACTGAGATC CGCCCTCACC CGCATCAACC CCCAACTGCC CGACGACGCG
GTGGACGAGG TGATCGAGGA GCTCGGCCGC CGTGAGAGCT CGGACCCCTT CCATGAGTTC
CACCGTCTGC ACACCCTGCT CACCCAAGGC GTCAAGGTGG AGGTGACCGA CCCGGACACC
GGGCAGACGG TCACCGAGAC CGCGTGGCCG ATCGACTTCA ACGACCCGCA CTCCAACGCC
TTCGTCGCCG CCAACCAGGT CACCGTCAAG GACCTGGCTG GCTCCGCCTC ACGTACCCGG
CCCCGGCGGC TGGACATCGT AGGCTACGTC AACGGCATCC CCTTGGCCGT CTTCGAGCTG
AAGGCAGCGG GCTCCGAGGA CGGCAGCCGC GAAGCCCACG CGCAGCTGCT GACCTACCGC
ACCGAGTACG GTGCCACCGC GCTCGCCCCT GTGGCCTTCG CCATCGCCTC CGACGGCATC
ACCGCCCGTG TCGGGACTCC GCACACGCCC TGGGAGCACA TGGCCCCCTG GGAGGTGAAC
GACGCTGGTG ACCCGCTCGA ACTCGCCGCA GGGCAGGACG ACCACGAGCA CCTCGTCGCC
TTGGAAAGCC TCGTCAGCGG AGTGTTCGGC CCGGTCCGAT TCCTGGACCT GCTGGAGAAC
TTCCTCGCCT TCTCTCGGGA CGAGGGGAGC ACGGTCGACA CTGTACGGCT GGCCAAAGCC
CACCAGTACA TCGCCGTGAA CAAGGCGATC GACCGCACGA TCACCGCTGT GTCCAGCAAC
GGCAAGATCG GCGTGGTCTG GCACACCCAG GGCTCCGGCA AGAGCAAGGA GATGGAGTTC
TACGCGGCCA AGGCGCTCAA GCATCCCGCG TTGCGCAACC CGACCATCGT CGTACTCACA
GACCGGCTTG ACCTGGACAG TCAGCTGTAC GCCACCTTCG CAGCCTCCGC CCTCCTCCCC
GAGGAACCCA AGCAGGCCGC GTTGTCCGAG CACCTCTCCC ACCTGCTTGA GCGCCCCTCG
GGCGGCATCG TCTTCACCAC CCTGCAAAAG TTCCGGATCA CCAAGGACGA GAAGGAAGCC
GGTCTCCAAC ACCGGGTGCT CAGCGCACGG CGCAACGTCA TCGTCATCGT CGACGAGGCC
CACCGCAGCC ACTACGGCTT GTTGGAGGGT TACGCGAAGA ACCTGCGGGA CGCGCTTCCC
AACGCCGCCT ACATCGCGTT CACCGGCACC CCGATCGCCG CGGCCGACCG CGACACCCGT
GCCGTCTTCG GCGACGACAT CGACGTCTAC GACCTGACGC GGGCCGTGCG GGACGGTGCG
ACCGTCCGGG TCTTCTATGA GAACCGGCAC ATCCCGGTCT CACTCCCCCA GGACATCGAT
CCCGAACTGC TCGACGGCCG AGCCGAGGAC CTGACGGCGG AACTGGACGA GGAGGAGCGC
AAACGCGCCA ACCGGGCCCT TGCGGCCTAT GAGGACGTGG TCGGTTCCCC CGAGCGGATC
CGCAAGCTGG CCGCCGACAT CGTCGACCAT TGGAAGCAGC GCCGAGAGGA GATGGGCAAG
CTCCTCGTCA CCACGGGGGA GAACCCGCGT CCGTCCCCGG GAAAGGCGAT GATCGTCGGG
CTGAGCCGGA AGGTCTGCGC CGACCTGTAC GCCGCCATCA TCCAGCTGGA GCCGACCTGG
CACTCCGATG ACGACGCCGA CGGTGTCATC AAGTGCGTCT ACACTGGTCA GGCCTCCGAT
CCTGAGCCGA CCAGGACGCA CGTTCGCACA CCCACCCGGA TCAAGGCGAT CCAGCGCAGG
GCCACCGATC CCGAGGACAA GCTGGAACTG GTCATCGTCC AGTCCCTGTG GCTCACCGGA
TTCGACTCTC CTCCACTGCA CACCCTGTAC CTGGACAAAC CGATGCGCGG GGCAGCGCTC
ATGCAGGCCA TCGCACGGGT GAACCGGCGG TTCGGGGAGA AGCCATCCGG GCTCGTCGTC
GACTTCCTGG GCATCGCGGA CAAACTCACT CAGGCCCTGA TGGAGTACAC CCTCACCGAC
CAGGACGAGC GTCCCGTCGG CAGAGAGGTC TCCGAAGCCG TCGCCATCGT GCAGGAACAG
CACCACATCC TCGACGGAAT CCTGCACGGG CTCAACTGGA GGCTCACACG GGACTCGGGT
CGGCCCAAAG CGTTCGTGAA CGCTGTCCTG GACGCGGTGG AGTTCCTTCA ACAACCGGAG
CCTGACCTCG AGGAGGGCAG GCCCACTCTG GTTCGGCGCT TCACCAAGCA TGCCCGTGAC
CTCGTCCGCG CGTTCGCGCT CTGCCCGACC GAGCCGGAGC TGGATCCGAT CCGTCCTGAT
CTGAAGTTCT TCGACTCCGT CCGGCACTAT GTCGTCAAGT TGGACACAGA GGCAAAAGCC
GCGCGTGGTC TGGCGACGGC CGCGGACGTC GAACTCGCCA TCCGTCAGCT CACGGCGAGT
GCGGTGGCCG CCGACGAGGT CGTGGACATC TACGAGGCCG CCGGCCTGCA AAAACCCGAC
CTCTCGCATC TGGACGAGGA GTTCGTTCGC AGGCTCAGGG AGAGTGAGCG CCCCCACCTG
GCGATCGAGG CACTGCGCCG GTCGATCGAG CGGGAGGTGA AGGCCGTCTA TCCGCACAAC
GTCGTCAAGC AGGAGAAGTT CGTCGAGAAA CTGCTCCACA CGATGAACCG GTACCGAAAC
GGGGCACTCA CCTCCGCTGC GGTCATCGCC CAGATGGTGG AACTCGCCAA GGAGGTGTCA
GCGGATCGGG GGCGCGCCGC GGAACTGGGG TTGTCCGAGG ATGAGTTGGC GTTCTACGAC
GCCGTCGCAA AGAACGAGGC AGCGGTCAAG GTCATGGGCA CGGGAAAACT CGCGGCGATT
GCCCGTGACC TGGTCACACA GGTGCAGTCC AGCGCCTCCA TCGACTGGTC CCAGCGCGAC
GAGGTGAAGT CCTACATGAT GAGCCGCATC AAGAGACTGC TGCGTCGGCA CGGGTATCCA
CCGGACGCTC AGCCCTCGGC TGTCGAGGAG GTCCTCAAAC AGGCTCGCGG ATACGGCGAG
TACTGGAGTA ACCGGTGA
 
Protein sequence
MSQYADDGIP IYPRDPDEAQ WEAWALEWLA EPCGWEPVRG QDLAPRKDGS GERRAWDDLL 
LTERLRSALT RINPQLPDDA VDEVIEELGR RESSDPFHEF HRLHTLLTQG VKVEVTDPDT
GQTVTETAWP IDFNDPHSNA FVAANQVTVK DLAGSASRTR PRRLDIVGYV NGIPLAVFEL
KAAGSEDGSR EAHAQLLTYR TEYGATALAP VAFAIASDGI TARVGTPHTP WEHMAPWEVN
DAGDPLELAA GQDDHEHLVA LESLVSGVFG PVRFLDLLEN FLAFSRDEGS TVDTVRLAKA
HQYIAVNKAI DRTITAVSSN GKIGVVWHTQ GSGKSKEMEF YAAKALKHPA LRNPTIVVLT
DRLDLDSQLY ATFAASALLP EEPKQAALSE HLSHLLERPS GGIVFTTLQK FRITKDEKEA
GLQHRVLSAR RNVIVIVDEA HRSHYGLLEG YAKNLRDALP NAAYIAFTGT PIAAADRDTR
AVFGDDIDVY DLTRAVRDGA TVRVFYENRH IPVSLPQDID PELLDGRAED LTAELDEEER
KRANRALAAY EDVVGSPERI RKLAADIVDH WKQRREEMGK LLVTTGENPR PSPGKAMIVG
LSRKVCADLY AAIIQLEPTW HSDDDADGVI KCVYTGQASD PEPTRTHVRT PTRIKAIQRR
ATDPEDKLEL VIVQSLWLTG FDSPPLHTLY LDKPMRGAAL MQAIARVNRR FGEKPSGLVV
DFLGIADKLT QALMEYTLTD QDERPVGREV SEAVAIVQEQ HHILDGILHG LNWRLTRDSG
RPKAFVNAVL DAVEFLQQPE PDLEEGRPTL VRRFTKHARD LVRAFALCPT EPELDPIRPD
LKFFDSVRHY VVKLDTEAKA ARGLATAADV ELAIRQLTAS AVAADEVVDI YEAAGLQKPD
LSHLDEEFVR RLRESERPHL AIEALRRSIE REVKAVYPHN VVKQEKFVEK LLHTMNRYRN
GALTSAAVIA QMVELAKEVS ADRGRAAELG LSEDELAFYD AVAKNEAAVK VMGTGKLAAI
ARDLVTQVQS SASIDWSQRD EVKSYMMSRI KRLLRRHGYP PDAQPSAVEE VLKQARGYGE
YWSNR