Gene Ndas_4075 details

Gene Information

Locus tag: Ndas_4075
Symbol:
ID: 9247947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4872840
End bp: 4874219
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: HNH endonuclease
Protein accession: YP_003681977
Protein GI: 297563003
COG category:
COG ID:
TIGRFAM ID:
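
The start and end positions, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, excluding the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 4872840, End bp 4874219.
length = gene_length_bp(4872840, 4874219)
print(length, protein_length_aa(length))  # prints: 1380 459
```

Both results agree with the Gene Length (1380 bp) and Protein Length (459 aa) fields.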


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCCGG GGGCCGGGGA GCAGGCCCGT GAGGACCGTC CACCGAAGAC CACGCCCCGC 
GAGGGGCACC CGTACGTGTT CGTGCCGGAC CGCCACGGCA CACCACTACA ACCCACCCAC
CCCGCGCGGG CCCGCCGGCT CCTGGCCAGG GGCCGGGCGA TCGCGGCCCG GCACACCCCC
TTCGTCATCA GGTTGAAGGA TCGCACCACC GCCGAGTCCG GGGTCGACGG GGTCGAGACA
GGCATTGACC CAGGCAGCAA GCGCGCCGGC ACCGCCGTGT TCACCGACCG GCGTGGAGAA
CGCCGAGGCC GCTGTGCGAT ACGGCCGGAC CACCGAGGCG GTCAGATCGC CAAGAGGATG
CGCCAACGCG CCGCCTACAG AAAGGGGCGC CGTTCTCGTA ACCTGCGTTA CCGGACGGCC
CGGTTCGACA ACCGCACCCG TCCACGCGGG TGGCCGCCAC CCTCGCTCCG GCACCGGGTG
GACACCACCC TGGCCTGGGT GGGTCGTCCG GCCCGGTGGG CTCCGGTCCG TGCGGTGCGC
GTGGAGCGGG TCGCCTTCGA CGTCCACGCC CTGGGCGCGG GGCGTCCTGC GGAAGGGGTG
GAGTACCGAC ACGGCACCCT GCACGGCTAC GAGGTGCGCG AGTACCTGCT GGCCAAGTGG
GGACGGGCGT GCGCCTACTG CGGTGCCACC GGCACCCCGC TCAACATCGA CCACGTCCGT
CCCCGTTCCC GAGGCGGAAC GGACCGGGTG GCCGACCTCG CGCTGGCCTG TGTCCCGTGC
AATCGGGCCA AAGCCGACCA GCCCGTCGAG GAGTTCGTGA CCAACCCCCG CGCTCTCGCC
GGGATCAGCG CGCAGACCAA GGCGCCGCTG CGCGATGCCG CAGCGGTCGA CGCCACTCGG
TGGGCGCTGT GGCGGTCCCT GGACGAGTGC CCGCCCACCC GTGTGGGATC GGGAGGTCGG
ACGAAGTGGA ACCGCACCCG CGACCACCTG CCCGAGTCCC ACACCCTGGA CGCCCTGGCC
GTGGGCAGGG TCGAAGCCGT CACCACGGCT GTTTCAACGG TCCTGGTCGT CGGGGCCGCT
GGGCGCGGCC CCTATGCCCG TACCCGGGCG GACAGGCACG GTTTCCCCCG GTTGCGTCTG
CCCCGCCGCA AACGGTTCTT CGGTTTCCAG ACCGGGGATC TGGCCCGAGC GGCCGTGCCG
TCGGACATGA ACGCGGGGAC GCACACAGGC CGGGTGGCGG TGCGCAGCAG CGGGAGCCAC
ACCCTGCAAA CCTCGTATGG GCCGATCAAG ACCTCATGGA AGAACCTGTG TCTGCTCCAG
CGAGCGGACG GTTACGGCTA CACCACCCAG AAGGAGGCAG GGGCTCCCTC CACCGCCTGA
 
Protein sequence
MPPGAGEQAR EDRPPKTTPR EGHPYVFVPD RHGTPLQPTH PARARRLLAR GRAIAARHTP 
FVIRLKDRTT AESGVDGVET GIDPGSKRAG TAVFTDRRGE RRGRCAIRPD HRGGQIAKRM
RQRAAYRKGR RSRNLRYRTA RFDNRTRPRG WPPPSLRHRV DTTLAWVGRP ARWAPVRAVR
VERVAFDVHA LGAGRPAEGV EYRHGTLHGY EVREYLLAKW GRACAYCGAT GTPLNIDHVR
PRSRGGTDRV ADLALACVPC NRAKADQPVE EFVTNPRALA GISAQTKAPL RDAAAVDATR
WALWRSLDEC PPTRVGSGGR TKWNRTRDHL PESHTLDALA VGRVEAVTTA VSTVLVVGAA
GRGPYARTRA DRHGFPRLRL PRRKRFFGFQ TGDLARAAVP SDMNAGTHTG RVAVRSSGSH
TLQTSYGPIK TSWKNLCLLQ RADGYGYTTQ KEAGAPSTA
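
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with a bacterial translation table in which an alternative start codon is read as methionine. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed. It assumes the codon assignments and start codons of NCBI translation table 11, and that whitespace in the sequence is only formatting; all names here (`translate_cds`, `gc_percent`, and so on) are hypothetical helpers, not part of the source database.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for translation table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and dropping a final stop."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with an assumed start codon")
    protein = "M" + "".join(CODON_TABLE[codon] for codon in codons[1:])
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCCCCGG GGGCCGGGGA GCAGGCCCGT"))  # prints: MPPGAGEQAR
```

The ten residues produced match the start of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the complete protein sequence and the GC content field to be checked the same way.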