Gene Ndas_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1054 
Symbol 
ID9244900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1299113 
End bp1301080 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content72% 
IMG OID 
ProductPeptidyl-dipeptidase Dcp 
Protein accessionYP_003679002 
Protein GI297560028 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.391003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.428625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGACA ACCCGTTCCT GTCCCCGAGC GAGCTTCCCT ACCGGCTCCC CGACTTCGCG 
GCCATCCGCG AGGAGCACTT CCTGCCCGCG TTCGACAAGG GCGTGGCCGA GCACCTCGCC
GAGGTGGACG CGATCGTCCG CGACCCGCGG CCCCCGACCT TCGACAACAC CATCGCGGCA
CTGGAGCGCT CCGGCGCGCT CCTGGCCAGG GTGGAGACCG TCCTGCACAC CCTGGCCGGT
TCCGACGCCA CCGACGGCAT CGAGGAGATC GAGCGGGAGA TCGCCCCGAG GGCCGCACAG
CACCGGGACG CCATCTCGCT GAACCGGGAC CTGTGGGAGC GCGTGCGGCA GGTCACCGCC
TCCGACCCGC AGGAGGCCTG GCTGCTGGAG CGGTACCGCC TCGACTTCGT CAAGGCGGGC
GCCGACCTGG ACGACGACCA GCAGGCCCGG CTGCGCGAGC TCAACACCGA ACTCGCCGGA
CTGAGCACCG AGTTCTCCCG CAACGTGGTC CGGGCCACCC GCGAGGCCTC CCTCGTCACC
GGCGACGTCT CCGACCTCGA CGGCCTGGAC GAGGCGCACA TCAGCGCGAT CGAGCGGGAC
GGGGAGTACG TCCTGCCCCT GCTCAACACC ACCGTGCAGC CCGCGCTGGC CCAGCTCACC
AACCGCGCCA CCCGTGAGCG GCTCTACACC CTGAGCGCCG AGCGCGCCCC CGAGAACCTC
GACATCGCCG CGCGCATGGC CGTCCTGCGC GCCGAGCGGG CCGCGCTGCT CGGCTACCCC
GACCACGCGG CCTACACGGT CGCGGACCAG ACCGCCAAGA CCGTCGACGC GGTCGAGGAG
CGGCTCGGCC AGCTCGTCGG ACCCGCCCGG CGCAACGTCG AGAAGGAGGC CCGGGCCCTG
GCCGAGCACG CCGGGCACGA CATCGAGCCC TGGGACTGGC CCTTCTACGC CGAGCAGGTG
CGCAGGGAGC GCTACGACTT CGACGACAGC GTCCTGCGCC CCTACTTCGA ACTCGGCAGG
GTGGTCCGGG ACGGCGTCTT CCACGCCGCG ACGCTGCTGT ACGGGATCAC CTTCGCCGAG
CGGCCCGACC TGCGCGGCTA CCACGAGGAC GTGCGGGTGT GGGAGGTGTT CGACCGGGAC
GGCTCGCCCC TGGGGCTGTT CCTGCTGGAC CCCTACGCCA GACCGACCAA GCGCGGCGGC
GCGTGGATGC ACAACCTGGT CGACCAGTCC TTCCTGCTGG ACGAGCGGCC GGTGGTGGTG
AACAACCTCA ACATCACCAA GCCTGCCTCG GGCCCCACCC TGCTCACCTT CGACGAGGTC
GAGACGGCCT TCCACGAGTT CGGCCACGCC CTGCACGGGC TGCTGTCGGC CGTGCGGTTC
CCGCGCGTGC AGGGCACGAG CGTGCCGCGC GACTTCGTGG AGTTCCCCTC CCAGGTGAAC
GAGATGTGGG CGACCTGGCC GGAGGTCCTG TCCCACTACG CCCGCCACCA CGAGACCGGT
GAGCCGGTGC CCGCCGAACT CGTGGAGCGC CTGACGGCCG CCCGCCAGTT CAACCAGGGC
TTCGCGACCT TCGAGTACCT GGCCGCGGCG CTCCTGGACT GGTCGTGGCA CCGCCTGGCC
CCGGGCGAGG CCGTGGAGGA CCCGGCCTCC TTCGAGGCGC GCGCCCTGGA GGCGGCGGGG
GCCCTGCACC CCCTGGTCCG TCCGCGCTAC CGGTCGGCGT ACTTCATGCA CGTGTTCGCC
AACGGCTACA GCGCGGGCTA CTACTCCTAC GTGTGGAGCG AGGTCCTGGA CGCCGAGAGC
GTGGAGTGGT TCACCGAGAA CGGCGGCCTC ACCCGGGAGG GCGGGGACCG CTTCCGGGAG
AGGGTGCTGT CCGTGGGCGG CGGTGTGGAC CCCATGGAGG CGGTCCGCGA CTTCCTGGGC
CGTGAGCCCC GGATGGAGCC CCTGCTGGTC CGCCGCGGGC TGGTCTGA
 
Protein sequence
MTDNPFLSPS ELPYRLPDFA AIREEHFLPA FDKGVAEHLA EVDAIVRDPR PPTFDNTIAA 
LERSGALLAR VETVLHTLAG SDATDGIEEI EREIAPRAAQ HRDAISLNRD LWERVRQVTA
SDPQEAWLLE RYRLDFVKAG ADLDDDQQAR LRELNTELAG LSTEFSRNVV RATREASLVT
GDVSDLDGLD EAHISAIERD GEYVLPLLNT TVQPALAQLT NRATRERLYT LSAERAPENL
DIAARMAVLR AERAALLGYP DHAAYTVADQ TAKTVDAVEE RLGQLVGPAR RNVEKEARAL
AEHAGHDIEP WDWPFYAEQV RRERYDFDDS VLRPYFELGR VVRDGVFHAA TLLYGITFAE
RPDLRGYHED VRVWEVFDRD GSPLGLFLLD PYARPTKRGG AWMHNLVDQS FLLDERPVVV
NNLNITKPAS GPTLLTFDEV ETAFHEFGHA LHGLLSAVRF PRVQGTSVPR DFVEFPSQVN
EMWATWPEVL SHYARHHETG EPVPAELVER LTAARQFNQG FATFEYLAAA LLDWSWHRLA
PGEAVEDPAS FEARALEAAG ALHPLVRPRY RSAYFMHVFA NGYSAGYYSY VWSEVLDAES
VEWFTENGGL TREGGDRFRE RVLSVGGGVD PMEAVRDFLG REPRMEPLLV RRGLV