Gene Ndas_0217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0217 
Symbol 
ID9244051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp271508 
End bp274159 
Gene Length2652 bp 
Protein Length883 aa 
Translation table11 
GC content72% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003678173 
Protein GI297559199 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.161537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTACCCC ACCGCACGCG CCTGGCGTCG CTCGCCGGGT TGACACTCGT CGCGGGCCTG 
GTCGCGGCCG CACCGGCCAG CGCCGACGAC GCCTCCGAGC TCGCCCCCCT GCACCCCGCC
CCCTCCGCCG AGAACGGCGA ACTGACCCAC CAGGCGGACG GCCTGTGGTT CATCGAACTG
GAGAGCCCCC CGACCACCGC CGGCACCTCC TCGGCACAGG TCGAGAACGA GCACGAGGCG
TTCCGCGCCG AGGCCGAGGA GTCGGGCCTG GAGTACACCG AACGGCACTC GTTCGGCGAC
CTCTGGAACG GCTTCTCCGT GGAGATGGAC GACGCGCAGG TGGGCACGGC ACGGGAGATC
CCCGGTGTCA GCGCCATCTA CCCCGTCGTC ACCTACGAGG TCCCCGAGGG CGACACCAGC
GTCCCCGACC TGGACACCGC GCTGCCGATG ACCGGGTCCG ACATCGCCCA GAACGAACTC
GGCCTCACCG GCGAGGGACT GCGCGTGGCA GTCATGGACA CCGGCGTCGA CTACACCCAC
CCCGACCTGG GCGGCGGCGG CTTCCCGAAC GACAGGGTGG TCACCGGGTA CGACTTCGTC
GGCGACGACT TCAACGCGGG TGACCCCACC ACCGTGCCCG CCCCCGACGA CGACCCCCAG
GACTGCCACG GCCACGGCAC CCACGTGGCC GGGATCGTCG GCGCCCGGGG CGAGGTCACC
GGTGTGGCGC CCGGCGTGGA CTTCGGCGCC TACAAGGTGT TCGGCTGCGA GGGCTCCACC
ACCTCCGACA TCATGATCGC CGCCATGGAG CGGGCCCTGG CCGACGACAT GGACGTGCTC
AACATGAGCA TCGGCTCGGC GCACTCCTGG CCGCAGTACC CCACCGCGGT CGCCTCCGAC
AACCTGGTCG ACGAGGGCAT GGTCGTGGTC GCCTCCATCG GCAACGAGGG CGACACCGGC
CTGTACTCCG CGGGCGCCCC CGGCCTGGGC GAGGACGTGA TCGGCGTCGC CTCCTACGAC
AACACCCACA TCCGGTCCGC GTCGGCGACC GCCAACCCCA GCGGCGAGAC CCTGGCCTAC
ATGGAGATGG GCGAGGCCTC CCCGCCGCCC GCCTCGGGCG AGACCGACGA ACTCGTCCAC
GTCGGACGCG GCTGCCCGTC CCTGGGCGAC GAACTGGAGG CCGACCCCGA GGGCAGAACA
GCCCTGATGG TGCGCGGCGC GTGCACCTTC GCCGAGAAGT ACGACGCGGC CGTGGCCGCG
GGCGCCACCG GCGTGGTGAT GTACAACAAC GTCCCCGGCA TGTTCGCGGG CGGCGGCATC
GTCGACCAGG GCGCGTTCTC CATCGGGATC TCCGACACCT CCGGCGCCCA CCTGCTGGAA
CTCCTGGAGG GCGACGAGCC CGTCACCCTG TCCTGGACCG GTGAGAGCAC CACCATCCCC
AACCCGACCG GCGGGCTGAT CAGCTCCTTC AGCTCCTTCG GCCTGTCCCC CGACCTGGCC
CTGAAGCCGG ACATCGGCGC GCCCGGCGGC CTGATCAACT CCACCTACCC CATGGCCAAG
GGCGGCTACG CCACCATCAG CGGCACCTCG ATGTCCTCGC CGCACGTGGC GGGCGGCGTC
GCGCTGCTGC TGGAGGCCCG CCCCGACCTC GGCGCGCACG AGGTGCGCGA CGTCCTCCAG
AACAGCGCCG ACCCCAAGGC CTGGTGGGGT GACCCCGACG CGGGCTACAC CGACAACGTC
CACCGCCAGG GCGCGGGCAT GATGGACGTG CCCGGCGCCG TGCTGGCCAC CACCGCCGTG
ACCCCGGGCA AGCTGTCCCT GGGCGCCACC GAGGGCGAGG TGACCGAGAC GATCACCATC
GCCAACGACG GTGACGAGGA GGCCACCTAC ACCCTGGACC ACGAAAGCGC TCTGGGCACC
CACGGCAACA CCTTCACCCC CGGTTACAAC GACGCCTCCG CCGAGGTGGC CTTCGACCGG
GACGAGGTGA CCGTCGCGCC GGGCGGCACG GCCGAGGTGC AGGTGACCTT CACCCCGCCC
GCCCAGGACT TCCAGCAGAT GATCTACGGC GGCTACGTCT CCGTGGCCGA GACCGGCGGC
GAGACCTACC GGGTCCCCTA CGCCGCCTAC AACGGCGACT ACCAGCAGAT CGAGGCCATG
ACCCCGATCA CCGACGGCAG CGGCAACGTG CTCGAACTGC CGTGGCTGAC CAGGATCACC
GAGTGCGGCG CGTTCTCCGG CCTGGAGTGC GTGGGCGAGG GCGGCGGCAC GTTCGAGAAC
CAGCCCGAGG GCGCCGCCTA CACCCTGGAG TGGGTCGACG GCCTGCCCGA CGTCCCGTAC
GTCATCGCGC ACTTCGACCA CCACGTCACG CTGCTGGAGA TGACCGTCGT CGACGAGCGC
ACCGGGCGCC CCGTGCACCC GGACCGCAAC GTCGGGGTGT CGGTGGACCA CGTGAACCGC
AGCGCGACCG GGACGTCGTT CTTCAGCTAC GCGTGGGACG GCACGGTCCT GGACCGCCAC
GACCGGATCA CCCCGGTCCG GGACGGCCAG TACCGCCTGG AGGCCCGCGC GCTCAAGGCG
CTCGGCGACC CGGACAACCC CGACCACTGG GAGACCTGGA CCTCCCCGGT CATCACCATC
GACCGCGGCT AG
 
Protein sequence
MVPHRTRLAS LAGLTLVAGL VAAAPASADD ASELAPLHPA PSAENGELTH QADGLWFIEL 
ESPPTTAGTS SAQVENEHEA FRAEAEESGL EYTERHSFGD LWNGFSVEMD DAQVGTAREI
PGVSAIYPVV TYEVPEGDTS VPDLDTALPM TGSDIAQNEL GLTGEGLRVA VMDTGVDYTH
PDLGGGGFPN DRVVTGYDFV GDDFNAGDPT TVPAPDDDPQ DCHGHGTHVA GIVGARGEVT
GVAPGVDFGA YKVFGCEGST TSDIMIAAME RALADDMDVL NMSIGSAHSW PQYPTAVASD
NLVDEGMVVV ASIGNEGDTG LYSAGAPGLG EDVIGVASYD NTHIRSASAT ANPSGETLAY
MEMGEASPPP ASGETDELVH VGRGCPSLGD ELEADPEGRT ALMVRGACTF AEKYDAAVAA
GATGVVMYNN VPGMFAGGGI VDQGAFSIGI SDTSGAHLLE LLEGDEPVTL SWTGESTTIP
NPTGGLISSF SSFGLSPDLA LKPDIGAPGG LINSTYPMAK GGYATISGTS MSSPHVAGGV
ALLLEARPDL GAHEVRDVLQ NSADPKAWWG DPDAGYTDNV HRQGAGMMDV PGAVLATTAV
TPGKLSLGAT EGEVTETITI ANDGDEEATY TLDHESALGT HGNTFTPGYN DASAEVAFDR
DEVTVAPGGT AEVQVTFTPP AQDFQQMIYG GYVSVAETGG ETYRVPYAAY NGDYQQIEAM
TPITDGSGNV LELPWLTRIT ECGAFSGLEC VGEGGGTFEN QPEGAAYTLE WVDGLPDVPY
VIAHFDHHVT LLEMTVVDER TGRPVHPDRN VGVSVDHVNR SATGTSFFSY AWDGTVLDRH
DRITPVRDGQ YRLEARALKA LGDPDNPDHW ETWTSPVITI DRG