Gene Ndas_0012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0012 
Symbol 
ID9243839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp15797 
End bp18352 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content76% 
IMG OID 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003677971 
Protein GI297558997 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.037328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0184635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCGA GGTGTGTCGC GGACGTCCGC GTCGCCAGGG CCGGGGCCGC CTCCAGGGGA 
GGGACCGGCT ACCTCGTCGG GCCCTCCCTC GTCCTCACCG CGGCCCATGT GCTCGACGGC
CACGAGCGTG TCGACGTGCG CCTGGGAGCC GACCGGCCGA GCGGTTTCCT GCGGTGCTCC
GTCGTCTGGT CGGACCGGAG CACGGACGTC GCCCTGCTGC GGCTGGAGGA GCCGCGGCCC
CACCCCTCCG TGCGCTGGGG GAAGCTCTCG GCGGACGAAC CGCAGCCGTA CCGTTCGCTC
GGCTACCCCG ACCTCGCGGC CAACGACGGG GGCCGCGACC TGAAGGACCT GCGGGGGACC
CTCAGCCCGT GGGACGGCAT GGTCAGGAAG GACCTCCGGC TCGACGTCCA GCGGGAGTAC
GGCAACGACG TGTCGTGGAA GGGGGCCTCC GGCACCGCCG TGTTCGTCAG GGAACGGCTG
GTGGGCGTCA TCGTCGACCA CGACCGGCGC AACGGGGACC TGATCGCCCG CCGGTCCACG
CGGTTCGCCC GCGATAAGGA CTTCCGGCGC CTGGTCAGGG AGGACGCGGG GACGGAGCCG
TGGCTCGTCC CGGTGGACCT CCCGCGCGCG GCGCGCAACA CCGCCAGGCG TTTCGTCCGC
GCCCGGCTCT CCTCCCTGCG CAGGCTTCCG CACACGCTCG CGGTGGCCGT CCGGGAACGG
CGCGGTCACG GCACGCGGGC GGGGACGGCG GCCGTCCCCG CCCGGAGGAG CCGTGCCACG
GCCGGGATGC GGGAGCGCCG TGGTGCCGGG CGACCGGTGT GGCCCCTGGC CGTCGCCGCC
GTCGCGCTGC TCACCCTGGT CCCGAACGGC GTGTACTGGC TGCGGGCCAA CGGCCACCTG
GGGCTGGGCC TCCAGTGCGC CCCGCCCACC GAGCTCACCG TGCTCACCAC CCCGGACCAG
CACGCCGTCG TCCAGCGGGC CGCCGACGAC TTCTCCGCCC ACGTCGGCGG CCGGGACGAG
AGGAGCGGGT GCGTACCCGT GCGCGTCAGC GTCACCGTCG CGGGCGGGGC CGCGCGGACG
CGCGAGCTGC TGCGCGAGGG GTGGACCGAC CTGCGGGCGG GGCCCCGCCC GCACGTGTGG
CTGCCGGACT CCAGCGCGGA CGTCGCCCTG CTGCGCGGGG ATCCGGGTGA CGCGCCCGGC
CTGGAGACGG GGGAGGGGAG CGCCACCAGG CTCACACCCG TGGTGCTGGG GCTGCCCGAG
TCCGCGGGCG AGGTCCCCGC CTGTTCCGGG ACCGACCCGG GCGGAGCGCG TTCGAACCTC
GTCGTCTGCA CGGCCGCGGC GGCGGAGGCC GGGCTCCTCC TGGCCCGCCC CTCACCGGAG
GCCTCGGTCT CGGCGCTGAT CCAGACCGAG GCCCTGTACT CCGCGTACGG GGACGGGGGC
GCCGAGGAGA TCCAGATCGC GGAGGCCCGC GCGACCTCGG CCGGGCTCGA CGCCGAGGGC
AACCTCGGCC TGCTGTGCGC GCTGCGCGGC GGGAGCGCCG ACCCCGCCTC CGTCGGGGTG
TTCAGCACCG AGTACGCGAT CAACGCCTAC AACACGGGCG CGCAGCTGGG CCCCGGGTGC
GGGCCGACGG GCGAGGAACC GGACGAGCCC CTGGTCCCCG TCTACTTCTC CGAGACGCCC
GGCCTGGACC ACCCGTTCGT ACGGCTGGAC TGGGGCGGCG ACGGGGTCGA ACGGGAGGCG
GAGGCGTTCG GGGAGTGGAT GCGGCACCCC CGGGAGACAA CGGCCTTCCA GGGCTACCGG
ACGGTGGAGG GCGCCATGAT CGGCGGGGAT GCGGAGCGGT CCCTGATGGA GGTCACACAC
CTCAACTCGG CCCGTCAGGA CGGGGAGTGG CGGGAGCGGC TGGAGTCCGG GCTGCGGCAG
CAGGAGCGCT CCCGCGCCCC GGTCGAGGTC CTGCTCGTGG TCGACCGCTC CGACTCCATG
AGCGGCCCCG GCACGGGCGG CACCCGGCTG GCGACCGCGC AACAGCTGGC CCGGACCGCC
GTCGGCCTCC TGGGGGAGCA GGACACCGTG GGGATCTGGA CCTTTCCCGA GGGCGGACCG
GGCAACGACG TCACCGGCCA GGAACGCGTC CTGCCCCCGG AACCGGGCAT GGACGCGCGC
AGGGAGGAGT CGGCGCGCGA GGCGATCGAC GCGCTGAGCG CCGAGTTCCC GGCCACCCCG
CTGGCCGACG CCGTCCGCGA CGGCGCCGAG GAACTGGAGT CCTGCGCCGA CCGTCCGGGG
GAGCCCGGCG CCTGCGCCCT GGTCGTCCTC ACCGACGGCG TCGCCCTCCC CGAGCCGCGC
GGCGGCGCGC GGGCGGACGA CGTGGCCGCC GTGCTGGAGG ACCTCGACGA GCGCGTGCGG
GTGCACGTGG TGTCCGTGGG CGACGAGGGG TGCGGCGGGG ACGGGCTGCT CAGCCGGCTG
GCGGGCGCGG GGGCCGAGTG CCACCACCCG CGCGCCGACG AACTCGAACA GGTCGTCTAC
GGCATCGTGG CCGGGACCAG GGCGGCGACG CCGTGA
 
Protein sequence
MDPRCVADVR VARAGAASRG GTGYLVGPSL VLTAAHVLDG HERVDVRLGA DRPSGFLRCS 
VVWSDRSTDV ALLRLEEPRP HPSVRWGKLS ADEPQPYRSL GYPDLAANDG GRDLKDLRGT
LSPWDGMVRK DLRLDVQREY GNDVSWKGAS GTAVFVRERL VGVIVDHDRR NGDLIARRST
RFARDKDFRR LVREDAGTEP WLVPVDLPRA ARNTARRFVR ARLSSLRRLP HTLAVAVRER
RGHGTRAGTA AVPARRSRAT AGMRERRGAG RPVWPLAVAA VALLTLVPNG VYWLRANGHL
GLGLQCAPPT ELTVLTTPDQ HAVVQRAADD FSAHVGGRDE RSGCVPVRVS VTVAGGAART
RELLREGWTD LRAGPRPHVW LPDSSADVAL LRGDPGDAPG LETGEGSATR LTPVVLGLPE
SAGEVPACSG TDPGGARSNL VVCTAAAAEA GLLLARPSPE ASVSALIQTE ALYSAYGDGG
AEEIQIAEAR ATSAGLDAEG NLGLLCALRG GSADPASVGV FSTEYAINAY NTGAQLGPGC
GPTGEEPDEP LVPVYFSETP GLDHPFVRLD WGGDGVEREA EAFGEWMRHP RETTAFQGYR
TVEGAMIGGD AERSLMEVTH LNSARQDGEW RERLESGLRQ QERSRAPVEV LLVVDRSDSM
SGPGTGGTRL ATAQQLARTA VGLLGEQDTV GIWTFPEGGP GNDVTGQERV LPPEPGMDAR
REESAREAID ALSAEFPATP LADAVRDGAE ELESCADRPG EPGACALVVL TDGVALPEPR
GGARADDVAA VLEDLDERVR VHVVSVGDEG CGGDGLLSRL AGAGAECHHP RADELEQVVY
GIVAGTRAAT P