Gene Ndas_1266 details


Gene Information

Locus tag: Ndas_1266
Symbol:
ID: 9245116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1570295
End bp: 1572190
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003679211
Protein GI: 297560237
COG category:
COG ID:
TIGRFAM ID:
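
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1570295, 1572190)
print(length)                           # 1896
print(expected_protein_length(length))  # 631
```

Both results agree with the Gene Length and Protein Length fields of this record.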


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.211135
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTGC CCGACCTGAT CCCGGTCGAG GACTTCTTCA GCCCACCGGA GCGCGCCGGT 
GCGACGATCT CGCCGGACGG CACCCGCATC GCCTACCTGG CGCCGTGGAG GGACCGGCTC
AACGTCTGGG TCCAGGGCCT CGACCCCTCC GGAGACCTCG ACGGCGAACC GCGCTGCGTG
ACCGCGGACC AGGCCCGCAG CGTGCAGAGC TACCAGTGGA CCGACGATCC GCGCTGGCTG
CTCTACCTCC AGGACAGCGG CGGCGACGAG AACTGGCACC TGTTCCGGGT CGACCTGGAC
GCCGCCGAGC CCACCGCGGT GGACCTGACA CCCTTCCCCG GCGCCCGGGT CGTCAGCTTC
GAACCGTCCC GGGGTCGGCC GGGGAAGCTG GCCGTCTCGC TCAACGCCCG CGACGCCGCC
GCGTTCGACC TGCACGAACT CGACGTGGCC ACCGGCGAGC TCACCGTGCT GGCCGAGAGC
CCCGGAGCCG CAAAGGTCTG GGCACAGGGC CGGGAGGGCG AACTGATCGC CACCGCGCTC
AACGCCGACA ACGACTTCGA GATCTCCCGC CACGACTCCG AGACGGACAC CACGAGCACC
GTCCTGGTGT ACGAGGGCGC CGACTACCCG CTGGGCGTCT TCCCTGCGCA GGTCACACCG
GACGGCACCG GGATGTGGAT CGGCTCCAGC CGGGGCACCG ACCGCACCCG CCTGGTGCGC
GTGGACCTGT CCACGGGTGA GGAGACCGAG GTCGACAGCC ACGCGACCTT CGACATCGAT
ACGCGCGCGC AGGTCTTCCC CACCTTTCCC TCACCGCTGA TCCGGGACCG GCGGGGCGAG
CTGCTGGGGG TGCGCTACCT GGGTGAGCGC CAGGTCGTCC ACGCCCTGGA CCCGGACTTC
GCCGAGGTGC TGGCCAACCT GGAGAAGCTG TCCGAGGGCG ACCTGGCCGC GGTCTCCTGT
GACGACAGCG GGCAGCGATG GGTCGTGGGC TTCACCCACG ACCGCGCCCC GGGCGCGACC
TGGTTCTACG ACCACTCCAC CGGGGAGTCC CGGCTGCTGT TCCGCCCCCA CCCGCACCTG
GACCCCGACG CCATGGCCCC GATGCGGCCG GTCACCATCA CCGCGCGCGA CGGGCTGGAG
CTGCCCTCCT ACCTGACCCT GCCGGTGGGC GTCGAGCCGA GGAACCTGCC GATGGTGCTG
CTGGTGCACG GCGGCCCGTG GGCGCGGGAC GCCTGGGGCT TCGATCCGAC CGTGCAGCTG
CTGGCCAACC GGGGCTACGC GGTGCTCCAG GTCAACTTCC GCGGCTCCAC CGGGTTCGGC
AAGGCCCACA TGAAGGCCGC GATCGGCGAG TTCGCCGGGA AGATGCACGA CGACCTCATC
GACGCCGTGG ACTGGGCGGT GGAGCGGGGC TACGCCGACC CGGACCGGGT CGCGATCTTC
GGAGGTTCCT ACGGCGGCTA CGCCGCGCTC GTGGGGGTCA CCTTCACCCC CGACCGCTTC
GCCGCCGCCG TCGACTACGT CGGCATCTCC GACCTGGCCA ACTTCATGCG CAACCAACCC
GTCTTCGTGC GGCCCGCGCT GGCCAACAAC TGGTACCGCT ACGTCGGCGA CCCGGACATC
CCCGAACAGG AGGCCGACAT GCTGGCCCGC TCGCCGATCA GCCGGGTGGA CCGGATCACC
GCGCCCTTGT TCGTGGCGCA GGGGGCCAAC GACGCCCGCG TCGTCAAGGC CGAGTCCGAC
AACATCGTCG CCGCTCTGCG CGAGCGCGGC GTGGACGTGG AGTACCTGCT CAAGGAGGAC
GAGGGACACG GGTTCGTCAA CCCGGAGAAC CAGCTCGACC TCCACCGCGC GGCCGAGCGC
TTCCTCGCCC GCCACCTCGA CGAACGCCGG GACTGA
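
The GC content field can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one possible way to do it, assuming GC content is defined as the percentage of G and C bases over all bases; `gc_percent` is a hypothetical helper name, and the call shown uses a toy input rather than the gene itself.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence as one string gives a value that can be compared with the reported GC content.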
 
Protein sequence
MALPDLIPVE DFFSPPERAG ATISPDGTRI AYLAPWRDRL NVWVQGLDPS GDLDGEPRCV 
TADQARSVQS YQWTDDPRWL LYLQDSGGDE NWHLFRVDLD AAEPTAVDLT PFPGARVVSF
EPSRGRPGKL AVSLNARDAA AFDLHELDVA TGELTVLAES PGAAKVWAQG REGELIATAL
NADNDFEISR HDSETDTTST VLVYEGADYP LGVFPAQVTP DGTGMWIGSS RGTDRTRLVR
VDLSTGEETE VDSHATFDID TRAQVFPTFP SPLIRDRRGE LLGVRYLGER QVVHALDPDF
AEVLANLEKL SEGDLAAVSC DDSGQRWVVG FTHDRAPGAT WFYDHSTGES RLLFRPHPHL
DPDAMAPMRP VTITARDGLE LPSYLTLPVG VEPRNLPMVL LVHGGPWARD AWGFDPTVQL
LANRGYAVLQ VNFRGSTGFG KAHMKAAIGE FAGKMHDDLI DAVDWAVERG YADPDRVAIF
GGSYGGYAAL VGVTFTPDRF AAAVDYVGIS DLANFMRNQP VFVRPALANN WYRYVGDPDI
PEQEADMLAR SPISRVDRIT APLFVAQGAN DARVVKAESD NIVAALRERG VDVEYLLKED
EGHGFVNPEN QLDLHRAAER FLARHLDERR D
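
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code for every codon (the table differs from the standard one in which start codons it permits), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order, third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCACTGC CCGACCTGAT CCCGGTCGAG"))  # MALPDLIPVE
```

The output matches the first ten residues of the protein sequence listed in this record.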