Gene Ndas_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1669 
Symbol 
ID9245519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2039937 
End bp2041826 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content71% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003679604 
Protein GI297560630 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTGC CCGCGAGGAT CTCGGTCGAG GACTTCTTCG GTTCGCCGGA GCGCGCCGGT 
GCGACGATCT CGCCGGACGG CGCGCGGATC GCCTACCTGG CTCCGTGGAG GGACCGGCTC
AACGTCTGGG TCCAGGACCT CACCCCGTCC GGGGACTTCG ACGGCGAACC GCGCTGCGTG
ACCGCCGACG AGGTCCGCAG CGTGCAGAGC TACCAGTGGA CCGAGGACCC GCGCTGGCTG
CTCTACCTCC AGGACAGCGG CGGCGACGAG AACTGGCACC TGTTCCGGGT CGACCTGGAG
GCCCCCGAGC CCACCGCCGT GGACCTGACC CCCTTCCCCG GCGCCCGGGT GGTGGGCTTC
GAGCCCTCCC GGGGAAGGCC GGGGAGGATG ACCGTCCTGC TCAACGCCCG CGACGCCGCC
GAGTTCGACC TGCACGAACT CGACGTGGCC ACCGGCGGGC TCACCATGCT GGCCCAGAGC
CCCGGCGCCA CGGGGACCTG GTTCCAGGGC CGCGAGGGGG AGCTGTTCAC CAGCTCCCTC
AACGCCGACG GCGACTTCGA GGTCTCCCGC CGCGACCCCG AAACGGGCGC CCTGCACCCG
GTCCTGGTCC ACGAGGGCGC CGACTACCCG GTGGGAGTCT TCCCCACCCA GGTCACCCCG
GACGGGACCG GGATGTGGAT CGGCTCCAGC AAGGGCACCG ACCGCACCCG CCTGGTGCGA
GCCGACCTGT CCACGGGTGA GGAGACCGAG GTGGACAGCC ACCCGACCTT CGACATCGAC
ACGCGCGCAC AGGTCTTCCC CACCTTTCCC CCGCCGCTGA TCCGGGACCG GCGGGGCGAG
CTGCTGGGCG TGCGCTACAC GGGTGAGCGC CAGGTCGTCC ACGCCCTGGA TCCGCACTTC
GCCGAGGTGC TGGCGAACCT GGAGAAGCTG TCCGAGGGCG ACCTGGCAGC GATCTCCTGC
GACGACGGCG GGCGGCGGTG GGTCGTGGGC TTCACCCACG ACCGGGACCC GGGTGTGACC
TGGTTCTACG ACCACTCCAC CGGGGAGTCC CGGCTGCTGT TCCGCGCCCA CCCGCACCTG
GACCCCGACG CGATGGCCCC CATGCGGCCG GTCACCATCA CCGCGCGCGA CGGGCTGGAA
CTGCCCTCGT ACCTGACCCT GCCGGTGGGT GTCGAACCCG AGAACCTGCC GATGGTGCTG
ATGGTGCACG GCGGTCCCTG GGCCCGCGAC AACTGGGGGT TCAACGGTTC CGCGCAGCTG
TGGGCCAACC GGGGCTACGC GGTGCTCCAG GTCAACTTCC GCGGCTCCAG CGGGTTCGGC
AAGGCCCACA TGAAGGCGGC GATCGGCGAG TTCGCCGGGA AGATGCACGA CGACCTCATC
GACGCCGTGG ACTGGGCTGT GGAGCAGGGC TACGCCGACC CGGACCGGGT GGCGATCCTG
GGCGGCTCCT ACGGCGGCTA CGCCGCGCTG GTCGGGGCGG CCTTCACCCC CGACCGCTTC
GCCGCCGCCG TCGACGTCGT CGGCATCTCC GACCTGGCCA ACTTCATGCG GACCCAGCCC
GCGTTCGTGC GACCCGCGCT GGTCAACAAC TGGTACCGCT ACGTGGGCGA CCCGGCCGTC
CCCGAGCAGG AGGCCGACAT GCTGGCCCGC TCACCGATCA GCAGGGTGGA CCGGATCGCC
GCGCCGCTGA TGGTCGTCCA GGGGGCCAAC GACGCCCGCG TGGTCAAGGC CGAGTCCGAC
AACATCGTCG CGTCGGTGCG CGGGCGCGGC GTGGACGTGG AGTACCTGGT CTTCGACGAC
GAGGGGCACG CCATCGTCAA CCCGGAGAAC CTGATCACCA TGTTCGGCGC CATCGACCGC
TTCCTCGCCC GCCACCTCGG CGGACGGTGA
 
Protein sequence
MALPARISVE DFFGSPERAG ATISPDGARI AYLAPWRDRL NVWVQDLTPS GDFDGEPRCV 
TADEVRSVQS YQWTEDPRWL LYLQDSGGDE NWHLFRVDLE APEPTAVDLT PFPGARVVGF
EPSRGRPGRM TVLLNARDAA EFDLHELDVA TGGLTMLAQS PGATGTWFQG REGELFTSSL
NADGDFEVSR RDPETGALHP VLVHEGADYP VGVFPTQVTP DGTGMWIGSS KGTDRTRLVR
ADLSTGEETE VDSHPTFDID TRAQVFPTFP PPLIRDRRGE LLGVRYTGER QVVHALDPHF
AEVLANLEKL SEGDLAAISC DDGGRRWVVG FTHDRDPGVT WFYDHSTGES RLLFRAHPHL
DPDAMAPMRP VTITARDGLE LPSYLTLPVG VEPENLPMVL MVHGGPWARD NWGFNGSAQL
WANRGYAVLQ VNFRGSSGFG KAHMKAAIGE FAGKMHDDLI DAVDWAVEQG YADPDRVAIL
GGSYGGYAAL VGAAFTPDRF AAAVDVVGIS DLANFMRTQP AFVRPALVNN WYRYVGDPAV
PEQEADMLAR SPISRVDRIA APLMVVQGAN DARVVKAESD NIVASVRGRG VDVEYLVFDD
EGHAIVNPEN LITMFGAIDR FLARHLGGR