Gene Ndas_3909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3909 
Symbol 
ID9247780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4681079 
End bp4683277 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content75% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003681812 
Protein GI297562838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTC CTCGCCAACA GGCCCGCACG CAGCGCTTCT CGATCGGTGT CCCCCGCGCG 
TTCCAGATCT CACCGGACGG ACGGCGCGTC GCCTTCCTCC GGGGGCGCGA CGGGGTGGAC
AAGGCCACCT GCCTGTGGGT GCACGACACG GAAGGGGCGG GAACGGACAC GGTGGTCGCC
GACCCCCGCT CCCTGGGCGC CGACGACGAG AACCTTCCGC CCGAGGAGCG GGCCCGCCGC
GAGCGGCTGC GCGAGAGCGG CGGCGGGATC GTGTCGTACT CGGTGGACGA GGCGTTCACC
CGCGCGGTGT TCACCCTGTC CGGACGGCTG TTCTACGTCG ACCTGGTCGG CGACGACACC
GCTCCGCGCG AACTGCCCGC CGCCACCCCC GTCGTGGACC CGCGCATCAG CCCGGCGGGT
GACCGGGTCG CCTACGTCAG CGGCGGCGCG GTGCGCGTCC TGGACATCGC CGCCGCCGAA
CCGGACCACG GCGACCGCCC GGTGGTCGAA CCGGACGGCC CCGACGTCAC GTGGGGCCTG
GCCGAGCTGG TGGCCGCCGA GGAGATGGGG CGCTACCGGG GCTTCTGGTG GGCTCCGGAC
GGCTCCGCGC TCGCCGTCGC GCGCGTGGAC GAGTCCGGGG TGAACACCTG GTACGTCTCC
GACCCCGGCA ACCCCGCCCA GGAGCCGACG GCCCTGCGCT ACCCGCCCGC GGGCGGCGCC
AACGCCGACG TGCGCCTGGC CGTGTTCCGG GTCGGCCCGC GCGGGGACGG GCGGCCCGAA
CCGGTCTGGG TCGAGTGGGA CCGCGAGGCC CTGCCCTATC TGGCCACGGT CGGGTGGACG
ACCGGGCCGG ACGGCACACC GACCGTGGTG TTCACCGCGC AGAGCCGCGA CCAGCGCACG
CTGACCCTGT TCAGCGCCGA TCCCGCGACC GGCCTGGTGG TGGAGTCGCG CACCGAGTCC
GACGGCGTGT GGGTCGAGCT GATGCCGGGC GTGCCCGCCT TCACCGGTGC GGGCGACCTG
GTGTGGATCG GCCGCGAGGC CGGGGGCGAG CGCCGGGTCT ACGTCGGCGA CGCCCCGGTC
AGCCCCCCGG ACGTGTACGT GCGCGGCGTG GTGGACGTGG ACGGCGACCG GCTCCTGTAC
TCGGGGTCGC CCGCCGGGAG CCCCGGGGAC GTGTCGCTGT GGCTGGTCGA GCTGGGCACG
GGCCTGGCCG CGCCGGTGGA GGTGCCCGGG CACGGGTGCA GCAGGTCCGC GGACTCGGGC
GCGCACAGCG GTCTGCGCTC GGGGCGGCTG CGCGGTGACA CGCTGGTGGT GCAGCACCGG
TCGATGGACT TCCCGGGCGC GCACACGGTG GTGCTGCGCG GCGCCGGTAC CGAGACGCGC
CGGTCCTGCT CGGAGATCGA GAGCCTGGCC GAGGCCCCGG ACCTGCCGGA GCCGCGCGTG
GAGTTCTGGC GCGCCGGTGA GCGCCGTATC CCGAGCGCCC TGGTGCTGCC GTCCTGGTAC
CGGGAGGGGC TGCGTCCGCT GCCGGTGCTG ATGGCGCCCT ACGGCGGCCC GCACGCCCAG
CGGGTGCTCA ACGCGCGCGG GGCGTACCTG ACCGCCCAGT GGTACGCCGA ACAGGGGTTC
GCGGTGCTGA TCGCGGACGG CCGGGGCACC CCGGGCCTCG GGGTGGAGTG GGAGCAGAGC
GTCCACCTCG ACCTGGCCGC GCCGGTCCTG GAGGACCAGG TGGCGGCGCT GGAGGACGCG
GCGGAGCGGT TCGACTTCCT GGACGTGTCG CGGGTGGGCA TCCACGGCTG GTCGTTCGGC
GGCTACCTGG CGGCGCTGGC GGTGCTGCGC CGCCCGGACG TGTTCCACGC GGCGGTGGCG
GGCGCGCCGG TCATCGACTG GGAGCTGTAC GACACCCACT ACACCGAGCG CTACCTGGGC
ACCCCCGGGG ACGAGCCGGA GGCCTACGGG CGCAGCTCGC TCCTGGCGGA GGCGGCCAAG
CTGGAGCGCC CGCTGATGAT GATCCACGGA CTGGCGGACG ACAACGTGGC CTTCGCGCAC
ACGCAGCGGA TGTCGTCGGC GCTGATGGCG GCGGGGCGCC CGCACACGGT GCTGCCGCTG
TCGGGGGTGA CGCACTCGCC CTCGGACCCG ACGGTCGCGG AGAACCTGAT GCTGCTCCAG
GTGGAGTTCC TCAAGGAGAA CCTGCGCGGC GAGGGGTAG
 
Protein sequence
MSFPRQQART QRFSIGVPRA FQISPDGRRV AFLRGRDGVD KATCLWVHDT EGAGTDTVVA 
DPRSLGADDE NLPPEERARR ERLRESGGGI VSYSVDEAFT RAVFTLSGRL FYVDLVGDDT
APRELPAATP VVDPRISPAG DRVAYVSGGA VRVLDIAAAE PDHGDRPVVE PDGPDVTWGL
AELVAAEEMG RYRGFWWAPD GSALAVARVD ESGVNTWYVS DPGNPAQEPT ALRYPPAGGA
NADVRLAVFR VGPRGDGRPE PVWVEWDREA LPYLATVGWT TGPDGTPTVV FTAQSRDQRT
LTLFSADPAT GLVVESRTES DGVWVELMPG VPAFTGAGDL VWIGREAGGE RRVYVGDAPV
SPPDVYVRGV VDVDGDRLLY SGSPAGSPGD VSLWLVELGT GLAAPVEVPG HGCSRSADSG
AHSGLRSGRL RGDTLVVQHR SMDFPGAHTV VLRGAGTETR RSCSEIESLA EAPDLPEPRV
EFWRAGERRI PSALVLPSWY REGLRPLPVL MAPYGGPHAQ RVLNARGAYL TAQWYAEQGF
AVLIADGRGT PGLGVEWEQS VHLDLAAPVL EDQVAALEDA AERFDFLDVS RVGIHGWSFG
GYLAALAVLR RPDVFHAAVA GAPVIDWELY DTHYTERYLG TPGDEPEAYG RSSLLAEAAK
LERPLMMIHG LADDNVAFAH TQRMSSALMA AGRPHTVLPL SGVTHSPSDP TVAENLMLLQ
VEFLKENLRG EG