Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3909 |
Symbol | |
ID | 9247780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4681079 |
End bp | 4683277 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003681812 |
Protein GI | 297562838 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTC CTCGCCAACA GGCCCGCACG CAGCGCTTCT CGATCGGTGT CCCCCGCGCG TTCCAGATCT CACCGGACGG ACGGCGCGTC GCCTTCCTCC GGGGGCGCGA CGGGGTGGAC AAGGCCACCT GCCTGTGGGT GCACGACACG GAAGGGGCGG GAACGGACAC GGTGGTCGCC GACCCCCGCT CCCTGGGCGC CGACGACGAG AACCTTCCGC CCGAGGAGCG GGCCCGCCGC GAGCGGCTGC GCGAGAGCGG CGGCGGGATC GTGTCGTACT CGGTGGACGA GGCGTTCACC CGCGCGGTGT TCACCCTGTC CGGACGGCTG TTCTACGTCG ACCTGGTCGG CGACGACACC GCTCCGCGCG AACTGCCCGC CGCCACCCCC GTCGTGGACC CGCGCATCAG CCCGGCGGGT GACCGGGTCG CCTACGTCAG CGGCGGCGCG GTGCGCGTCC TGGACATCGC CGCCGCCGAA CCGGACCACG GCGACCGCCC GGTGGTCGAA CCGGACGGCC CCGACGTCAC GTGGGGCCTG GCCGAGCTGG TGGCCGCCGA GGAGATGGGG CGCTACCGGG GCTTCTGGTG GGCTCCGGAC GGCTCCGCGC TCGCCGTCGC GCGCGTGGAC GAGTCCGGGG TGAACACCTG GTACGTCTCC GACCCCGGCA ACCCCGCCCA GGAGCCGACG GCCCTGCGCT ACCCGCCCGC GGGCGGCGCC AACGCCGACG TGCGCCTGGC CGTGTTCCGG GTCGGCCCGC GCGGGGACGG GCGGCCCGAA CCGGTCTGGG TCGAGTGGGA CCGCGAGGCC CTGCCCTATC TGGCCACGGT CGGGTGGACG ACCGGGCCGG ACGGCACACC GACCGTGGTG TTCACCGCGC AGAGCCGCGA CCAGCGCACG CTGACCCTGT TCAGCGCCGA TCCCGCGACC GGCCTGGTGG TGGAGTCGCG CACCGAGTCC GACGGCGTGT GGGTCGAGCT GATGCCGGGC GTGCCCGCCT TCACCGGTGC GGGCGACCTG GTGTGGATCG GCCGCGAGGC CGGGGGCGAG CGCCGGGTCT ACGTCGGCGA CGCCCCGGTC AGCCCCCCGG ACGTGTACGT GCGCGGCGTG GTGGACGTGG ACGGCGACCG GCTCCTGTAC TCGGGGTCGC CCGCCGGGAG CCCCGGGGAC GTGTCGCTGT GGCTGGTCGA GCTGGGCACG GGCCTGGCCG CGCCGGTGGA GGTGCCCGGG CACGGGTGCA GCAGGTCCGC GGACTCGGGC GCGCACAGCG GTCTGCGCTC GGGGCGGCTG CGCGGTGACA CGCTGGTGGT GCAGCACCGG TCGATGGACT TCCCGGGCGC GCACACGGTG GTGCTGCGCG GCGCCGGTAC CGAGACGCGC CGGTCCTGCT CGGAGATCGA GAGCCTGGCC GAGGCCCCGG ACCTGCCGGA GCCGCGCGTG GAGTTCTGGC GCGCCGGTGA GCGCCGTATC CCGAGCGCCC TGGTGCTGCC GTCCTGGTAC CGGGAGGGGC TGCGTCCGCT GCCGGTGCTG ATGGCGCCCT ACGGCGGCCC GCACGCCCAG CGGGTGCTCA ACGCGCGCGG GGCGTACCTG ACCGCCCAGT GGTACGCCGA ACAGGGGTTC GCGGTGCTGA TCGCGGACGG CCGGGGCACC CCGGGCCTCG GGGTGGAGTG GGAGCAGAGC GTCCACCTCG ACCTGGCCGC GCCGGTCCTG GAGGACCAGG TGGCGGCGCT GGAGGACGCG GCGGAGCGGT TCGACTTCCT GGACGTGTCG CGGGTGGGCA TCCACGGCTG GTCGTTCGGC GGCTACCTGG CGGCGCTGGC GGTGCTGCGC CGCCCGGACG TGTTCCACGC GGCGGTGGCG GGCGCGCCGG TCATCGACTG GGAGCTGTAC GACACCCACT ACACCGAGCG CTACCTGGGC ACCCCCGGGG ACGAGCCGGA GGCCTACGGG CGCAGCTCGC TCCTGGCGGA GGCGGCCAAG CTGGAGCGCC CGCTGATGAT GATCCACGGA CTGGCGGACG ACAACGTGGC CTTCGCGCAC ACGCAGCGGA TGTCGTCGGC GCTGATGGCG GCGGGGCGCC CGCACACGGT GCTGCCGCTG TCGGGGGTGA CGCACTCGCC CTCGGACCCG ACGGTCGCGG AGAACCTGAT GCTGCTCCAG GTGGAGTTCC TCAAGGAGAA CCTGCGCGGC GAGGGGTAG
|
Protein sequence | MSFPRQQART QRFSIGVPRA FQISPDGRRV AFLRGRDGVD KATCLWVHDT EGAGTDTVVA DPRSLGADDE NLPPEERARR ERLRESGGGI VSYSVDEAFT RAVFTLSGRL FYVDLVGDDT APRELPAATP VVDPRISPAG DRVAYVSGGA VRVLDIAAAE PDHGDRPVVE PDGPDVTWGL AELVAAEEMG RYRGFWWAPD GSALAVARVD ESGVNTWYVS DPGNPAQEPT ALRYPPAGGA NADVRLAVFR VGPRGDGRPE PVWVEWDREA LPYLATVGWT TGPDGTPTVV FTAQSRDQRT LTLFSADPAT GLVVESRTES DGVWVELMPG VPAFTGAGDL VWIGREAGGE RRVYVGDAPV SPPDVYVRGV VDVDGDRLLY SGSPAGSPGD VSLWLVELGT GLAAPVEVPG HGCSRSADSG AHSGLRSGRL RGDTLVVQHR SMDFPGAHTV VLRGAGTETR RSCSEIESLA EAPDLPEPRV EFWRAGERRI PSALVLPSWY REGLRPLPVL MAPYGGPHAQ RVLNARGAYL TAQWYAEQGF AVLIADGRGT PGLGVEWEQS VHLDLAAPVL EDQVAALEDA AERFDFLDVS RVGIHGWSFG GYLAALAVLR RPDVFHAAVA GAPVIDWELY DTHYTERYLG TPGDEPEAYG RSSLLAEAAK LERPLMMIHG LADDNVAFAH TQRMSSALMA AGRPHTVLPL SGVTHSPSDP TVAENLMLLQ VEFLKENLRG EG
|
| |