Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2658 |
Symbol | |
ID | 9246509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3166334 |
End bp | 3168262 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003680581 |
Protein GI | 297561607 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.55127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.367777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTC CGTCGAAGCT GATCACCGTC GAGGAGTTCT TCGGCCCGCC CGTCCGCAGC CGGGCCCTTC TGTCACCCGA CGGCACGAGG GTCGCCTACC TCGCTCCCTG GCGCGGTCGG CTCAACGTGT TCGTCCGCGA CCCGGACTCG GACTGGACCG CCCCCGACCA GGTACACGAG GCCGACGCCC CGGGCGTCCG GCGCGTCACC TCCGACACCC GGCGCAACAT CGACGCCTTC TTCTGGACCG CCGACGGCCG CTACCTGCTG TTCCAGCAGG ACACCGACGG CGACGAGAAC TGGCACCTGC ACCGCGTGGA CCCGAACCGG CCCGACGAGC CCGCGGTCGA CCTGACCCCC TTCGAGGGCG TGCGGCTGCT CGGCGCGCAA CTCCCGCCGG ACCGCCCCGG CACCGCCTTC GTACAGCTCA ACATGCGCCG TCCCGACCTG GCCGACCTGT TCGAGCTCGA CCTGGAGACC GGCCGCCTGA CCACCGCCGC GGAGAACCCC GGCGACGTCC TCTCCTGGCT GCGCACCCCG GACCGGCTGC TGGCGTTCAC CATGGAGGAG GGAGGCGACC ACGTGCTGTC GGAGCACACC GAGGGCGCAC GGCGCGCGAT CGCGCGGTTC CCCGGCACCG ACGCCCTCTT CGGCATCCTC CCGGCTGTGC TCACCCCGGA CGGGAACGGA CTGTGGATCG GTTCGTCGCG GGGTTCCGAC CGCACCCGCC TGGTCCGGCT CGACCTGGAG ACCGGCGAGC AGGCCGGCGT GGACAGCCAC CCCGTGTTCG ACCTGGACAC CCCGCGCCCC GAGGCCGACC CGCGTTTCCC GTCCTCGCTG ATCCTGCACC CGGGAACCGG GGACCTGCTC GGCGCCCGCT ATCTCGGCAC CCGTCAGGAG ATCCACGCGC TCGACCCGCG CTTCGCCGAG GTCCTGCCGC GGCTGGCCGA GCTGTCCGAC GGCGACCTGG CCCACGTCTC CTGCGACACC GCGGCGCGGC GCTGGGTGGT GGACTTCACC CACGACCGCG ACCCCGGCGT CACCTGGTTC TACGACCACG CCACGGGACG GGCGCGCCGC CTCTTCCGGC CCTTCCCCCA CCTGGACCCG GCCGAGTTGG CCCCGGTCAC CCCGGTCACC GTCAGCGCCC GCGACGGCCT GACCCTGCCC TGCCACCTCA CCCTGCCGGT CGGGGTCGAA CCGCGCGACC TGCCGACCGT GCTGCTGGTG CACGGCGGAC CGTGGTACCG CGACAGCTGG TGCTACGACC CGGAGGTGCA ACTCCTGGCC AACCGCGGTT ACGCGGTGCT GCAGGTCGAC TTCCGCGGCT CCACCGGCTA CGGCAAGGCC CACACACAGG CCGCGATCGG CCAGTTCGCC GGGCGCATGC ACGACGACCT GATCGACGCC CTCGACTGGG CGGTCGAACA GGGCTACACC GACCCGGACC GGGTGGCGGT CTACGGCTGC TCCTACGGCG GTTACGCGGC GCTGGTCGGA GCGGCGTTCA CCCCGGACAG GTTCGCCGCC GCGGTCAGCT ACACCGGAAT GTCCGACCTG GTCGACCTCG TCGAGTCGGT CGTCCCGTTC GCCCGCCGTA CCGTCGAGAA CAGCTACCTG CGCTACATCG GCGACCCGGA CGACCCCCGC CAGAGGGCCG ACATGCTCGC CCGCTCGCCC ATCAGCCGGG TCGACGACAT CACCGCGCCG GTTCTGCTGA TCCACGGCGC CAACGACGTC CGCGTCCACC GGCGCAACTC CGACCGGGTC TTCGACGCGC TCCGCTCCCG CGGCGCCGAG GTCGAGTACC TGCTGAACGA GACCGAGGGC CACTGGTTCA CCAACCCGGA CAGCAACATC GAGTTGTACG GGAGGCTGGA GCGCTTCCTG GCCCGCCACC TGGGCGGGCG GTCCGCGACC GGGTCCTGA
|
Protein sequence | MTAPSKLITV EEFFGPPVRS RALLSPDGTR VAYLAPWRGR LNVFVRDPDS DWTAPDQVHE ADAPGVRRVT SDTRRNIDAF FWTADGRYLL FQQDTDGDEN WHLHRVDPNR PDEPAVDLTP FEGVRLLGAQ LPPDRPGTAF VQLNMRRPDL ADLFELDLET GRLTTAAENP GDVLSWLRTP DRLLAFTMEE GGDHVLSEHT EGARRAIARF PGTDALFGIL PAVLTPDGNG LWIGSSRGSD RTRLVRLDLE TGEQAGVDSH PVFDLDTPRP EADPRFPSSL ILHPGTGDLL GARYLGTRQE IHALDPRFAE VLPRLAELSD GDLAHVSCDT AARRWVVDFT HDRDPGVTWF YDHATGRARR LFRPFPHLDP AELAPVTPVT VSARDGLTLP CHLTLPVGVE PRDLPTVLLV HGGPWYRDSW CYDPEVQLLA NRGYAVLQVD FRGSTGYGKA HTQAAIGQFA GRMHDDLIDA LDWAVEQGYT DPDRVAVYGC SYGGYAALVG AAFTPDRFAA AVSYTGMSDL VDLVESVVPF ARRTVENSYL RYIGDPDDPR QRADMLARSP ISRVDDITAP VLLIHGANDV RVHRRNSDRV FDALRSRGAE VEYLLNETEG HWFTNPDSNI ELYGRLERFL ARHLGGRSAT GS
|
| |