Gene Information |
Locus tag | Ndas_2888 |
Symbol | |
ID | 9246739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3447101 |
End bp | 3449137 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003680805 |
Protein GI | 297561831 |
COG category | |
COG ID | |
TIGRFAM ID | |
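The coordinate and length fields above agree with one another, and a short sketch shows the arithmetic. It assumes the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length. The function names are illustrative only and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(3447101, 3449137)
print(length_bp)                  # 2037
print(protein_length(length_bp))  # 678
```

Both values match the Gene Length and Protein Length fields of this record.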
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0760896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGTCCGTT CCTGGTATGA ACTGCCGGAG TACGTCCGAC TTCGCCGTGT CAACGGACTG CGCCTCTCGC CGGACGGAAC CCGTCTGGTC GCGCCCGTCA GCGGCCCCGC CCCCGACGCC AAGTCCTTCC GCAGCGCCCT GTGGGAGATC GACACCGCCC CCGAGTCCGA GGGCGGGCGC GCCCCCAGAC GCCTCACCCG CTCGGCCAAG GGCGAGAGCG GCGTCGGCTT CCTCCCCGAC GGCTCTGTCC TGTTCACCAC CGGCCGCCCC GACCCCGAGG CCGAGGCCGA CGCCAAGGCC CGCACCGCCC TGTGGCTCCT GCCCGCAGAC GGCGGCGAGG CCCGCCAGGT CGCCTCCCGC CCCGGCGGCA TCGGCTCCTT CACGGTCGCC CGCGACTCCG GACTGGTCGC CCTCACCGCC GACACCCTCC CCCGCAGCGA GGACGAGGAG GCCGACAGGG AGGCGCGCAA GGCCCGCGAG GAGGCCGGGG TCACCGCCAT CCTGCACGAG GCGCTGCCCG TGCGCTCCTG GGACAGCGAC ATCGGCCCCG GACACCCCCG CTACCTGGTC GCCGAACCGC CCGCGGACGA GGACTCCAGG CTCGGCGACC CGCGCGACCT GACCCCCGAC GCGGGAATGG CGCTCCTGGG CGCGAGCGGC GATCTCACCC CCGACGGCAC CACCCTGGTC ACCGAGTGGC AGGTCCCCGT CGGCCGGGGC GCCCGGCGCA CCGACGTCGT CGCCATCGAC ACCGCCACGG GCCGGCGCCG CACCCTGGCC ACCGACCCCG CCCACGACTT CGACAGCCCG CTCGTCTCCC CCGACGGCCG CCACGTCCTG CTCGTGCGCA GCTCCCAGGG CGACTACGAC GGCGAACCCC GGGACGAGAC CGCCTGGCTG GTCGACCTGG CCACCGGACA GGGCCGCGAC CTGCTCGCCG ACCACGAGCT GTGGCCCCGC GAACTCGCCT GGGCCGCCGA CTCCGGCGCG GTGTTCCTCG TCGCCGACCA CGGCGGCCGC CGGCCCGTCT TCCGGATCGA CCTGGCCAGC GGCGCGCTCA CCCGCGTCAC CGGCGACCAC GGCGCCTACA GCAACCTCAA CCCGTCCCCC GACGGCCGCC ACGTCTACGC CCTGCGCGAC GCCTGGGACG CCCCGCCCGC GCCCGTGCGC CTGGCCGCCG ACGCCGCCGA CGGCCAACCC GTGCACCTGC GCACCCCCGG CTCCGAGCTC ACCATGCCCG GCACCCTCAC CGAGATCGAG GCCACCGCCG ACGACGGCAC CCCGATCCGC TCCTGGCTGG TGCTGCCCGA GGAGGCCTCG GCCGACTCGC CCGCCCCGCT CATGCTCTGG GTCCACGGCG GCCCCTACAT GAGCTTCAAC GGCTGGTCCT GGCGCTGGAA CCCGTGGCTG CTCGCCGCGC GCGGCTACGC CGTCCTGCTG CCCGACCCCG CCCTGTCCAC CGGCTACGGC CAGGACATGC TCCGCCGCGC CTGGGGCCAG TGGGGGCCGC GCACCTTCGC CGACGTCATG GCCGTCACCG ACGCCGCCGA GGCGCGCGAG GACATCGACG CCGAGCGCAC CGCCATGATG GGCGGCTCCT TCGGCGGCTA CATGGCCAAC TGGATCGCCG GGCACACCGA CCGGTTCAGG GCCATCGTCT CCCACGCCTC CCTGTGGGGC CTGGACGGCT TCAACGGCAC CACCGACTAC CCGCCGGTGT GGGAGCGCGA GTTCGGCACC CCGCTGGAGC GCCCCGAGCG CTACACCCTC AACTCCCCGC ACCTGCACGC GGACCGCATC 
CGCACCCCGA TGCTCGTCAT CCACGGCGAC AAGGACTACC GGGTGCCCAT CTCCGAGGGC CTGCGCCTGT GGCGCGACCT GATGCTGCAC GAGGTGGACG CCAAGTTCCT GTACTTCCCG GACGAGAACC ACTGGATCCT CACCCCGGGG AACGCCCGGA TCTGGTACGA GACGGTCTTC GCCTTCCTCG ACCACCACGT GCACGGCAAG GAGTGGAGCA GGCCCGAACT GCTCTGA
|
Protein sequence | MVRSWYELPE YVRLRRVNGL RLSPDGTRLV APVSGPAPDA KSFRSALWEI DTAPESEGGR APRRLTRSAK GESGVGFLPD GSVLFTTGRP DPEAEADAKA RTALWLLPAD GGEARQVASR PGGIGSFTVA RDSGLVALTA DTLPRSEDEE ADREARKARE EAGVTAILHE ALPVRSWDSD IGPGHPRYLV AEPPADEDSR LGDPRDLTPD AGMALLGASG DLTPDGTTLV TEWQVPVGRG ARRTDVVAID TATGRRRTLA TDPAHDFDSP LVSPDGRHVL LVRSSQGDYD GEPRDETAWL VDLATGQGRD LLADHELWPR ELAWAADSGA VFLVADHGGR RPVFRIDLAS GALTRVTGDH GAYSNLNPSP DGRHVYALRD AWDAPPAPVR LAADAADGQP VHLRTPGSEL TMPGTLTEIE ATADDGTPIR SWLVLPEEAS ADSPAPLMLW VHGGPYMSFN GWSWRWNPWL LAARGYAVLL PDPALSTGYG QDMLRRAWGQ WGPRTFADVM AVTDAAEARE DIDAERTAMM GGSFGGYMAN WIAGHTDRFR AIVSHASLWG LDGFNGTTDY PPVWEREFGT PLERPERYTL NSPHLHADRI RTPMLVIHGD KDYRVPISEG LRLWRDLMLH EVDAKFLYFP DENHWILTPG NARIWYETVF AFLDHHVHGK EWSRPELL
|
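The protein sequence can be reproduced from the gene sequence with translation table 11, as the following minimal sketch illustrates. It assumes the gene sequence is listed 5' to 3' on the coding strand, which fits the record: the sequence begins with the alternative start codon TTG and ends with the stop codon TGA. The helper name `translate_cds` is hypothetical. The codon table is built from the standard amino acid string published for NCBI table 11, and an initiating start codon is read as methionine.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # the initiating codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First eight codons of the gene sequence in this record.
print(translate_cds("TTGGTCCGTT CCTGGTATGA ACTG"))  # MVRSWYEL
```

The output matches the first eight residues of the protein sequence above. Passing the full gene sequence, with its embedded spaces and line break, should yield the complete 678-residue protein under the same assumptions.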