Gene Information |
Locus tag | Ndas_1531 |
Symbol | |
ID | 9245381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1875287 |
End bp | 1877467 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003679466 |
Protein GI | 297560492 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.303438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGTCCA ACCGAGCCCC CGGAGCCGCT CTCGTCACCG AGTCCGCGCC CGAGCCCGAG CCCGCCGTCG ACGTCCTGCA CGGCCGACCC GTGCCCGACC CCTACCGGTG GCTGGAGTCC GCCGACTCCC CCCGGACACT GCGCTGGCTG GAGGAGCGGG GCCGCGAGTA CGCTGCCGAG GCGCTCACCT GGCCCCTGCG CGACCGCCTG GCCGGGCTCA TCCGCTCCCT GGTGGACACC GACCTGTGGA GCCCGCCGGT GCACCGGCCC GGCGCGGTGT TCGCCACCGT GCGCCCGGCC GGAGGCGAGC ACCCCCGCCT GGTGGTCCTG AGCGGGGACG GCCCGCCGCG CACCGTCTAC GACCCCGTGG CCGAGGACCC CGCCGGACAC ACCACCCTGG ACGCCTGGGA ACCCTCCCCC GACGGGGGGC TGGTCGCCGT GCAGACCTCG TCGGGGGGCG TGGAGCGCGG TGCCCTGCGA GTGATCGACA CGGCCAGCGG GGAGCCGGTG GGCCCCGCCG TGCCCGGAGT GCGCTACTCC CATGTGGCCT GGGCGCCCCG GGGCGTCCCC GCCTTCTACT ACGTGCGCCG CGACGGCGCC AGGGGCGTGC GCGGCGTGTG GCTGCGCCGG GTCCTCGACG GCACCGAGAC GCTGGTCCAC GCCTGCGCGG CGTCCGGAAC CGTGCCCGGG GTGCGCGTCC TGGGGAGCCG GTGGCTGCTG GTGTCCGAGA GCCACGGCAC CGGGCACCGC ACCGACCTGT GGTTGGCCGA CCTGACCGGC CCGCCCCGAC CCGCGGCTCC GGCCGACCCG AGCGCCCCGA CCTGTGAGAC CGGCCCGGCC GACCTCCCCG AGCGGCCCCC GCTGCGGCCC GTCCAGGTGG GCGAGGAGGC CGAGACCGAG GCCCGGCTGG GTCCGGACGG CCTCCTGTAC CTGCGCACCA CCCTGGGCGC GCCCTGGCGG CGGATCTGCG CGGTCTCCCC CGAGGAGCCG GGGGTGGGGC ACTGGCGCGA GGTGGTGTCC GAGGAGGACG GGGCGACCCT GGACGCCTTC GCGCCCTGCC CCGCCGGACC CGAGGGCGGC ACCGCCCTGT TCGTCGCGCG CACCCGCCTG GGGATCAGCG CCGCGGCCGT CCACGACACC CGCACCGGCG GGCTCCTGTA CACGGTGGAC CTGCCCGGGG AGGGGATGGT CTCCGCGCCG GAGACGGCGC CCGACGGGTC GGTGTACCTG GGGTACGCGG ACGTGGCCAC CCAGCAGCGG GTCCTGCGAC TGGCCCCCGG GGAGCGCACC CCGCGCCCCT GGCCGCCCGG CGCCGCCGGG GCCCCGCCCT CCCCGGACGT GGAGCGCTCG GTGGTCTGGT GCCGCTCCGC GGACGGCACG CGGGTACCGG TGACCGTGTT CACCGCCGCG GGCGGCGACC ACGCCGCGCC CGGCAGCGGG CTGATCCCCG AGCCGACCCT CCTGCACGCC TACGGCGGCT TCGGGCGCCC GCGCCAGTTC GGGTTCAGCG CGACCGTGCT GGCCTGGCTG CTCTCGGGCG GCCGGTACGC CGTCGCCCAC GTGCGCGGCG GCGGGGACGC GGGCCGCGAC TGGCACCTGG CCGGTTCGGG CCGCGACAAG CCGCGCGCCG TGGAGGACCT GGTGGCCGCG GCGGCGGCCC TGGTCGGAGC GGGGATGTGC ACCCGCGAAC AGCTGTGCCT GTCGGGCGGC TCGGCGGGCG GGCTGCTCGT CCTGGCCGCC GCGACCGCGC GCCCGGACCT GTGCGGGGCG GTGATCGCCT CCGCGCCGCT GGCGGACATG 
GCCCGGTTCG AGCGGATGGG CCTGGGCCGG ATGTGGACCC GCGAGTTCGG CACCGCCGCC GACCCCGACG ACTTCGCCGC CCTGATGTCC TACTCCCCCT ACCACCGGGC GCTGGAGGGC GCCGGGGACG GGCCGGGACG GCGCTTCCCC AGCGTCCTGC TCACCGGCTT CCACGGTGAC ACCCGCACCG ACGCCGCCCA CCCGCGCAAG ATGTGCGCGG CGCTGCTGGC GGCGGCCGCG CGGGAGGAGG GGCGCCCGCC CGTCCTGCTC CGGTACGAAC GCGACGTGGG CCACGGCCCG CGCGCGGTCA GCCGGGCGGT GGGACTGGCC GCCGACGCGC ACGCGTTCGC GGCCCACCGG ACGGGGCTGT CGCCGCGCTG A
|
Protein sequence | MVSNRAPGAA LVTESAPEPE PAVDVLHGRP VPDPYRWLES ADSPRTLRWL EERGREYAAE ALTWPLRDRL AGLIRSLVDT DLWSPPVHRP GAVFATVRPA GGEHPRLVVL SGDGPPRTVY DPVAEDPAGH TTLDAWEPSP DGGLVAVQTS SGGVERGALR VIDTASGEPV GPAVPGVRYS HVAWAPRGVP AFYYVRRDGA RGVRGVWLRR VLDGTETLVH ACAASGTVPG VRVLGSRWLL VSESHGTGHR TDLWLADLTG PPRPAAPADP SAPTCETGPA DLPERPPLRP VQVGEEAETE ARLGPDGLLY LRTTLGAPWR RICAVSPEEP GVGHWREVVS EEDGATLDAF APCPAGPEGG TALFVARTRL GISAAAVHDT RTGGLLYTVD LPGEGMVSAP ETAPDGSVYL GYADVATQQR VLRLAPGERT PRPWPPGAAG APPSPDVERS VVWCRSADGT RVPVTVFTAA GGDHAAPGSG LIPEPTLLHA YGGFGRPRQF GFSATVLAWL LSGGRYAVAH VRGGGDAGRD WHLAGSGRDK PRAVEDLVAA AAALVGAGMC TREQLCLSGG SAGGLLVLAA ATARPDLCGA VIASAPLADM ARFERMGLGR MWTREFGTAA DPDDFAALMS YSPYHRALEG AGDGPGRRFP SVLLTGFHGD TRTDAAHPRK MCAALLAAAA REEGRPPVLL RYERDVGHGP RAVSRAVGLA ADAHAFAAHR TGLSPR
|
| |
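The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1875287
END_BP = 1877467
GENE_LENGTH_BP = 2181
PROTEIN_LENGTH_AA = 726


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption stated above).
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("CDS matches protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks hold for this record: the range 1875287 to 1877467 spans 2181 bp, and 2181 bp corresponds to 727 codons, that is, 726 residues plus the stop codon.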