Gene Information |
Locus tag | Ndas_1074 |
Symbol | |
ID | 9244920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1320951 |
End bp | 1322606 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Pyridoxal-dependent decarboxylase |
Protein accession | YP_003679022 |
Protein GI | 297560048 |
COG category | |
COG ID | |
TIGRFAM ID | |
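The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; the constant and function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1320951
END_BP = 1322606
GENE_LENGTH_BP = 1656
PROTEIN_LENGTH_AA = 551


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1322606 - 1320951 + 1 gives 1656 bp, and 1656 / 3 - 1 gives 551 aa, matching the table.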
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0179371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.433316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCCC TCGCGCGGTC CGCACCGGGC CCGCTGCACG GGAACGCCGG TGCCCTCGGC ACCAGGCCCA CACCCCCCTC CGGCCAGGAC CCCGACGACC CCCGGACACA GCTGTTCGAC GCCCGGAGCG CCGAACGCTA CCGCGACCTG ACCGGGGAGG CCGCGGCCCG CGTCGCCCGA CGCATCGCCG GAGCCGACCG GCCGCTCACC GGAGCCACCG CCGAGGACCT GCGCCCCCAG ATCGACAAGG TCGACCTCGA CCAGCCGCTC CACGACCCCA CCGCGGCCCT GGACGAACTC GAACGCGTCT ACCTCGACGA CGCCGTCTAC TTCCACCACC CCCGCTACAT GGGACACCTC AACTGCCCCG TGGTCCTGCC CGCCCTGCTC GGCGAGACCG TCCTGTCCGC CGTCAACTCC TCCCTCGACA CCTGGGACCA GAGCGCCGGG GGCACCCTGA TCGAACAGCG GCTCATCGAC TGGACGTGCG AGCGCGTCGG CTTCGGCGAG ACCGCGGACG GCGTCTTCAC CAGCGGCGGC AGCCAGTCCA ACCTCCAGGC CCTGCTCATG GCACGCGACG AGGCCCACCA CCGCGCCAAG GCGCAGGAGG GCCAGGACAC CCGCCTGGCC GAACTCCTGC CCCGGATGCG CGTCCTGACC TCCGAGGCCG GACACTTCAG CGTCGCCAAG TCCGCCGCCC TCCTCGGACT GGGCTACGAA TCCGTCATCA CCGTGGCCTG CGACGACAGA CGCCGCATGC GCCCCGACGC CCTCGCCGCC CAACTGCGCC GATGCCGCGC CGAAGGACTC CTGCCCATAG CCGTCGTCGC CACCGCCGGA ACCACCGACT TCGGCAGCAT CGACCCCCTG CCCCGCATCG CCGACCTGTG CCGACAGCGC GGCGTGTGGA TGCACGTCGA CGCCGCCTAC GGCTGCGGCC TGCTGGTCTC GCGCCACCGC CACCTGCTGG AGGGCGTCGA ACGCGCCGAC TCGGTCACCG TGGACTTCCA CAAGTCCTTC TTCCAACCGG TCAGCTCCAG CGCGATCGTG GTCCGCGACC GCGACGTCCT GCGCCACGTC ACCTACCACG CCGACTACCT CAACTCCCGC TCGGACGGCA GCACCCCCCT GCTCTCCCCC AACCAGGTCG ACAAGAGCCT GCAGACCACA CGCCGCTTCG ACGCCCTCAA ACTGTGGCTC ACCCTGCGCG TCATGGGCGC CGACGGCGTG GGCGCCCTCT TCGACAGCGT CCTGGACCTG GCCGCCACCG CCTGGACCCT GCTCGACGCC GACCCGCGCT TCACCGTGGT CACCCGGCCC AGCCTGAGCA CCCTGGTCTT CCGCTGCGCC GTACCCGGCG CCGACCCCGA CACCGCCGAC GCCGCCCACC GCTACGCACG CGAGGCGCTG CTGGCCTCGG GCCGCGCCTT CGTGGCCCGC ACCACCGTCG ACGGCAGGCC CCACCTCAAA CTCACCCTGC TCAACCCCAG GGCCACCCGG GAGGACGTCG CCGAGGTACT GGACCTGATC GCCGCGCACG TCGACCACTT CATGAACGGA CGCGACATCC CCGACCGCCC GGCCCCACCC GCCCCGACCA CGACCGCGGC CCACGCGGCC TCCGCCCTGC CGACCACCAC CGGGAGGTCC CGTTGA
|
Protein sequence | MSSLARSAPG PLHGNAGALG TRPTPPSGQD PDDPRTQLFD ARSAERYRDL TGEAAARVAR RIAGADRPLT GATAEDLRPQ IDKVDLDQPL HDPTAALDEL ERVYLDDAVY FHHPRYMGHL NCPVVLPALL GETVLSAVNS SLDTWDQSAG GTLIEQRLID WTCERVGFGE TADGVFTSGG SQSNLQALLM ARDEAHHRAK AQEGQDTRLA ELLPRMRVLT SEAGHFSVAK SAALLGLGYE SVITVACDDR RRMRPDALAA QLRRCRAEGL LPIAVVATAG TTDFGSIDPL PRIADLCRQR GVWMHVDAAY GCGLLVSRHR HLLEGVERAD SVTVDFHKSF FQPVSSSAIV VRDRDVLRHV TYHADYLNSR SDGSTPLLSP NQVDKSLQTT RRFDALKLWL TLRVMGADGV GALFDSVLDL AATAWTLLDA DPRFTVVTRP SLSTLVFRCA VPGADPDTAD AAHRYAREAL LASGRAFVAR TTVDGRPHLK LTLLNPRATR EDVAEVLDLI AAHVDHFMNG RDIPDRPAPP APTTTAAHAA SALPTTTGRS R
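The GC content and the protein sequence can both be recomputed from the gene sequence. The sketch below assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and that the 10-base blocks are separated only by whitespace. Translation table 11 assigns the same amino acids to codons as the standard table, so the standard codon layout is used here. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Standard codon layout in NCBI order (T, C, A, G); translation table 11
# uses the same codon-to-amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "ATGAGCTCCC TCGCGCGGTC CGCACCGGGC"
    print(translate(prefix))
    print(round(gc_content(prefix), 3))
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the assumptions above hold, and `gc_content` can be compared against the reported 73%.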