Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4236 |
Symbol | |
ID | 9248110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5053468 |
End bp | 5054502 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_003682133 |
Protein GI | 297563159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.637642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTGG CGGAGCTGCT GCGGGAGCGC CCGATCGTAC AGGCGCCCAT GGCGGGCGGG GCCGCCACGC CCGCGCTGGT GGCGGCCGTG GCGGGAGCGG GCGGAACGGG TTTCCTCGCC GCCGGGTACC TGGCCCCCGA GGTCCTCGCC GACCAGCTCG GGGCGGTGCG CGACGCCGGG GTCGGCGCGT TCGGGGTGAA CGTGTTCGTG CCGGGCCCGC CCTCCGACCC CGACGTGGCG GCGTCCTACC GTTGCGACCT GGAGTCCGAG GCCGAGCGGT ACGGGACGCC GGTCGGCGCC CCGGTGCACG ACGACGACGC GTGGGCGGCC AAGATCGACC TGCTGGCGCG GGCGGCCGTG CCGGTGGTGA GCTTCACCTT CGGCTGCCCG GAGGCCGCCG TGTTGGAGCG GCTGCGCGCG GCGGGCAGTG CCACGGTGGT CACCGTGACC ACGGTCGGGG AGGCGCGCGA GGCCGTGGCC CGCGGAGCCG ACGGGGTGTG CGCGCAGGGT ACGGAGGCCG GGGGCCACCG CGGCGCGTTC GACCCGGTCG GGAACGGAGG TCTGCCGCTG CGGGAGCTGC TGGCGGACGT GGTCGGCGCG GTGGAGGTAC CGGTGATCGC CGCCGGGGGG ATCATGACCG GGGCCGACGT GGCCGGGGCC CTGGACGCGG GTGCCGCCGC TGTGCAGCTG GGCACGGCGT TCCTGCGCTG TCCCGAGAGC GGCGCCAACC CGGTCCACAA GGCCGCGCTG GCCGATCCCG CGTACACCGG GACGGCCGTG ACGTGGGCTT TCACCGGCCG TCCGGCGCGG GGCCTGGCCA ACAGGTTCAT CGCCGAGCAC CCGCGGAGGC CCTTCGCCTA CCCCGAGATC CACCACATGA CGAAGCCGCT GCGCGCGGCC GCCGCCCGGG CCGGAGACCC CGGCGGCATG GCGCTGTGGG CGGGAGAGGG GTTCCGGGCG GCCAGCGACG ATCCCGCGGC GCTGGTCGTG GAGCGGTTGC GCCGCGAGGC CGCGGAGGCG GGCCGGAAGG TCTGA
|
Protein sequence | MSLAELLRER PIVQAPMAGG AATPALVAAV AGAGGTGFLA AGYLAPEVLA DQLGAVRDAG VGAFGVNVFV PGPPSDPDVA ASYRCDLESE AERYGTPVGA PVHDDDAWAA KIDLLARAAV PVVSFTFGCP EAAVLERLRA AGSATVVTVT TVGEAREAVA RGADGVCAQG TEAGGHRGAF DPVGNGGLPL RELLADVVGA VEVPVIAAGG IMTGADVAGA LDAGAAAVQL GTAFLRCPES GANPVHKAAL ADPAYTGTAV TWAFTGRPAR GLANRFIAEH PRRPFAYPEI HHMTKPLRAA AARAGDPGGM ALWAGEGFRA ASDDPAALVV ERLRREAAEA GRKV
|
| |