Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0451 |
Symbol | |
ID | 9244290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 543583 |
End bp | 544674 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | oxidoreductase domain protein |
Protein accession | YP_003678404 |
Protein GI | 297559430 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.111618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG CACCCGAGGC ACAGAGCACA CCCGAACACG ACGGCCCCGA ACTGCACGTC GCCCTCATCG GTTACGGCAA GGGCGGTGAG GTCTTCCACG CGCCGCTGAT CGACGCGGTT CCGGGACTGC GCCTGTCCGC GGTGGTCACG GGCAACCCCG ACCGCCGCAG GGCCGCCGAG GCGCGCTACC CCGGCGTCAC CGTGTACCCG AACGTCGCCG AACTGTGGGC CGACGCCGAG CGTTACGAGA TCGCGGTCGT CACCACGCCC AACGACACCC ACGCCCCGCT CGCCCAGGCC GCCCTGGAGG CGGGCCTGGC GGTGGTGGTG GACAAGCCCT TCGCCCTCAC CGCCCTCCAG GCCCGGGAGC TCACCGACCT CGCGGACAAG CTGGGCCGGG TGCTCACCGT GTACCAGAAC CGCCGCTGGG ACGCCGACTT CCTCACCCTG TGGGGTCTCA TCGAGGAGGG GAGGCTGGGC CGCGTACACC GCTTCGAGTC GCGGTTCGAG CGCTGGCGTC CCCAGGCCAA GGGCACCTGG CGCGAGAGCG GCGGGGTCGA GGCCGGGGCC GGGCTGCTCT ACGACCTGGG CCCGCACCTG ATCGACCAGG CGGTCAACCT GTTCGGCCCG GTCTCCTCCG TCTACGCCGA GATCGACGCC CGGCGCGAGG GCGTCAACGC CGACGACGAC GTCTTCCTGG CCCTCGACCA CGCCCGGGGC ACGCGTTCCC ACCTGTGGAT GAGCGCGCTC ACCGCCCAGG GCGGGCCGCG TTTTCGGGTG CTGGGGGACG AGGGGGCGTT CACCAGCCAC GGCATGGACG GCCAGGAGGC CCGGCTGACG GCGGGGGAGC GGGCCGACGC CGACGACTGG GGCGTGGTTC CCGAGGCGGA CTGGGGCGTG CTCGGGGTGG ACGGCCACAC CCGGCCGGTG CCCAGCGCGC GCGGCGCCTA CCCGGAGTTC TACGCCGGGG TGCGCGACGC GGTCGGCGAG GGCGAGCCCC TGCCGGTGGA CCCGCACGAG GTGATCCACG GACTGGAGGT CATCGAGGCC GCCCGGCGCA GCGCCCGCAC GCGGTCGGTG GTCGACGTCT GA
|
Protein sequence | MADAPEAQST PEHDGPELHV ALIGYGKGGE VFHAPLIDAV PGLRLSAVVT GNPDRRRAAE ARYPGVTVYP NVAELWADAE RYEIAVVTTP NDTHAPLAQA ALEAGLAVVV DKPFALTALQ ARELTDLADK LGRVLTVYQN RRWDADFLTL WGLIEEGRLG RVHRFESRFE RWRPQAKGTW RESGGVEAGA GLLYDLGPHL IDQAVNLFGP VSSVYAEIDA RREGVNADDD VFLALDHARG TRSHLWMSAL TAQGGPRFRV LGDEGAFTSH GMDGQEARLT AGERADADDW GVVPEADWGV LGVDGHTRPV PSARGAYPEF YAGVRDAVGE GEPLPVDPHE VIHGLEVIEA ARRSARTRSV VDV
|
| |