Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2334 |
Symbol | |
ID | 9246184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2785086 |
End bp | 2786456 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | homoserine dehydrogenase, NAD-binding protein |
Protein accession | YP_003680262 |
Protein GI | 297561288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.670502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.115253 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCCA CCGACGCCAC CACGGCGCCG CCACCGCCCG TGCGCTACGC GCTCACCGGA GCCCGCGGCG GTTTCGCGCG CACCCTGCTC GCCCAGACCC CGCGGATGGA GCGGCTGCGG CCCTCGGTCC TGTGCGACCT GGACACCGAG GGGACGATCG CGCTGTGCGC CGAGCTCGGC TACCCGGCCG ACGCGCTGAC GGTGTGCTCC AGCGCGGACG AGGTGGCCGC GCTGGCCGAC GGGCGGATCG CGGTGATCGC CGACGCCCGC CTCCTCGAAC ACGCCGACTA CGACGTGCTC GTCGAGGCCA CCGGCAGCCC CGAGGCGGGC ACCCGGGCCG CGCGGACCGC CATCGCGGCC GGACGCCACG TGGTCATGGT CTCCAAGGAG GTCGACTCCG TTTCCGGGAT CGCGCTGGCC GACCTCGCCC GGGAACGGGG CGTGGTCTAC ACCCCCGGGA TCGGGGACCA GCCCGCCAAC CTCATCGAGT GGTACGAGCG GACCCGCCTG CTGGGCCTGG ACGTCGTCGC GATCGGCAAG TCGGGCGAGT ACGACCTGGT CTTCGACCCG GCCACGGGCC GCGTGCGCCA GCTCGACCAG GAGATCGACG CCCCCGACCT GGCCGACCTG CTGACCCTGG GCGAGGACGT GCCCGCCACG CTCGCCGCCC GCGCCCGCGC CGTGGCGGGC CTGTCGCGCT CGGCCGCCGC CGACTACTGC GAGATGGGGG TCGTCGCCAA CCACACGGGC CTGGTGCCCG ACACCGAGGA GCTGCACTAC CCCGTCGCGC GCGTCGCCGA ACTCGCCGAC GTCTACTCCC TCGCCGAGGA CGGGGGGATC CTCTCCCGGT CCGGCGCCGT GGACGTCTTC AGCGTGCTCC GCCTGCCCGA GGAGGCCTCC TTCGCCGGAG GGGTCTTCGC CGTCGTCCGC ACCGGCGACC CCGTCTCCTG GGCACTGCTC GCGCAGAAGG GCCACGTGGT CTCGCGCTCG GGACGCTACG CCTGCCTCTA CCTGCCCTAC CACCTCATGG GCGTGGAGAC CCCGCTGAGC CTGCTGGACG CCGTGGACCG CCGCCGCGCG GTGACCCCCC GGCACCCGGC CCCGCACGCG GTCCTGGCCG GGCGCGCCCG GCGCGACCTC CCCGCCGGGA CCAGGCTGGA CATGGGCGGA CACCACCACG ACGTCCAGGG GGTGGCCCCC GTCCTGCTGG ACGCCGCCGA CGCCCCCGAC GACGTCGCAC CGCTCTACCT GGCGGCCCAC GCCGCCCTGG GCCGCGACGT CGCGGCGGGC GCACTGGTGC GCCTGGACGA CCTGGCCGAC GCCAGCGCCC CCCTCCTGGA CGCCTGGCGC CACGCCCGCA CGCACCTGTG A
|
Protein sequence | MQATDATTAP PPPVRYALTG ARGGFARTLL AQTPRMERLR PSVLCDLDTE GTIALCAELG YPADALTVCS SADEVAALAD GRIAVIADAR LLEHADYDVL VEATGSPEAG TRAARTAIAA GRHVVMVSKE VDSVSGIALA DLARERGVVY TPGIGDQPAN LIEWYERTRL LGLDVVAIGK SGEYDLVFDP ATGRVRQLDQ EIDAPDLADL LTLGEDVPAT LAARARAVAG LSRSAAADYC EMGVVANHTG LVPDTEELHY PVARVAELAD VYSLAEDGGI LSRSGAVDVF SVLRLPEEAS FAGGVFAVVR TGDPVSWALL AQKGHVVSRS GRYACLYLPY HLMGVETPLS LLDAVDRRRA VTPRHPAPHA VLAGRARRDL PAGTRLDMGG HHHDVQGVAP VLLDAADAPD DVAPLYLAAH AALGRDVAAG ALVRLDDLAD ASAPLLDAWR HARTHL
|
| |