Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0861 |
Symbol | |
ID | 9244706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1057466 |
End bp | 1058755 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | 3,4-dihydroxy-2-butanone 4-phosphate synthase |
Protein accession | YP_003678811 |
Protein GI | 297559837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.451908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0935072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCA CCGAACCCAC CCCGGCCACG ACACCGATCG ACCAGGCCGT CGTCCTCGAC CCGATCGAGG AGGCCGTCGC CGAGTTCGCC GCGGGGCGGG CCATCGTGGT GGTGGACGAC GAGGACCGCG AGAACGAGGG CGACATCATC TTCGCCGCCG AGCTCGCCAC GCCCGAACTC CTGGCGTTCA TGATCCGCTA CACCTCCGGT GTGGTGTGCG TCCCCCTGGA GGGCGGGGAC CTGGACCGCC TCGACCTGCC GCTGATGACC GCGCGCAACG AGGAGAGCCT GCGCACCGCC TACACGGTCA CCGTGGACGC GCGCGAGGGC GTCACCACGG GGATCTCCGC CGCCGACCGG GCCCGCACCA TCCGCCTGCT GGCCGCGGAG GGCAGCGGCC CCGCCGACTT CGTGCGCCCC GGCCACGTCC TGCCGCTGCG CGCCCGCCCG GGCGGCGTGC TGGCCCGGCG CGGCCACACC GAGGCCTCCG TCGACTTCGC CCGCCTGGCC GGTCTGCGTC CCGCGGGCGT GCTCGCCGAG GTCGTCAACG ACGACGGCAC CATGGCCCGT CTGCCCCAGC TGCGCGCGTT CGCCGACGAG CACGGCCTCA AGCTGGTCTC GGTCGAGCAG CTCGCCGCCT ACCGCGAGGC GCTGGGGGAG GCGCTCACCG AGGCCGAGGC CCACCCGCCG CTGGTCTCCC GCGCGGTCCA GACGCGCCTG CCCAACAGGT ACGGCCAGTG GCGCGCGGTC GGGTACCGGG GTACCGCCGA CGGCGCCGAG CACGTCGCGC TGGTGTACGG GGACCTGACC GACGGCACCG ACGTCCTGGC GCGCCTGCAC TCGGAGTGCC TCACCGGCGA CGCGTTCGGC TCCCACCGCT GCGACTGCGG CGCCCAGCTG GACGCCGCCA TGGCCGACAT CGCCGAGGAG GGGCGCGGGG TGCTCGTCTA CCTTGGCGGC CACGAGGGCC GGGGGATCGG TCTGCTGCAC AAGCTGAGCG CCTACAGCCT CCAGGACCAG GGGGCGGACA CCGTGGACGC CAACCTGCGC CTGGGCCTGC CCGCCGACGC GCGCGAGTTC GGCGCCGGGG CGCAGATCCT GGCCGACCTG GGGGTGTCGT CGGTGCGGCT GCTGACCAAC AACCCCGCCA AGGCCGAGGG ACTGGAGCAG CACGGGGTGC GGGTCAAGGA GCGGGTGGCG ATGCCCTCCT TCGTCACCGA GGACAACATC GACTACCTGC GCACCAAGCG CGACCGCATG GGCCACGACC TGACCGGCGT CGTCCGCTGA
|
Protein sequence | MTVTEPTPAT TPIDQAVVLD PIEEAVAEFA AGRAIVVVDD EDRENEGDII FAAELATPEL LAFMIRYTSG VVCVPLEGGD LDRLDLPLMT ARNEESLRTA YTVTVDAREG VTTGISAADR ARTIRLLAAE GSGPADFVRP GHVLPLRARP GGVLARRGHT EASVDFARLA GLRPAGVLAE VVNDDGTMAR LPQLRAFADE HGLKLVSVEQ LAAYREALGE ALTEAEAHPP LVSRAVQTRL PNRYGQWRAV GYRGTADGAE HVALVYGDLT DGTDVLARLH SECLTGDAFG SHRCDCGAQL DAAMADIAEE GRGVLVYLGG HEGRGIGLLH KLSAYSLQDQ GADTVDANLR LGLPADAREF GAGAQILADL GVSSVRLLTN NPAKAEGLEQ HGVRVKERVA MPSFVTEDNI DYLRTKRDRM GHDLTGVVR
|
| |