Gene Information |
Locus tag | Ndas_3123 |
Symbol | |
ID | 9246979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3738866 |
End bp | 3739912 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003681038 |
Protein GI | 297562064 |
COG category | |
COG ID | |
TIGRFAM ID | |
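
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3738866
end_bp = 3739912

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1047 348
```

Both values agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields.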
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATCG TGATGGCGCC GGACGCCACC CCCGAGAACA TCGACGATCT GGTCGAGCTG GTCTCCTCCG CCGGAGGCGA GGCCTACGTG ACGCGCGGGG TGAGCCGCAC GATCATCGGC CTCGTCGGTG ACGTCGAGCG CTTCCAGGAC CTGGGCCTGG CCGCCAAGCC GGGCGTCAGC GACGTCCTGC GCATCTCCGT CCCCTACAAG CTCGTCAGCC GCGAGAACCA CGACAGCCGC ACGGTCGTCA GCGTGCGCGG AGTGCCGATC GGCGGCGACA ACGTCACCGT CATCGCCGGT CCCTGCGCCG TCGAGACCCC CGAGCAGACC CTGGCCGCGG CCCGGATGGC GCTGGAGGCG GGGGCGTCCC TGCTGCGCGG CGGCGCCTAC AAGCCCCGCA CCTCCCCCTA CGCCTTCCAG GGCCTGGGCG AGGAGGGCCT GAGGATCCTC GCCGACGTGC GCGAGGAGAC CGGCCTTCCC ATCGTCACCG AGGTCGTGGA CGCCGCCGAC GTGGAGCTGG TCGCCTCCTA CGCGGACATG CTCCAGGTCG GCACCCGCAA CATGCAGAAC TTCGCCCTCC TCCAGGCGGT GGGGGACGCG GGCAAACCGG TCCTGCTCAA GCGCGGCATG AGCGCCACCA TCGAGGAGTG GCTGATGGCC GCCGAGTACA TCGCCCAGCG CGGCAACCTC GACATCGTCC TGTGCGAGCG CGGCATCCGC ACCTTCGAGA AGGCCACCCG CAACACCCTG GACATCAGCG CGGTCCCGGT CGCCCAGAAC CTGTCCCACC TGCCGGTGAT CGTCGACCCG TCCCACTCGG GCGGCAAGCG CGAGCTGGTG CTGCCGCTCT CGCGCGCGGC CGTGGCGGTC GGCGCGGACG GCGTCATCGT CGACGTGCAC CCCCACCCGG AGAACGCCCT GTGCGACGGC CCGCAGGCCC TCCTGCACGA GGACCTGGCC GAGCTGCGCG ATCTGGCGGG CACCCTGGCC GCCCTGACCG GCCGCACCCT GACCCTGGCG CCGGGCCGCG AACTGGCCGG TCTGTGA
|
Protein sequence | MVIVMAPDAT PENIDDLVEL VSSAGGEAYV TRGVSRTIIG LVGDVERFQD LGLAAKPGVS DVLRISVPYK LVSRENHDSR TVVSVRGVPI GGDNVTVIAG PCAVETPEQT LAAARMALEA GASLLRGGAY KPRTSPYAFQ GLGEEGLRIL ADVREETGLP IVTEVVDAAD VELVASYADM LQVGTRNMQN FALLQAVGDA GKPVLLKRGM SATIEEWLMA AEYIAQRGNL DIVLCERGIR TFEKATRNTL DISAVPVAQN LSHLPVIVDP SHSGGKRELV LPLSRAAVAV GADGVIVDVH PHPENALCDG PQALLHEDLA ELRDLAGTLA ALTGRTLTLA PGRELAGL
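
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. The following is a minimal Python sketch, not the method used to build this record. It assumes the gene sequence is given in coding orientation and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that separate the 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGGTCATCG TGATGGCGCC GGACGCCACC"
print(translate(first_blocks))  # MVIVMAPDAT
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_percent` can be compared with the GC content field. The sketch does not handle alternative start codons, which table 11 permits; this gene begins with ATG, so that case does not arise here.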