Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0428 |
Symbol | |
ID | 9244267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 518135 |
End bp | 519190 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003678381 |
Protein GI | 297559407 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.212053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGCA CCAACGACAC CCGGGTGGCC AGTTACAAGC CGCTCATCGC GCCCCGGGAC CTGCTGGCCG AGCTCCCGCT GGGCGCCGAG CGCAGCGCCC TGGTCGAGGA GTCCCGCGCC GAGGTCAAGC GGGTGCTGGA CGGAGAGGAC GACCGCCTGC TGGTCGTCGT GGGCCCCTGC TCGGTGCACG ACCCCGAGTC CGCCATGGAC TACGCCCGCC GCCTGAAGGA GCTGGTGCCC TCCCTGCGCG ACGAGCTGTG CGTCGTGATG CGCGTGTACT TCGAGAAGCC GCGTACCACC GTCGGCTGGA AGGGCCTGAT CAACGACCCC GGCCTGGACG ACACCTACGA CGTGCACCGG GGCCTGCGCA CCGCGCGCAA GCTGCTGCTG GACATCAACG CGCTCGGGCT GCCCGCCGGG ACCGAGTTCC TCGACCCCAT CACCCCGCAG TACATCGCCG ACGTCGTCTC CTGGGGCGCG ATCGGGGCGC GCACCACCGA GAGCCAGGTG CACCGCCAGC TGAGCAGCGG CCTGAGCACG CCCGTGGGCT TCAAGAACGG CACCGACGGC GACGTCCAGG TGGCCGTGGA CGCGGTCGGC GCCGCCGCCG CCTCGCACAC CTTCTTCGGT ATCGACCCCA CCGGCGCCGG TTCGGTGGTC GTCACCGAGG GCAACCCGGA CTGCCACGTC ATCCTGCGCG GCGGCCGCAG CGGTCCCAAC TTCGGCGCCG GGCAGGTCAA CGCCGCCCTG GACGTCATCG AGGGCGCCGG GCTGCCCCGC CGCCTGATGA TCGACGCCAG CCACGCCAAC AGCGGCAAGG ACCACACCCG CCAGCCCGGT GTGGCCGCCG CGATCGCCGC CCAGGTCGCC GAGGGCCAGA CGGGCGTCAT CGGCGTGATG CTGGAGAGCT TCATCGTCGA GGGGGCCCAG AAGCTGGGCG ACCCCGCCGC CCTGACCTAC GGCCAGTCGA TCACCGACAA GTGCATGGGC TGGGAGACCA CCGGCGAGGT GCTGACCCAG CTGGCCGAGG CCGTGCGCCA GCGCCGCAAG GCCTAG
|
Protein sequence | MPSTNDTRVA SYKPLIAPRD LLAELPLGAE RSALVEESRA EVKRVLDGED DRLLVVVGPC SVHDPESAMD YARRLKELVP SLRDELCVVM RVYFEKPRTT VGWKGLINDP GLDDTYDVHR GLRTARKLLL DINALGLPAG TEFLDPITPQ YIADVVSWGA IGARTTESQV HRQLSSGLST PVGFKNGTDG DVQVAVDAVG AAAASHTFFG IDPTGAGSVV VTEGNPDCHV ILRGGRSGPN FGAGQVNAAL DVIEGAGLPR RLMIDASHAN SGKDHTRQPG VAAAIAAQVA EGQTGVIGVM LESFIVEGAQ KLGDPAALTY GQSITDKCMG WETTGEVLTQ LAEAVRQRRK A
|
| |