Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0904 |
Symbol | |
ID | 8427843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 914912 |
End bp | 915931 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645033247 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003190421 |
Protein GI | 258514199 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000034525 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000051427 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTATAG TTATGTCTAT TGAATCCACA GACGCACAGA TAGAGGCAGT TATAGAGAAA TTAGATGGTC TGGGTTTTAA GACACAGGTC ATTCGCGGGG TAAAGAGAAT AGTTATAGGT GCGGTAGGTG ACCGGCAGGC GCTCGACAGC GTTAGTCTGA AACAAATGCC AGGTGTGGAG GATATTGTTA AAATTATGAA GCCTTTCAAA ATGGTCAGCA GAGAAGCCAG GGAAGAAAAC ACGGTTATTA ATATACGCGG TATCAGCATA GGTGGAGAGG GCGTTGTTGT TATGGCCGGT CCCTGTGCGG TGGAAAGCAG GGAACAGCTT TTGACCGCCG CCCGGCAGGT TAAAGCTGCG GGTGGTCATG TCCTGCGTGG CGGTGCTTTC AAACCACGAA CATCTCCTTA CAGTTTCCAG GGTATGGAGG AGGAGGGTTT AAAGCTTTTA AAAGAAGCTT CGGAGGAAAC AGGTTTGCCA ACGGTGACGG AAGTTATTGA TGAACACAGT TTGCAGCTGG CGCATGATTA TGTAGACATA ATACAAATCG GTGCCAGAAA TATGCAAAAT TTCCGTTTGC TCAGGGCGGC CGGACAGACA GATAAAATTA TTTTACTGAA AAGAGGATTG TCCGCTACCA TAGAGGAATG GTTAATGTCA GCCGAGTATA TTATGTCAGA GGGCAATGGC AAGATTATTC TCTGCGAGAG AGGTATCCGT ACTTTTGAAA CCTATACCCG CAATACTCTC GATCTCAGTG CCGTTCCGCT GGTTAAGAGG CTCAGTCACC TGCCGGTAAT TGTTGATCCC AGCCATGCCA CAGGTGACAG GCAGCTGGTT GTGCCTATGT CTCTGGCGGC TGCGGCGGGG GGTGCGGACG GTCTGATAGT GGAAATGCAT CCGGAACCGA GTAAAGCTCT TTGTGACGGT GCACAGTCTC TGCACCCGGG GGAACTGGTT GGCTTGATAG CTAAATTAAC AAAAATGATG CCTGCGATAG ATCGCACTAT GTCAGTTTAA
|
Protein sequence | MIIVMSIEST DAQIEAVIEK LDGLGFKTQV IRGVKRIVIG AVGDRQALDS VSLKQMPGVE DIVKIMKPFK MVSREAREEN TVINIRGISI GGEGVVVMAG PCAVESREQL LTAARQVKAA GGHVLRGGAF KPRTSPYSFQ GMEEEGLKLL KEASEETGLP TVTEVIDEHS LQLAHDYVDI IQIGARNMQN FRLLRAAGQT DKIILLKRGL SATIEEWLMS AEYIMSEGNG KIILCERGIR TFETYTRNTL DLSAVPLVKR LSHLPVIVDP SHATGDRQLV VPMSLAAAAG GADGLIVEMH PEPSKALCDG AQSLHPGELV GLIAKLTKMM PAIDRTMSV
|
| |