Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2412 |
Symbol | |
ID | 3832162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2532805 |
End bp | 2533869 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637830331 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_431237 |
Protein GI | 83591228 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000535392 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATCG TCATGAAGCC GGGGGCCAGC CGGGAACAGA TTAACGCCGT GAGCGAGCGC CTGGTAGAGC TGGGCTTTAA GACCCACCCC ATCTACGGTC AGGAAAAGAC CGTCATCGGC GCCATCGGCG ACAAAAAAGC CCTGAGTTCC GAGGCCATCA TCAACCTGCC CGGCGTCGAA AAGATCGTTC CCATAATGAA GCCCTACAAA CTGGTGAGCC GCGAACTCAA GGACACGCCG ACCATCGTCC GCATCGGCGG TGTCCCGGTA GGAGGGCGGG GCCTGGTGGT CATGGCCGGC CCCTGCGCCG TCGAGGGCGA AGAGCAACTC CTGGAAGCTG CCCGGGCCGT CAAAGCCGCC GGGGCCCAGG TCTTGCGCGG CGGGGCCTTT AAACCGCGTA CCTCCCCCTA TGCCTTCCAG GGTTTGGAGG AAAAGGGGCT TAAAATGCTA GCCCGGGTCC GGGAGGAGGT CGGCCTGCCC TTCATTACGG AAGCCGTGGA TACCAGGGAT GTCCCTCTGG TGGCGGAATA CGCCGACGCC ATCCAGATCG GCGCTCGTAA CATGCAGAAC TTTCGCCTCC TCCAGGAAGC CGGGGCCACG GGTAAACCCA TCCTGTTGAA ACGGGGCCTG GCGGCCACCA TCGAGGAATG GCTCATGGCG GCCGAGTATA TCCTCGATAG CGGTAATCCC AACGTCATCC TGTGCGAGCG GGGGATCCGC ACCTTTGAAA CCTCCACCCG CTTTACCTTG GACCTGGCGG CCATCGCCGT GGTCAAAGAA AACTCCCACC TGCCCGTAAT CGTCGATCCC AGCCACGGCA CGGGCAGCTG GCGCCTGGTC CTGCCTATGG CCAGGGCGGC CGTGGCCGCC GGCGCCGACG GCCTCATTAT CGAAGTCCAT CCCGACCCGG CCCGGGCCCT CTGCGACGGC CCCCAGTCTC TGCACCCTGA GACCTTTGAC CGCCTGATGG GCGAACTTGC ACCGGTGGCC CTGGCTGTCG GTCGCGGCCT GGCCCTTAAT GGTCTTTTCC CGGAACAGGC CGGCACCCTG GCAGCAGGCA AGTAA
|
Protein sequence | MIIVMKPGAS REQINAVSER LVELGFKTHP IYGQEKTVIG AIGDKKALSS EAIINLPGVE KIVPIMKPYK LVSRELKDTP TIVRIGGVPV GGRGLVVMAG PCAVEGEEQL LEAARAVKAA GAQVLRGGAF KPRTSPYAFQ GLEEKGLKML ARVREEVGLP FITEAVDTRD VPLVAEYADA IQIGARNMQN FRLLQEAGAT GKPILLKRGL AATIEEWLMA AEYILDSGNP NVILCERGIR TFETSTRFTL DLAAIAVVKE NSHLPVIVDP SHGTGSWRLV LPMARAAVAA GADGLIIEVH PDPARALCDG PQSLHPETFD RLMGELAPVA LAVGRGLALN GLFPEQAGTL AAGK
|
| |