Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0845 |
Symbol | |
ID | 7407420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 936535 |
End bp | 937548 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643715223 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002572733 |
Protein GI | 222528851 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTATTG TAATGAAAAA GGACTGTAAA GAGTCAGATA TTGAAGAAGT TGTAAAGCTT ATCACATCAC TGGGCCTTCG GCCCCACATT TCACAGGGCG TTGAAAGAAC TGTGATTGGT GTCATTGGCG ATGAGAGAAT TCTGTCTGAT GTTCCTGTTG AGCTTTTACC AGGTGTTGAT AGAATAATAC CAATCCTTGA GAGCTACAAA CTTGCAAGCA GAACATTCAA ATCCGAACCT ACAGTTGTAA AAATCAAAGA TGTAGAAATT GGCGGAGATA CACTTACGCT CATTGGCGGT CCTTGTGCAA TTGAAAGTTA TCAGCAGATG TTTGAAGTTG CAGAAAAGAT TAAAAGAAGC GGTGCAAAGA TTCTACGTGG CGGAGCTTAT AAGCCAAGAA CATCTCCATA TTCTTTCCAG GGTCTTGAGG AAGAAGGATT GAAAATCCTC AAAGAAGCTG CACAAAAGTA TGATCTTTTA GTAATAACAG AGGTAATAAG TGAAAGCGCT GTTGACAGAG CGTATGACTA TGTAGATATT TTTCAAATTG GTGCAAGAAA TATGCAAAAT TTTAACTTGC TAAAATATGT AGGAAGACAA GACAAGCCAG TTCTTCTCAA GAGAGGGCTT GCAGCAACAA TTGAAGAGTG GCTGAATGCT GCTGAATACA TTTTAAGCGA AGGAAATCCA AATGTTATTT TATGCGAAAG AGGAATAAGG ACATTTGAGA CAGCAACAAG AAACACTTTG GACATTTCAG CAATCCCTGT TGTAAAGGAA AAAAGTCATC TTCCAATAAT AGTTGACCCG AGCCATGCAG CAGGGAAAGC AAAGTATGTT CCAGCACTTT CAAAAGCGGC AATTGCAGCA GGAGTTGATG GGCTTATGAT TGAGGTCCAT CCAAACCCGC AAAAGGCTTT ATCTGACGGA CCTCAGTCAA TCACACCTGA GGAGTTTGAC AAACTTGTAA AAGAGATATC TTTAATAGCA AAATCAATTG GAAAGAGTGT ATAG
|
Protein sequence | MIIVMKKDCK ESDIEEVVKL ITSLGLRPHI SQGVERTVIG VIGDERILSD VPVELLPGVD RIIPILESYK LASRTFKSEP TVVKIKDVEI GGDTLTLIGG PCAIESYQQM FEVAEKIKRS GAKILRGGAY KPRTSPYSFQ GLEEEGLKIL KEAAQKYDLL VITEVISESA VDRAYDYVDI FQIGARNMQN FNLLKYVGRQ DKPVLLKRGL AATIEEWLNA AEYILSEGNP NVILCERGIR TFETATRNTL DISAIPVVKE KSHLPIIVDP SHAAGKAKYV PALSKAAIAA GVDGLMIEVH PNPQKALSDG PQSITPEEFD KLVKEISLIA KSIGKSV
|
| |