Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0880 |
Symbol | |
ID | 4810498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1056027 |
End bp | 1057043 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640106296 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_001037307 |
Protein GI | 125973397 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000574615 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTATCG TTATGAGTCC AAATGCTACA AAAGAGCAGA TTGAAAATGT GGAAAAAAAA CTTTTGGAGC TGGGTTTTAA AACTCATCCC ATAGTCGGAG ACGTAAAAAC GGTAATTGGG GCTATCGGAG ACAAAAGACT TCTCAATACC CACTCCATAT CCACCATGCC CGGAGTTGAA AGCATTGTTC CAATCATGAA ACCTTACAAG CTGGCCAGCA AAGAACTAAA GCAGGAACCA ACCATTGTTG AGGTAGGCGA TGTACGAATT GGTGGCAATG AAGTAGTGGT TATGGCCGGC CCCTGTGCAA TTGAAAACGA AGAAATTTAT GTCGAAACAG CCAAAAAGGT TAAAGAGGCA GGAGCCAAAA TACTCCGCGG CGGTGCTTTC AAGCCCCGTA CATCTCCTTA TTCTTTCCAA GGTTTGGAAG AAGAAGGCCT CAAAATAATG GCCATTGCCC GGGAAGTAAC GGGACTTAAG CTTGTCACCG AAGTTGTGGA CACAAGAGAT GTGGAACTTG TCGCATCTTA TACAGACATC ATCCAAATCG GTGCAAGAAA CATGCAAAAC TTCAGGCTGC TTAAAGAGGT CGGAATGTCC AATAAGCCCG TACTCCTAAA AAGAGGACTG GCTGCAACCA TTGAAGAATG GTTAATGGCC GCCGAATATA TTATTTCCGA GGGTAATCCC AATGTAATAC TTTGCGAACG AGGCATCCGA ACCTTCGAGA CAGCCACAAG GAACACCATT GACATGAGCG CCATTCCGGT AATAAAAGAG CTGTCCCATT TGCCGATAGT GCTTGACCCC AGCCATGCGG CAGGTACCTG GAAATATGTT GAGCCTCTTG CAAAAGGCGC AATAGCAACC GGAGCCGACG GTTTAATCAT TGAAGTCCAC AGCCAGCCTG ACTGTGCTCT CTGTGACGGT CAACAGTCTT TGATACCTTC AAGGTTCGAA CAGCTTATGA AGGATCTTGA GCCTATAGCT CTTGCAGTGG GAAGAAAACT ATTGTAA
|
Protein sequence | MIIVMSPNAT KEQIENVEKK LLELGFKTHP IVGDVKTVIG AIGDKRLLNT HSISTMPGVE SIVPIMKPYK LASKELKQEP TIVEVGDVRI GGNEVVVMAG PCAIENEEIY VETAKKVKEA GAKILRGGAF KPRTSPYSFQ GLEEEGLKIM AIAREVTGLK LVTEVVDTRD VELVASYTDI IQIGARNMQN FRLLKEVGMS NKPVLLKRGL AATIEEWLMA AEYIISEGNP NVILCERGIR TFETATRNTI DMSAIPVIKE LSHLPIVLDP SHAAGTWKYV EPLAKGAIAT GADGLIIEVH SQPDCALCDG QQSLIPSRFE QLMKDLEPIA LAVGRKLL
|
| |