Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2071 |
Symbol | |
ID | 7310773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 2429744 |
End bp | 2430757 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643609004 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002506396 |
Protein GI | 220929487 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000173491 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTATTG TTATGAAGCA AAGTGCAACG GTAAAGGACA TCGAGAGCGT TGAAAGCAGG CTGCTTGACC TGGGTTTTAA AACTCACCCA ATTCACGGAG ATATAAAAAC TGTAATAGGA GCAATTGGCG ATAAAAGACT TCTTAATGTT CACGCAATAT CCCAGATGCA GGGTGTGGAA ACACTCGTCC CTATTATGAA GCCATATAAA TTTGCAAGTA GTGAATTACA GCATACACCT ACAATTATAG ATGTGGGGGG CGTACAAATA GGTGGTAAAG AAATTGTTGT AATGGCCGGG CCGTGTGCAA TTGAAAATGA AAAGGACTTT GTTGATACTG CAATCAGTGT TAAGAAAAGC GGAGCTAAAA TCCTAAGAGG TGGTGCTTTC AAGCCAAGAA GTTCTCCTTA TGCTTTTCAA GGACTTGAAG AAGACGGTCT TAAGATAATG GTTGCAGGCC GTGAGGCAAC TGGCCTTAAA CTGGTTACCG AGGTTGTGGA TACCAGAGAT GTTGAGCTTG TAAACAAGTA TACGGATATA TTTCAGATAG GGGCACGGAA TATGCAGAAT TTCAGGTTGT TAAGTGAGGT TGGAATGACC CGCAAGCCTG TTCTTCTAAA AAGAGGTCTA TCTGCTACTA TTGAAGAGTG GCTTATGGCT GCTGAATATA TAATTGCAGA GGGAAATCAT GAGGTTATTC TTTGCGAAAG AGGCATTCGT ACTTTTGAAA CTATGACAAG AAATACTCTT GATTTGAGTG CAATACCTGC AATAAAAGAT GTTTCACATC TTCCTGTTGT TGTTGATCCC AGTCATGCCA CAGGCAACTG GAAATATGTC CCTGCACTTG CAAAGGGAGC AGTAGCTACT GGTGCAGACG GTCTTATAAT TGAAGTTCAT CCAAATCCTC CAAGTGCATT ATGTGACGGT CCTCAATCAC TGAGGCCCGA AAGATTTGAA AGTCTTATGG ATGAATTGAG ACTGGTAGCC CAGGCAATCC AAAGAACTAT TTAA
|
Protein sequence | MIIVMKQSAT VKDIESVESR LLDLGFKTHP IHGDIKTVIG AIGDKRLLNV HAISQMQGVE TLVPIMKPYK FASSELQHTP TIIDVGGVQI GGKEIVVMAG PCAIENEKDF VDTAISVKKS GAKILRGGAF KPRSSPYAFQ GLEEDGLKIM VAGREATGLK LVTEVVDTRD VELVNKYTDI FQIGARNMQN FRLLSEVGMT RKPVLLKRGL SATIEEWLMA AEYIIAEGNH EVILCERGIR TFETMTRNTL DLSAIPAIKD VSHLPVVVDP SHATGNWKYV PALAKGAVAT GADGLIIEVH PNPPSALCDG PQSLRPERFE SLMDELRLVA QAIQRTI
|
| |