Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2880 |
Symbol | |
ID | 7311496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3433865 |
End bp | 3434890 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643609775 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002507154 |
Protein GI | 220930245 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.320126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTATTG TAATGAATCC AAAGTCAAAT CAAATGCAAA TTGACGACGT GATCAATGTC CTGAAAAATT CAGGTTTGGG AGTTCATGTA TCACAGGGTA CTGAAAGAAC TATCATCGGT ATTATCGGTG ATAAAACGGT TCTTCGTGAC ATTCCACTTG AAATAATGCC GGGGGTTGAA AAACTGGTTC CCATTGTAGA GTCCTTCAAG CTGGCAGGAA AGACATTCCG GCCTGAACCC AGTGTTGTAG ATGTAAATGG CGTAAAAATA GGCGGTAAAG AATTGGTTAT AATGGCAGGT CCATGTGCCG TAGAAAGCCG GGAGCAGGTA ATTGAAGCCG CACAGGCAGT AAAAAGGTCC GGTGCCCAGT TTTTAAGAGG GGGTGCTTTT AAGCCAAGAA CCTCGCCATA TGCTTTTCAG GGCCTTGAGG AAGAAGGACT GAAGCTGCTC AAAGAAGCAA AGGATGCAAC AGGCTTGCAG ATAATCACAG AAGTCACCAG TGATAAGGCA GTAGAAACAT CAATACCATA TGTGGATATG TTTCAGATAG GTGCAAGAAA TGTGCAGAAC TTTCAGCTTT TAAAGGAAGT AGGAAAATCC ATGAAACCAG TTCTTTTGAA AAGAGGTTCT GCAACAACAA TTGACGAATG GTTAAATGCT GCCGAATATA TAATGAGTGA AGGCAACTAC AGCGTAGTAT TGTGCGAAAG AGGTATCAGA ACCTTTGAAA CAGCTACAAG GAACACTCTT GATTTGAGTG CGGTCCCCGT AGTTAAAAAT ATGAGCCACC TCCCTATAAT AGTTGACCCA AGTCATGCTG CCGGAAAGTC CAGGTATGTT ATTCCTCTTT CCAGGGCTGC GATAGCCGCA GGTGCTGACG GTCTTATAGT TGAAGTTCAC CCCAATCCAA TGTGTGCTCT CTCCGATGCA GCACAGCAGC TAAAGCCATC CGAATTTGAT TCACTTTGCA AAGATATAAG TAAGCTTGCA CCAATTCTGG AGAGAGAGTT TAATTATGGA TGTTAG
|
Protein sequence | MIIVMNPKSN QMQIDDVINV LKNSGLGVHV SQGTERTIIG IIGDKTVLRD IPLEIMPGVE KLVPIVESFK LAGKTFRPEP SVVDVNGVKI GGKELVIMAG PCAVESREQV IEAAQAVKRS GAQFLRGGAF KPRTSPYAFQ GLEEEGLKLL KEAKDATGLQ IITEVTSDKA VETSIPYVDM FQIGARNVQN FQLLKEVGKS MKPVLLKRGS ATTIDEWLNA AEYIMSEGNY SVVLCERGIR TFETATRNTL DLSAVPVVKN MSHLPIIVDP SHAAGKSRYV IPLSRAAIAA GADGLIVEVH PNPMCALSDA AQQLKPSEFD SLCKDISKLA PILEREFNYG C
|
| |