Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_2330 |
Symbol | |
ID | 7311005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2727397 |
End bp | 2730399 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643609258 |
Product | Beta-ketoacyl synthase |
Protein accession | YP_002506646 |
Protein GI | 220929737 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAATT TTGAAGAAAA CGGTTCCCTG GATGGTGCAA TAGCCATAAT AGGTATGGCA GGCCGTTTCC CGGGTGCCAA TAATACGGAA GAATTCTGGG AGAATCTTTA TAATGGTGTT GAGTCGGTTA AGTTTTTTAA TCACGACGAC CTTATAAAAA TGGGAATAGA TGAGCATTTG CTGGATAATC CAAAATATGT TGCCGCTGAC GCTATTCTGG ACGGAATGGA CATGTTTGAT GCAGAGTTTT TTGATTATTC TGCAAGAGAA GCAGAAATAA CGGACCCACA GCACCGTTTA TTTCTGGAGA GTGCATGGGA GGTTCTAGAA AGTGCGGGGT ACAATTCCGA CCTTTATGAC GGAAGAATCG CAGTCTATGC AAGTGCTAAC CTTAGCGGAT ATATGGTAAG AAACCTGTAT TCCAATCCCG GATTGGTTGA AAGTCTTGGC TCATTTAAAA TAATGATAGC AAACGGACAG GATTTCCTTG CAACAAAGGT TTCCTACAAA ATGAATCTTA TGGGGCCAAG CGTAAATGTC AACACTCTTT GTTCGTCATC GATGGTAGCA GTTCACTATG CTTGTCAGAG TCTTAACAGC TTTGAGTGTG ATATTGCTTT GGCAGGGGGC GTCAGCTTTC AAGTGTCAAG AAACGAGACA TTCTTTTATC AGGAAGGCGG TATAGGTTCA GCAGACGGAC ACTGCCGTGC CTTTGATTCC AAAGCCAACG GTACGGTAAG CGGTAGCGGA CTGGGTATTT TAGCATTGAA AAGACTTGAG GATGCAATCG CTGACGGTGA CTGCATACAT GCAATTATAA AGGGAACAGG CATTAATAAC GACGGTTCGT CCAAAAACAG TTATACTGCT CCTAATGTAG ACGGTCAGGC CGAATGTATT GCTGAGGCCA TAGAAATGTC AGGCGTCAAC CCCGAAACAA TAACCTATAT AGATGCTCAC GGAACAGGTA CAAACTTGGG AGATCCTATT GAAATAGCCG CTTTGACCAA GGCATTCAGA GCTTATACCG ATAAAAAAGA ATTTTGTGCC ATAGGTTCGG CCAAAACCAA TATAGGGCAT CTTGTAAATG CCGGTGGTCT GGCCAGCATG ATAAAAACAG TACTTTCCAT GAAACACAGG ATAATTCCGG CAAGTCTGAA TTTCGAGGAG CCGAACCCCA AAATAGACTT TGTAAACAGC CCATTTTATG TAAACAGTAA ACTTTCAAAA TGGGAAACAG AAGGTTTTCC TATAAGAGCA GCTGTTAGCT CCTTTGGAAT CGGAGGTACC AATACACATG TTATTCTGGA GGAAGCTCCT GCTGTTGTAC CATCGGAAAA TTCCCAAAGG CCTTACCAAT TGATTTCACT ATCAGCAAAG ACAGAAACAG CACTTGAAAA AATGACACAA AACCTTGTTG AACATATAAA GAAGAATCCT GACTTGAATT TGGCAGATAT AGCATTTACA CAGCATGTAG GCAGAAGGAG TTTCAATCAC CGCAGAATTA TGATATGTAA GAACCTTGAA GATTTGGAAA CAAAAATAAG TAATTCCACC TGTGGTAATG TTATCTCATA CTTCCAGAAA CATAAGGACA GGCCGGTTAT ATTTATGTTT CCGGGAGAAG GGAAATACCT AAATGAATGC GGTGAATTGT ACAGGACGGA AGAAAAGTTC AGAAATTCCG TCGATTACTG TGCAGATATT CTGTTCCCCA TGTTGGGTAC TGATATCAAA GCAGTACTTG ACTCCGGCAG TTATATGAAT AACAACCAAA CCATTGAAAG GGCAGCTGTC TTTGTTGCAC AGTACTCAAT GTCAAAACTT CTTCAAGAAT CAGGTTTAAA ACCTGAAAGT ATGGTAGGAG AAGGTTTGGG AGAATATGTT TGTGCATGTA TTTCAGGAGT GATGAGCCTT GAAGATTCAT TGAGAATTAC TGCTGCCGAA GAAGAGGATT TTCCGGGAAT TCTTTCGGAA ATTAGCTTGA GCAAACCTCA AATACCTTTT GTATCATCCT TTAGTGGGAA ATGGATAGAG GATTCGCAAG CTACAAATAC GGATTACTGG TTAAAACAGC GTAACAGCAG TTTTTCAGCC CAAGGCTTGA AGGAAATTCT GTCGGATGAT GAAAGGATTT TTGTTGAAAT CGGAAATGGA AAAAGATTCA TTTCAGAAGA GGGTACTTCG TTGATATATG TTCAAAACGA AAACAAAAAT CAAGAAGAAG CCTTTATGGA GTGCTTGGGA AAGATTTGGG TTTACGGAGG AAATGTAAAC TGGAATAAAT TCTATGAGGA GGAAAAGAGA CACCGTATTC CTCTTCCTAC ATATCCATTT GAAAGGCAGA GATACTGGAT TGAACCGGGA GTAAAAAGCG AAGCTGCCAA GGAGCCTGAG TTCCCTGTTT CGGATAAATC AGAAATAGAT GAAAAGTTGC TTGCGAAATG CAGTAGGGTA AGCATTTCCG CATCCATAGA GTTAAATGAA AATACATACG GCTTAGAAAC ACATGCCCCA AAGAAAAAAC AAATAGAAGA AATGCTTTTG TTCAAAGAAA AACTTGAAAA ATTATGCTCT GAGTTTGAGG GAAAAATAAA TATCTCACCC CTTTTGTTAA AAGGGTCTGA AGAGCCTTCC TGCAGTGATT TAGGAGGTGC CGAAAAACTT AGTAAAAGGC CTCGTCCCGA TTTGGATGTA CCTTATGTGG AACCCGTCAG TGAGACTGAA AAAATAATTG CCGAGCATTG GAAAAAGGTT CTTGGCTTTG AAAAACTGGG AATACATGAC GATTTCTTTG AGCTGGGAGG CCATTCGCTG ATTGCTGCCG CCGTTGCAAC CGATTTGAGC AAGATATTCA GAATACAAAT TCCGATGACC AAGCTTTTGG AAACCACCAC AATAGCCTCT GTTGCAGAAA TGGTTGAAAC CTATCAATGG GCTGTGCAGG GAGAAGGAGA GGCGGCTGCT GCGGAAGAGG ACATGGAAGG CGGCACAATA TAA
|
Protein sequence | MGNFEENGSL DGAIAIIGMA GRFPGANNTE EFWENLYNGV ESVKFFNHDD LIKMGIDEHL LDNPKYVAAD AILDGMDMFD AEFFDYSARE AEITDPQHRL FLESAWEVLE SAGYNSDLYD GRIAVYASAN LSGYMVRNLY SNPGLVESLG SFKIMIANGQ DFLATKVSYK MNLMGPSVNV NTLCSSSMVA VHYACQSLNS FECDIALAGG VSFQVSRNET FFYQEGGIGS ADGHCRAFDS KANGTVSGSG LGILALKRLE DAIADGDCIH AIIKGTGINN DGSSKNSYTA PNVDGQAECI AEAIEMSGVN PETITYIDAH GTGTNLGDPI EIAALTKAFR AYTDKKEFCA IGSAKTNIGH LVNAGGLASM IKTVLSMKHR IIPASLNFEE PNPKIDFVNS PFYVNSKLSK WETEGFPIRA AVSSFGIGGT NTHVILEEAP AVVPSENSQR PYQLISLSAK TETALEKMTQ NLVEHIKKNP DLNLADIAFT QHVGRRSFNH RRIMICKNLE DLETKISNST CGNVISYFQK HKDRPVIFMF PGEGKYLNEC GELYRTEEKF RNSVDYCADI LFPMLGTDIK AVLDSGSYMN NNQTIERAAV FVAQYSMSKL LQESGLKPES MVGEGLGEYV CACISGVMSL EDSLRITAAE EEDFPGILSE ISLSKPQIPF VSSFSGKWIE DSQATNTDYW LKQRNSSFSA QGLKEILSDD ERIFVEIGNG KRFISEEGTS LIYVQNENKN QEEAFMECLG KIWVYGGNVN WNKFYEEEKR HRIPLPTYPF ERQRYWIEPG VKSEAAKEPE FPVSDKSEID EKLLAKCSRV SISASIELNE NTYGLETHAP KKKQIEEMLL FKEKLEKLCS EFEGKINISP LLLKGSEEPS CSDLGGAEKL SKRPRPDLDV PYVEPVSETE KIIAEHWKKV LGFEKLGIHD DFFELGGHSL IAAAVATDLS KIFRIQIPMT KLLETTTIAS VAEMVETYQW AVQGEGEAAA AEEDMEGGTI
|
| |