Gene Information |
Locus tag | Ccel_1597 |
Symbol | |
ID | 7310354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 1935982 |
End bp | 1937397 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608526 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_002505929 |
Protein GI | 220929020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.312648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCAAACGCCT TATTGCTTTG GTAATTTTAA CAGCTGTGTT TATCTCGCCA TTTCCGCCGA CTGTGCAAAA TGGCAATCAA GTGCTGGCAC TGAATAACAG TCTGGGATTG ACCCCGCCCA TGGGGTGGAA CAGCTGGAAC ATCTTTGGAG GTGACATCAA TGAGGAAAAG ATCAAGCAAA TCACAGATGC TATGGTTACC ACAGGTATGA AGGATGCAGG CTATGAGTAT GTCAATATTG ATGATAATTG GATGGCAAAC CCTGCACGTG ACGCTAATGG AATACTTATT CCGGATCCCA AACGTTTTCC TAACGGTATG AAAGCTTTAG CAGATTATAT TCATTCAAAA GGGTTAAAAT TTGGAATTTA TGGTGACAGG GGAGTAACCA CATGCTGTAA TATTCCCCAG AGTGGAAGCC AAGGATATGA GGAACAAGAC GCAAAAACTT TTGCTCAATG GGGTGTAGAT TATTTGAAAT ATGATAACTG TGCTTCAGAC AGCAATTTGC AGGCAGGCTA CGAAAAAATG CGGGATGCTC TTTTGAAAAC AGGAAGACCT ATTTTCTATA GCATATGCTG CTGGTATTTT GCAGGTGCGT GGATGGTAGA TTGCGGTAAT TCTTGGAGAA CAACAGGAGA TATTAGTGAC AACTGGCGAA GTATTATAAA GAATATTGAT GAAAACTCCA AGTCAGCCGC GTATGCAGGC CCGGGCCATT GGAATGATCC AGATATGCTA GAGGTTGGTA ACGGTAATAT GACAGAGACT GAATACAAAG CACATTTCAG CATGTGGTGC ATGATGGCAG CTCCGCTTAT TGCTGGAAAT GACCTTAGGA ATATGACTCC TGCTACTAAA GATATTCTTA CTAACAAAGA GGTAATTGCT ATTAATCAAG ATGCTGCTGG CGTGCAAGGC ACCAAGGTAA GTACTTCGGG AGAACTTGAA GTGTGGTGTA AACCGCTAGG GACAGATGGC ACTACCAAGG CAGTTGCACT GTTAAATCGC GGAGCCGCAT CGGCAGATAT CACAGTTAAT TGGAGAGATA TAAAGCTTGC CGATGGGCCT GCCACTGTTC GTGATCTTTG GGAGCACAAG GATTACGGCA AGTTTAACAC TGAGTATACA GCCAATGTAC CTTCTCACGG TGTGGTGGTA TTAAAAGTTC AAGCAAGCTC CACCGATACG GATATAATGT ATGGTGATGT TGATGGAAGT GGCATGATTG ATGCACTGGA TTATTCATTA GTTAAAAGGT ATCTGCTAGA CCAGATTTCC GACTTTCCTG CTTCAAACGG CAAACTTACT GCCGATGTTG ATGGAGACAG TCAAATAACA GCGCTGGATT TTTCATTAAT TAAGCAATAT TTGCTGGGTA TTGTTAATAA ATTCCCTGTG CAATAA
|
Protein sequence | MKKIKRLIAL VILTAVFISP FPPTVQNGNQ VLALNNSLGL TPPMGWNSWN IFGGDINEEK IKQITDAMVT TGMKDAGYEY VNIDDNWMAN PARDANGILI PDPKRFPNGM KALADYIHSK GLKFGIYGDR GVTTCCNIPQ SGSQGYEEQD AKTFAQWGVD YLKYDNCASD SNLQAGYEKM RDALLKTGRP IFYSICCWYF AGAWMVDCGN SWRTTGDISD NWRSIIKNID ENSKSAAYAG PGHWNDPDML EVGNGNMTET EYKAHFSMWC MMAAPLIAGN DLRNMTPATK DILTNKEVIA INQDAAGVQG TKVSTSGELE VWCKPLGTDG TTKAVALLNR GAASADITVN WRDIKLADGP ATVRDLWEHK DYGKFNTEYT ANVPSHGVVV LKVQASSTDT DIMYGDVDGS GMIDALDYSL VKRYLLDQIS DFPASNGKLT ADVDGDSQIT ALDFSLIKQY LLGIVNKFPV Q
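
The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The GC helper simply counts G and C over all non-whitespace characters, which may differ from how the database computes its rounded GC content.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1935982, 1937397)
print(length, protein_length_aa(length))  # prints "1416 471"
```

Both results agree with the Gene Length (1416 bp) and Protein Length (471 aa) rows. Passing the Gene sequence string to `gc_percent` gives a value that can be compared with the GC content row.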