Gene Ccel_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1597 
Symbol 
ID7310354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1935982 
End bp1937397 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content42% 
IMG OID643608526 
Productglycoside hydrolase clan GH-D 
Protein accessionYP_002505929 
Protein GI220929020 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.312648 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TCAAACGCCT TATTGCTTTG GTAATTTTAA CAGCTGTGTT TATCTCGCCA 
TTTCCGCCGA CTGTGCAAAA TGGCAATCAA GTGCTGGCAC TGAATAACAG TCTGGGATTG
ACCCCGCCCA TGGGGTGGAA CAGCTGGAAC ATCTTTGGAG GTGACATCAA TGAGGAAAAG
ATCAAGCAAA TCACAGATGC TATGGTTACC ACAGGTATGA AGGATGCAGG CTATGAGTAT
GTCAATATTG ATGATAATTG GATGGCAAAC CCTGCACGTG ACGCTAATGG AATACTTATT
CCGGATCCCA AACGTTTTCC TAACGGTATG AAAGCTTTAG CAGATTATAT TCATTCAAAA
GGGTTAAAAT TTGGAATTTA TGGTGACAGG GGAGTAACCA CATGCTGTAA TATTCCCCAG
AGTGGAAGCC AAGGATATGA GGAACAAGAC GCAAAAACTT TTGCTCAATG GGGTGTAGAT
TATTTGAAAT ATGATAACTG TGCTTCAGAC AGCAATTTGC AGGCAGGCTA CGAAAAAATG
CGGGATGCTC TTTTGAAAAC AGGAAGACCT ATTTTCTATA GCATATGCTG CTGGTATTTT
GCAGGTGCGT GGATGGTAGA TTGCGGTAAT TCTTGGAGAA CAACAGGAGA TATTAGTGAC
AACTGGCGAA GTATTATAAA GAATATTGAT GAAAACTCCA AGTCAGCCGC GTATGCAGGC
CCGGGCCATT GGAATGATCC AGATATGCTA GAGGTTGGTA ACGGTAATAT GACAGAGACT
GAATACAAAG CACATTTCAG CATGTGGTGC ATGATGGCAG CTCCGCTTAT TGCTGGAAAT
GACCTTAGGA ATATGACTCC TGCTACTAAA GATATTCTTA CTAACAAAGA GGTAATTGCT
ATTAATCAAG ATGCTGCTGG CGTGCAAGGC ACCAAGGTAA GTACTTCGGG AGAACTTGAA
GTGTGGTGTA AACCGCTAGG GACAGATGGC ACTACCAAGG CAGTTGCACT GTTAAATCGC
GGAGCCGCAT CGGCAGATAT CACAGTTAAT TGGAGAGATA TAAAGCTTGC CGATGGGCCT
GCCACTGTTC GTGATCTTTG GGAGCACAAG GATTACGGCA AGTTTAACAC TGAGTATACA
GCCAATGTAC CTTCTCACGG TGTGGTGGTA TTAAAAGTTC AAGCAAGCTC CACCGATACG
GATATAATGT ATGGTGATGT TGATGGAAGT GGCATGATTG ATGCACTGGA TTATTCATTA
GTTAAAAGGT ATCTGCTAGA CCAGATTTCC GACTTTCCTG CTTCAAACGG CAAACTTACT
GCCGATGTTG ATGGAGACAG TCAAATAACA GCGCTGGATT TTTCATTAAT TAAGCAATAT
TTGCTGGGTA TTGTTAATAA ATTCCCTGTG CAATAA
 
Protein sequence
MKKIKRLIAL VILTAVFISP FPPTVQNGNQ VLALNNSLGL TPPMGWNSWN IFGGDINEEK 
IKQITDAMVT TGMKDAGYEY VNIDDNWMAN PARDANGILI PDPKRFPNGM KALADYIHSK
GLKFGIYGDR GVTTCCNIPQ SGSQGYEEQD AKTFAQWGVD YLKYDNCASD SNLQAGYEKM
RDALLKTGRP IFYSICCWYF AGAWMVDCGN SWRTTGDISD NWRSIIKNID ENSKSAAYAG
PGHWNDPDML EVGNGNMTET EYKAHFSMWC MMAAPLIAGN DLRNMTPATK DILTNKEVIA
INQDAAGVQG TKVSTSGELE VWCKPLGTDG TTKAVALLNR GAASADITVN WRDIKLADGP
ATVRDLWEHK DYGKFNTEYT ANVPSHGVVV LKVQASSTDT DIMYGDVDGS GMIDALDYSL
VKRYLLDQIS DFPASNGKLT ADVDGDSQIT ALDFSLIKQY LLGIVNKFPV Q