Gene Ccel_0141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0141 
Symbol 
ID7309052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp157776 
End bp159869 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content41% 
IMG OID643607070 
ProductGlycosyl hydrolase 67 middle domain protein 
Protein accessionYP_002504509 
Protein GI220927600 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3661] Alpha-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATAAAT CAAACGTAAA TGATGAGCTA TACGGTGCAA ATGGATATAA CTGTTGGCTT 
GGATATCATC TGCTTGAAAA CGGAGAGCTA AGAGAAAACT ATTCCCAATG GGCCTCCAAT
ATAGTAATTT CTAAAGAACC GGACGAAATA AAAATAGCTT TAAGCGAACT TAAAAGCGGA
ATAAATGGAA TATTGGGAGT TGATGCTGTT GTTGTAACCA GAGAGCCGGA ACAAAGCTCC
TGCATTGCTC TGGGTGTGCT TGGAAGAGGA CAGAACATTG ATAGCTATGT AAAATACGAT
GAGGTAGTGC AAATCGGTAA TGAAGGCTTT ATAATCAAGG CATTTAAAAC TGGTAATAGT
GAAATCGTTG TTGTTGCCGG TACAACCACA AAAGGCCTAC TCTACGGAGT ATTCAGTCTG
TTGAGACTAC TGCAAACTGA GGCAACGATT TCAGGTATCT TGAAGATTGA AAATCCTGCA
AACCAGCTTC GTATTATAAA CCATTGGGAC AATATCGATG GAAGTATTGA AAGAGGTTAT
GCGGGTAAAT CCATTTTCTT TACGGATAAT AAAGTAACCG AAGACCTTGG CAGAATAAAA
GACTATGCAA GACTTTTATG CTCTGTTGGA ATAAACAGTA TTGTTATAAA CAATGTTAAT
GTTCACAAGT ATGAGAGTAT GCTTATAACA GACAAATATC TCAATGATGT TGCAAGTCTG
GCTCAAATAT TCCGTGACTA CGGTATAAAG CTGTATCTTA GTGCAAATTT TGCAAGTACT
ATTGAAATAG GAGGACTAGC TACGGCCGAC CCGTTGGACC CGCAAGTAAG AAAGTGGTGG
AAGGAGAAGG CCGATGAGAT ATACTCGTTG ATACCTGACT TTGGTGGTTT TCTGATTAAG
GCAGATTCCG AATTCCGGCC TGGGCCTTTT ACTTATGGAC GTACCCATGC GGATGGTGCC
AATATGCTTG CAGAAGCCTT GGAACCATAC GGCGGCCTGG TTATATGGAG ATGCTTTGTA
TACAACTGTA TGCAGGATTG GCGTGATCGC ATCACAGATA GGGCTAGGGC TGCCTATGAC
AACTTTATGC CTCTTGACGG CTTGTTCAGG GAAAATGTAT TGCTTCAGAT AAAAAACGGC
CCTATGGACT TTCAGGTGCG TGAGCCGGTA TCTCCTTTAT TCGGGGGATT ACAAAAAACA
AACCAGCTAT TGGAGCTTCA GATTACTCAG GAATACACAG GACAACAAAA GCACTTATGC
TATCTGGTGC CAATGTGGAA GGAGATACTG GACTTTGATA CAATGGCAAA GGGTAGGAAC
ACAAGTGTGA AAAAAATTAT CACAGGATCC GTGTTCAATA ACAAATTAGG CGGAATGGCA
GCGGTAACAA ATATAGGAAA TGACCTGAAC TGGACGGGCC ACCAAATGGC TCAGTCAAAT
ACATACGGTT ATGCACGTTT GTGTTGGAAT CCTGATTTAT CAGCTGAAAA GATTACTGAT
GAATGGGTTA GAATGACTTA CTCAAATTAT GAAAAGGTTG TGAATACCGT AAAGGAAATG
CTGCTGGGTT CATGGAGAAC CTATGAGAAT TATACTTCTC CTCTGGGAAT AGGTTGGATG
GTTAATCCCA ATCACCATTA CGGGCCGAAT GTAGACGGAT ATGAATATGA TAAGTGGGGA
ACATATCACA GGGCAGACCA TAAGGGGATC GGAGTAGACA GAACAGTCAA GAGCGGAACA
GGATATGCGG GACAATATCA CAAGGATGTT GCCGGGATTT ATGAGGACAT GGACAAGTGT
CCTGAGGAGC TTTTGCTATT TTTCCACCAT ATGCCCTACG ACTACATACT AAAATCAGGC
GAAACGCTGA TTCAATACAT TTACAACACC CATTTCAAAG GGGTTGAGGA GGTAGAAGAA
TTGAGGAACA AGTGGTTTAG TCTGAAAGGT TGGATTAGCG AGGAAATATT TCTGCACGTT
CTGGAAAGAT TGGACGGACA GTTGGAACAT TCCAAAGAGT GGAGAGATGT TATAAATACA
TATTTCTATC GAAAAACAGG TATATCTGAT GAACTTGGCA GAAAAATATA TTAA
 
Protein sequence
MYKSNVNDEL YGANGYNCWL GYHLLENGEL RENYSQWASN IVISKEPDEI KIALSELKSG 
INGILGVDAV VVTREPEQSS CIALGVLGRG QNIDSYVKYD EVVQIGNEGF IIKAFKTGNS
EIVVVAGTTT KGLLYGVFSL LRLLQTEATI SGILKIENPA NQLRIINHWD NIDGSIERGY
AGKSIFFTDN KVTEDLGRIK DYARLLCSVG INSIVINNVN VHKYESMLIT DKYLNDVASL
AQIFRDYGIK LYLSANFAST IEIGGLATAD PLDPQVRKWW KEKADEIYSL IPDFGGFLIK
ADSEFRPGPF TYGRTHADGA NMLAEALEPY GGLVIWRCFV YNCMQDWRDR ITDRARAAYD
NFMPLDGLFR ENVLLQIKNG PMDFQVREPV SPLFGGLQKT NQLLELQITQ EYTGQQKHLC
YLVPMWKEIL DFDTMAKGRN TSVKKIITGS VFNNKLGGMA AVTNIGNDLN WTGHQMAQSN
TYGYARLCWN PDLSAEKITD EWVRMTYSNY EKVVNTVKEM LLGSWRTYEN YTSPLGIGWM
VNPNHHYGPN VDGYEYDKWG TYHRADHKGI GVDRTVKSGT GYAGQYHKDV AGIYEDMDKC
PEELLLFFHH MPYDYILKSG ETLIQYIYNT HFKGVEEVEE LRNKWFSLKG WISEEIFLHV
LERLDGQLEH SKEWRDVINT YFYRKTGISD ELGRKIY