Gene Ccel_1013 details

Gene Information

Locus tag: Ccel_1013
Symbol:
ID: 7309838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1258840
End bp: 1261941
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 42%
IMG OID: 643607940
Product: glycoside hydrolase family 2 TIM barrel
Protein accession: YP_002505355
Protein GI: 220928446
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
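
The Start bp, End bp, Gene Length and Protein Length values above agree with one another if the coordinates are 1-based and inclusive and the protein length excludes a single terminal stop codon. The Python sketch below checks that arithmetic. The function names are illustrative, and the coordinate convention is an assumption that this record does not state explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1258840, 1261941)
print(length, protein_length(length))
```

With the record's coordinates this prints 3102 and 1033, matching the Gene Length and Protein Length fields.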


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGAG AATGGGAAAA TCAGTATATA ACACAAATAA ACAGGTATCC GATGCACTCG 
CCATATGGAG CTTATGAATC TGTTGAACAG GCCATGAGCT GTAACCGCTG GACTTCAAAA
TATGTAAAAA GCCTGAGCGG GATATGGAAG TTCAAGCTTG CCCAAAATCC ACAGCAGGCA
CCGGAAAATT TCTATGCATT AAATTATGAT GTTTCCGACT GGGATGATAT ACCCGTACCT
TCAAACTGGG AGCTGCATGG GTATGGCAAA CCGGTTTATA CCAATATTAT ATATCCATTT
AAGAGGGAAG GAGTCGGATC GCACTATGAG ATAGAGGTAG CGGAGGGACA GGTAGAGCTT
AATGCACCTC TGGTACCTGA AAAAAACTTG ACAGGCTGTT ACCGAACCGA TTTTGAAGTG
CCGGATTATT TTGAAGGAAA GGATGTATTT ATAGAATTCG GAGGTGTAGA GTCCTGTTTT
TACCTTTGGG TAAACGGTAC GGAAATAGGC TTTTCCAAGG ATAGCAAGCT GGATGCTTCC
TTTGATATCA CACATGCCGT ACATAGTGGA AAAAATGAAC TGGCAGTAAA GGTACTACAA
TACTGTGACG CCTCGTACCT TGAGGATCAG GACTACTGGC ACTTGTCAGG AATATACCGG
GATGTGAGAA TTTATGCAAA GAACAAACAA CGGCTGCTTG ACTACAAGGT AGAAACTCTG
TTTAAGGACA ATAATTTTAA AGAAGCCGAA CTGAGAGTAA TGCTTCAGCC AAATAATAGA
GTCAGGGGTT ATGGAGAATC TTATGTGAGA TTGAGTCTTT ATAACGCTGA AAATGAGTTG
GTGACTGCTT TTCAGAGTCA ACCCTATGCC AAATGCGGAG TATATCTGGA GTCTAAATTT
ACTGCTTTTC CGTCTGCATT TGTGAAAAAT CCCCATTTAT GGTCTGCTGA GGAGCCGTAT
TTATATACTT TGGTTTTGGA GACGGTAGAC AAAAACGGAT GTGTCACAGA TATTGAAAGT
ACAAAAGTAG GCTTCCGTAA AGTAGAAATA GGCAGGGACG GTGTGTTTTA CCTGAACGGC
AGGAGACTAC TGGTGCGGGG AGTGAATTTA CACGAATTCT GTCCCGAAAC TGGAAGATAT
GTTTCGATGG AATATATGAG ACAGCAGATT TTATCCATGA AACAGATGAA TTTCAATGCC
GTTCGAACCA GCCATTATCC CCATGCAAGT GAATGGTATG AGCTTTGTGA TCAGCTGGGA
ATATATGTTG TTGATGAGGC CAATCTTGAA ACCCATGGTT TTGGCGGACA GTTAAGTTCG
TCCGCAGAAT GGACTGCGGC ATACTTGGAA CGTGCTACTC GAATGGTACT CAGAGATAAG
AATCATCCTT CTATTATCAT CTGGTCGCTG GGGAACGAAT CTGGGGCAGG TGCTAACCAT
GCGGCTATGT ATGGTTGGAT AAAAGAATAT GACAAAACAA GGTATGTTCA GTATGAATCC
TGCAACCCTG AAAGCAATAT TACCGATATT ATCTGTGCAA TGTATCCAGC AAAAGACTGG
GTGGAGCAAA AGATGGCAGA GCATGATGAC CTGAGACCCT TCATAATGTG TGAATATGCC
TATGCCAAAA GCAACAGCAA TGGCAATTTT AAACTATATT GGGACTTGGT GGAGAAGTAT
CCACGTTTCC AGGGCGGGTT CATATGGGAC TTTCAAGATA AGGCACTTGT AAAGCAGCAG
GAAAACGGGA CCTCAAGATA TGTATATGCA GGAGCTTTTA ATGAGGAGGT AGTTGACCCC
ATAGAGGATA TGTGTTTGAA TGGAGTGGTG CTGCCGGATT TGGACTGGAA ACCAGCTGCA
TTTGAAATTA GGAATTGCCA GGCCCCGGTG ACAATCTTCC ATGATGGAAG CACATATCCT
GAAGCTGGTG ACTATAAAAT AAAGAATAAC TATATGTTTA AGGATTTGAG TCATTTATAC
CTTACCTGGG AGCTACTGTG TGACGGAAAA ATCGTGGATA GAGGAGTTAT AAAGCAGTAT
TTTACGGCTC CGGGACAGTC GGATATTCTG GAATATGCCT TGAAACCTGA GAAAATATCT
GGTGAAGCTT TTGTAAATAT CACAGTCTCA TTAAATGAGG ATGTACCGTA TGCAAAAAAG
GGACATGTTA TTTATGCATA TCAAATCCCA TTGCAGGAGT CGGTGTTAAA AATGCAGAAG
GTATGTATTA ACCATAATAA ACTGACAATG CATGAAACTG CCGGAGAAAT ATTGATACTG
GGAGAAAATA CTGAGGTTCG TTTTGACAAG GAAAGCTGTA TATTCACAAA AGCTGTTTTT
AACGGAATTG AGAGCTTTTG GGGGGGCCGG GATAATTTTT ACCGGGCTTC AACGGGAATT
GACGAAGGGT GCAGGAATCC CGGCGGCAAT TACTCAACTG ACTGGAAGGA CTGGGGCTTA
AATGATTTGA AAATAAAGGA CATAAAAGTA GATACTGCTG TTTCTGAATC ACAAATATTT
ATATTTACAA ACGTGTCATA CAATGATGAT AAATTGATTG TTTCCACACA GTACAGGATT
GGCAGTAGAG GTATGGAGGT TGATAAAACC GTCATTAATA ATTGTCCCGG TGAGACAATT
CCAAGAATCG GATTCTCATT AATGCTTCCG GAGGATAAAC AGCAAATAAC TTGGTATGGA
AGAGGACCTT GGGAGAATTA CGCCGACAGA AAGGAAGCAG CCCTTATAGG GTGTTACAGC
AGTACCGTCC CGGAACAGTA TACACACTAT GTGAAGCCCG TGGAATGCGG CGGAAAAGAA
GATGTGAGAT ACATCATTGT ACAGGATAAG GGGGGACATG GCCTACGCGT TTCAGGCACC
GTACCTTTCC ATTTTGATAT TCATGATTAC TCTGTGGAGG CCTGCGATAA TGCAATGTAT
GAGCATGAAC TAATAAAGGA TAAATACGTA CACCTAAATA TAGATCATTT ACATGCCGGC
TTGGGCGGAG ACAACGGCTG GTCAAAAAAT ATTCATACGG AGTACCGTAT TGGAAAAGGT
TGTTATCATT ATCAGATATT AATAGAAGTG GTTGACAATT AA
 
Protein sequence
MSREWENQYI TQINRYPMHS PYGAYESVEQ AMSCNRWTSK YVKSLSGIWK FKLAQNPQQA 
PENFYALNYD VSDWDDIPVP SNWELHGYGK PVYTNIIYPF KREGVGSHYE IEVAEGQVEL
NAPLVPEKNL TGCYRTDFEV PDYFEGKDVF IEFGGVESCF YLWVNGTEIG FSKDSKLDAS
FDITHAVHSG KNELAVKVLQ YCDASYLEDQ DYWHLSGIYR DVRIYAKNKQ RLLDYKVETL
FKDNNFKEAE LRVMLQPNNR VRGYGESYVR LSLYNAENEL VTAFQSQPYA KCGVYLESKF
TAFPSAFVKN PHLWSAEEPY LYTLVLETVD KNGCVTDIES TKVGFRKVEI GRDGVFYLNG
RRLLVRGVNL HEFCPETGRY VSMEYMRQQI LSMKQMNFNA VRTSHYPHAS EWYELCDQLG
IYVVDEANLE THGFGGQLSS SAEWTAAYLE RATRMVLRDK NHPSIIIWSL GNESGAGANH
AAMYGWIKEY DKTRYVQYES CNPESNITDI ICAMYPAKDW VEQKMAEHDD LRPFIMCEYA
YAKSNSNGNF KLYWDLVEKY PRFQGGFIWD FQDKALVKQQ ENGTSRYVYA GAFNEEVVDP
IEDMCLNGVV LPDLDWKPAA FEIRNCQAPV TIFHDGSTYP EAGDYKIKNN YMFKDLSHLY
LTWELLCDGK IVDRGVIKQY FTAPGQSDIL EYALKPEKIS GEAFVNITVS LNEDVPYAKK
GHVIYAYQIP LQESVLKMQK VCINHNKLTM HETAGEILIL GENTEVRFDK ESCIFTKAVF
NGIESFWGGR DNFYRASTGI DEGCRNPGGN YSTDWKDWGL NDLKIKDIKV DTAVSESQIF
IFTNVSYNDD KLIVSTQYRI GSRGMEVDKT VINNCPGETI PRIGFSLMLP EDKQQITWYG
RGPWENYADR KEAALIGCYS STVPEQYTHY VKPVECGGKE DVRYIIVQDK GGHGLRVSGT
VPFHFDIHDY SVEACDNAMY EHELIKDKYV HLNIDHLHAG LGGDNGWSKN IHTEYRIGKG
CYHYQILIEV VDN
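
The gene and protein sequences above can be cross-checked against each other, and against the GC content field, with a short script. The Python sketch below is one way to do that. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code, so alternative start codons are translated literally rather than as methionine.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments follow the
# standard code, which table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks non-ACGT codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the listing above.
first_line = ("ATGAGCAGAG AATGGGAAAA TCAGTATATA "
              "ACACAAATAA ACAGGTATCC GATGCACTCG")
print(translate(first_line))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would confirm that the two listings agree. `gc_content` can be compared with the GC content field in the same way.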