Gene Ccel_0992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0992 
Symbol 
ID7309822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1222003 
End bp1225173 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content42% 
IMG OID643607919 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_002505334 
Protein GI220928425 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAAA GGTCTTTCCC GCTGGACGGA GATTGGTATG TAGCTCTGGA TGAAAAGGAG 
ATCGGAATAA AAGAGAAATG GTTTGAAAGT ACATTTACAG AAAAGGTTAA ATTACCCGGT
ACCCTTGATG AAAATGGCAT TGGCAGTCTG GTAACAGGTA CTGACACCTT TAGATTGAAC
CGTATACGCA AATATGTAGG GGCAGCATGG TATAACAGAT TGGTAGTGCT ACCTGACGAC
ATAGTAGAAA AGCATGTCAC ATTGTTTTTG GAAAGATGCA TGTGGAAAAC AGAACTTTGG
ATAAATGGCA ATTATGCCGG CTCTCAAAAT AGCTTAAGCA CCCCTCACAA GTATAAGCTT
GACGGAATGC TTAAAGCGGG TGAAAACCTT ATCAGCATCA AGGTGGATAA TTCACCTATC
TATAATCTGG GAGTCATGAG CCACGGATAT TCGGAAGAGG TACAAACCGT TTGGAACGGC
ATAGTGGGAA GAATAGAACT GGATATTAGG GATAAAGTTT ATGTGGAGCG AGCAAATGTG
TATTCTGATA TGGAAAGCAG TAAGCTATCA ACGCGCCTGT TGCTGGTTAA TACTGCTTCA
AAAGCAGTCG AAGGCTTGAT TAGTCTTTCG GTCCGTCCTC AAGATGGAGA AAATGTAATT
ATAAGGGATT CATATTATTT TAGTATAGGA GCAAATTCAT CAGTTAACGT TGAGTTGACT
CAGGCGTATG AGGACAAGCT AGGCTTGTGG GATGAGTTTA ATCAAAATTT GTATAGAATG
GATATAAATT TGGAATGCCA GTGTGAAAAA GAGAATGAAA TGTATTCTGA TTCAAGCCAA
ATACTTTTCG GTATTCGGAG CTTTAAGGCA GAAGGTCCTG TGTTTAGGCT TAACGGCAAA
AAAATATTCC TAAGGGGGAC TCATGATGCC GGTAATTTTC CGATTACAGG GTATCCGTCA
ATGGAGGTTG AAGACTGGAA ACGTATATAT GCAATAGCAA AATCCTATGG AATCAACCAC
TTCAGATTTC ACTCATGGTG TCCCGGCGAA GCTGCATTTG CTGCCGCTGA TGAAGAGGGA
ATGATTTTAC AGGCAGAATT GCCCTTATTT GGTTTTACTG CTCCGCCGCT GGGACAGGAT
GAACCTAGAG ACAGCTTCCT GAGAGAAGAG CTGCTTAGAA TTCTGGAAGA GTATGGGAAC
CACCCGTCAT TTTGCATGAT GTGCATGGGA AACGAACTTC GCGGAGACTA TGAGATGCTC
TCTGAATTTG TTGAAATGGG CAGGAAAACG GACGGAAGGC ACCTTTATTC GTCAGCCGCA
AACAATGCGG CAGAACCAGG ACTTGGGATC AAACCTAATA AAGGTGATCA GTACTATGTT
GCTCATGAGG CAAGGATAAA CGGTGAAAGG GTTAATAGAC GCTGTGAGAA TGTTTTCAAT
AATGAAAGGC CTGAAACTAT ATCCGACTAC AGTGAGACAC TGAAAGGAAT AGAGGTTCCG
ACCATATCTC ATGAAGTAGG CCAGTGGGAG GTTTATCCCG ACTTTGAAGA AATAAAAAAG
TACACTGGTG TTTTAAAGGC AAAAAATTTT GAGGAGTTTA AAGGATCGGC AGCGGCAAAA
CAGGTTTTGA ACAAAAATAA GGATTTTGTA ATGGCTTCGG GCAAATTGGC GGTTTTATTG
TACAGGGAAG AAATTGAACG CTCGCTGAGA ACTGCGGATT ATGGAGGTTT TCAGCTTCTT
GACATGCATG ATTTCCCTGG GCAGGGTACT GCCCTTGTGG GATGGCTGGA TGCCTTCTGG
GATTCCAAAG GTCTGGTGGA GCCCCGGGAA TTCAGAAACT GGTGCAACCA CTCTGTGCTA
CTGGCAAGAA TGGAAAAACG TGTGTGGTTA AATAGTGAGG TATTCAGTGC TGAGATCAAT
TTTGCCAATT ACAGTCAAAA TGATTATTCA CAGCTTAATA TCAGGTGGGA ATTAACCCGA
AGTGACGGTA CGATGTATTC CTGCGGTAAA TTTAATAACA TCTTCATTCC CCAGGGCAAC
CTATCTTATA TTGGAGTGGC AGCAGCTGAG CTCAATAAGG TAGAAACTGC TGAAAAACTT
ATACTTACAA TTGTTACGGA CGGCATTTGT ATAACAAATC AATGGGATAT CTGGGTATAT
CCCGAAAAGC TAGCTGTTGA AATGCCGGAA GACATATATG TTGCCGAAAG CTGGGATGAC
GGAGTGTCCG AAACCTTGGG CAATGGGGGG ACAGTGCTTA TGTTTACTGG AGCAGTAAGA
AATTCGGAGG CAATGTGTTT TACGACTCCA TTTTGGAATA CCCAGATGTT TGAAAACCAG
AGGAAAACCA TGGGTATTCT TTGTAACCCT AATCATCCTG CTCTTCTGGA CTTCCCCACG
GAGTACCATA CAAACTGGCA ATGGTGGGAG CTGCTTGCCG ATTCAAAGTG TATAGGCATA
AATGAACTCC CGGCAGGGTT TGAGCCAATT GTCTCTGCAA TAGATCACCC TGTCAGAAAT
AACAGGCTAG GAGTTATTTT TGAAGCAAGG GTTCTCAGCG GAAAACTTCT TATATGTAGC
CTCGATTTGA ACGGCGATTT AGGCGGCCTT CCGGCAGCAA GACAGTTAAA GTACAGTATT
CTGAATTATA TCCAGAGTGA AAGGTTTGAG CCTGAATTTT ATGTGGATGA GAGCATTATG
GCAAATATGT TGTTAAAAAA CACTGCATCC AATTTAAAAT TACTGACAAA AAATATTAAG
GCATCAAATA CAAAATACTC AAGTAAAGTG GAATATATTC TTGATACTGA TACGTCAAAT
TTTTGGGTCA CAATGGGAGG ACAATACCCG TATGTCATAG ATATTGAACT TACGGAAAGT
ACAGCCATAA AAGGTTTAAC ATATTGGCCG AGACAGGATG GTATGACGAT AGGACTTATA
TCCAGATATG AAATATATAT ATCCGGTGAT CCGGTGCAAT ACGGTAAGCC TGTCGCATCA
GGAAGCTTTA AAAACACACT TGAAAAACAG GAGATAATTC TGGATTGGAT TGATGACGGA
TTTAATGTTA CCAGGAGCAA GACAGGGAAA TATATTAGAT TTGTTGCTGT TAAAGGGTTT
AACAATGACC GGGAAGCTGC CATTGGAAGT CTGGACATAA TAACCGTATA A
 
Protein sequence
MDKRSFPLDG DWYVALDEKE IGIKEKWFES TFTEKVKLPG TLDENGIGSL VTGTDTFRLN 
RIRKYVGAAW YNRLVVLPDD IVEKHVTLFL ERCMWKTELW INGNYAGSQN SLSTPHKYKL
DGMLKAGENL ISIKVDNSPI YNLGVMSHGY SEEVQTVWNG IVGRIELDIR DKVYVERANV
YSDMESSKLS TRLLLVNTAS KAVEGLISLS VRPQDGENVI IRDSYYFSIG ANSSVNVELT
QAYEDKLGLW DEFNQNLYRM DINLECQCEK ENEMYSDSSQ ILFGIRSFKA EGPVFRLNGK
KIFLRGTHDA GNFPITGYPS MEVEDWKRIY AIAKSYGINH FRFHSWCPGE AAFAAADEEG
MILQAELPLF GFTAPPLGQD EPRDSFLREE LLRILEEYGN HPSFCMMCMG NELRGDYEML
SEFVEMGRKT DGRHLYSSAA NNAAEPGLGI KPNKGDQYYV AHEARINGER VNRRCENVFN
NERPETISDY SETLKGIEVP TISHEVGQWE VYPDFEEIKK YTGVLKAKNF EEFKGSAAAK
QVLNKNKDFV MASGKLAVLL YREEIERSLR TADYGGFQLL DMHDFPGQGT ALVGWLDAFW
DSKGLVEPRE FRNWCNHSVL LARMEKRVWL NSEVFSAEIN FANYSQNDYS QLNIRWELTR
SDGTMYSCGK FNNIFIPQGN LSYIGVAAAE LNKVETAEKL ILTIVTDGIC ITNQWDIWVY
PEKLAVEMPE DIYVAESWDD GVSETLGNGG TVLMFTGAVR NSEAMCFTTP FWNTQMFENQ
RKTMGILCNP NHPALLDFPT EYHTNWQWWE LLADSKCIGI NELPAGFEPI VSAIDHPVRN
NRLGVIFEAR VLSGKLLICS LDLNGDLGGL PAARQLKYSI LNYIQSERFE PEFYVDESIM
ANMLLKNTAS NLKLLTKNIK ASNTKYSSKV EYILDTDTSN FWVTMGGQYP YVIDIELTES
TAIKGLTYWP RQDGMTIGLI SRYEIYISGD PVQYGKPVAS GSFKNTLEKQ EIILDWIDDG
FNVTRSKTGK YIRFVAVKGF NNDREAAIGS LDIITV