Gene Ccel_0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0154 
Symbol 
ID7312065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp177659 
End bp180751 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content40% 
IMG OID643607083 
Productglycoside hydrolase family 2 TIM barrel 
Protein accessionYP_002504522 
Protein GI220927613 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATAG AAAATTATTT TGAAAACCCT AAAATACTGC ATGTGGGTAC TATGGAGAAA 
AGGGCTTACT ACGTACCATA CCCACCGGAA GATGTCAATA AAAGCAGTGA CAGGAATAAG
TCCGAAAGAC TTCACTTATT AAACGGAGAG TGGGATTTTC TCTATTTAAA AAGTGTATAC
AATATAACGG ATGAATTCTA CTTACCCGGA TATGACAGAG CAGGATTTGA CAAGATACCT
GTTCCTTCAG TGTGGCAGAA CCATGGGTAT GACAGACATC AGTATACAAA TATAAAATAT
CCTTTTCCCT ATGATCCTCC ACATGTACCG GTAGATAACC CTTGTGGTGT TTATGTCAGA
GAGTTTTACG CAGACACCAG TTGGAACGGG ATGAGGAAGT ATATCAATTT TGAAGGAGTT
GATTCATGCT TCTACCTCTG GATAAATGGT AAATTTACAG GTTACAGTCA GGTTTCCCAT
TCTACCAGTG AATTTGATAT CACAGACTAT GTACATGAAG GAGTAAACAC CATAGCAGTT
CTTGTGCTAA AATGGTGTGA TGGGAGTTAT TTTGAAGATC AGGACAAGTT CAGGACATCT
GGTATATTCA GAGATGTATA TATTCTTATT AGGCCCGAGA ATCATATAAG GGACTTTTTT
GTAAGAACTT CACTGAAAGA GGATTATAAA AAGGCCGAAA TAAAAGTGGA TATGGACATC
TTCGGAAGTA TTCCTGGCAT AGACTATAAA CTTACAGATA ATAGCACTGG TAATATTGAA
GCACAAGGTA ATGCAAATGG AAAAAGTATA AGTATTGAAA TGGATAATCC GAAATTGTGG
AGTGCGGAAA TCCCTTATCT TTATACCCTA ATTATTCGAT CGGATTACGA GGCTATTTCT
GTACAAGTAG GAATACGTGA AATAAAAGCT GAGGACGGTA TTGTTTATCT TAACGGGCAA
AAGATTAAAT TCAAGGGGGT CAACCGACAT GACTCCGACC CAGTAAAGGG GCCCGCTGTG
GATGTTAAAG ATATGATAAA AGACTTGATG CTTATGAAAC AGCACAATAT AAATGCCATT
CGCACAAGTC ATTACCCAAA CTCACCTTTG TTTACAGAGC TTTGTGATAA ATATGGCTGC
TATGTTATTG CAGAAGCAGA TATTGAAGCT CATGGCACAA CAGCGGTTTA TGGAGGTGGT
CAAGATGGAA AATCATTTCC AATGCTGGCA CATGACTCTG AATTTGAAGA TGCTGTTATA
GACCGCGTTG AGAGTTGTGT TATACGGGAT AAGAATCATC CCTGTGTTGT TATATGGTCA
CTTGGGAATG AGTCCGGATA TGGCAAAAAT TTTGAAAGTG CTTTGGACTG GCTTAAAGGT
TACGATAACA GCAGACTTAC CCATTATGAA AGCTCACTAT ATCCACCGGA GGGTTATGAT
GCTGATTATA GCAGCTTGGA TTTGTATAGC CGTATGTATG CCTCCTGTGA AGATATAATA
GAATATTTCC AGGGAGATAA CGTTTCTAAA CCTCTTATTT TATGCGAATA CTCCCATGCC
ATGGGTAACG GCCCGGGAGA CCTTGAGGAA TACCATGAGC AGATAGAGAG ATATGATGGC
TTGTGCGGAG GATTTGTTTG GGAGTGGTGC GACCATGCTG TGTACATGGG AAGAACTGTA
GATGGCAGGA AAAAGTATCT CTATGGCGGA GATTTCGGAG AGTTTCCCCA TGACGGCAAT
TTTTGTGTTG ATGGGTTGGT CTATCCTGAC AGAAGAGTTC ATACTGGATT GCTTGAATAT
AAAAATGTGC TGAGACCGCT AAGACTTGTA AATGAAAACT CTAAAGATGG CAAATTCACC
TTCAAAAACA TGCTGGATTT TATCAATACA AAGGACTTCC TGTTTATTTC CTATGAAGTA
ACCAAAAATG GAGAGGTTAT TTTAGACGGG GTTATAGACG AGCCCGGTTT ACTTGACATA
CAGCCTCATG AGAAGAAGGA TATATGCCTG ACACTTCACG GTGTGGAAGA AGGGGATTGT
TATATAAAAT TTGATTATAT ACAAAAGTAT GATACACCGT TTGTCGGAAG TGGTCATTTA
CTTGGTTTTG ATCAGGTTAA GCTGGCTGTT GACAAGGTTG GTTTAAAGGA GGATAAAATA
ACCTGTATCC TCGGAGAACC GGATTCAAAG GAAGGAGAAA TTTCGGTAGT AGAATCAGAC
AGAAACGCTA TAATAATAGG AAATAATTTC CGTTACACAT TCAATAAAAT GACAGGTGTA
TTTGATAAGC TGATATACAA AAACAATGTA ATTCTTGATA AGCCAATGAA CTACAATATA
TGGAGAGCAC CGACCGACAA TGATAGAAAC GTTCGGCATA AATGGGAAGA GGCAGGTTAT
GACAGAACAC TCTCCAGATC ATATAATACC CAAGTATTCG AGGAAAACGG CAATGTAAGG
ATAATAACGG AACTGTCCCT TTTGGCAGTC CACATCCAGA GAATAATGAG TATCAGTACA
GAGTGGAATA TAGCAGAGAA TGGCCTTATA TCGGTTAACA TACAGGCTGA AAGAAATATG
GAAATGCCGT TTCTGCCACG TTTTGGTCTG AGACTCTTTC TTCCCGAGTA CATCAGAAAT
GTTGAATATT TTGGATATGG CCCCCACGAG AGCTATGCTG ATAAGAGAAG GTCATCATAC
GTGGGACGTT TTAAGTCAAA TGTGGGAAGG ATGCATGAGG ATTACCTGAA ACCACAGGAA
AATGGCAGTC ACTGGGGGTG CCATTATGTA AAGTTGGCCT CGAATTGTGG TCTTGGGCTG
CTGGTTACTG GAGATGAAAC TTTTAGTTTT AATGCATCCT ACTATACTCA GGAGGAATTG
ACAAGAAAAA GCCACAACTT TGAGTTGGAG AAGTGCGGAA GTACTGTACT TTGCGTAGAC
TACGCCCAAA GCGGGATTGG TTCTAACAGC TGCGGACCAG AGCTTATGGA GAAGTACCGT
TTCAATGCAC AAAGATTTCA TTATACAATG TTCTTAAGAC CTTTTATGGA GCAGCCTAGT
ACTGCTATTG TAAAAGCAGC TCATAATACG TAA
 
Protein sequence
MIIENYFENP KILHVGTMEK RAYYVPYPPE DVNKSSDRNK SERLHLLNGE WDFLYLKSVY 
NITDEFYLPG YDRAGFDKIP VPSVWQNHGY DRHQYTNIKY PFPYDPPHVP VDNPCGVYVR
EFYADTSWNG MRKYINFEGV DSCFYLWING KFTGYSQVSH STSEFDITDY VHEGVNTIAV
LVLKWCDGSY FEDQDKFRTS GIFRDVYILI RPENHIRDFF VRTSLKEDYK KAEIKVDMDI
FGSIPGIDYK LTDNSTGNIE AQGNANGKSI SIEMDNPKLW SAEIPYLYTL IIRSDYEAIS
VQVGIREIKA EDGIVYLNGQ KIKFKGVNRH DSDPVKGPAV DVKDMIKDLM LMKQHNINAI
RTSHYPNSPL FTELCDKYGC YVIAEADIEA HGTTAVYGGG QDGKSFPMLA HDSEFEDAVI
DRVESCVIRD KNHPCVVIWS LGNESGYGKN FESALDWLKG YDNSRLTHYE SSLYPPEGYD
ADYSSLDLYS RMYASCEDII EYFQGDNVSK PLILCEYSHA MGNGPGDLEE YHEQIERYDG
LCGGFVWEWC DHAVYMGRTV DGRKKYLYGG DFGEFPHDGN FCVDGLVYPD RRVHTGLLEY
KNVLRPLRLV NENSKDGKFT FKNMLDFINT KDFLFISYEV TKNGEVILDG VIDEPGLLDI
QPHEKKDICL TLHGVEEGDC YIKFDYIQKY DTPFVGSGHL LGFDQVKLAV DKVGLKEDKI
TCILGEPDSK EGEISVVESD RNAIIIGNNF RYTFNKMTGV FDKLIYKNNV ILDKPMNYNI
WRAPTDNDRN VRHKWEEAGY DRTLSRSYNT QVFEENGNVR IITELSLLAV HIQRIMSIST
EWNIAENGLI SVNIQAERNM EMPFLPRFGL RLFLPEYIRN VEYFGYGPHE SYADKRRSSY
VGRFKSNVGR MHEDYLKPQE NGSHWGCHYV KLASNCGLGL LVTGDETFSF NASYYTQEEL
TRKSHNFELE KCGSTVLCVD YAQSGIGSNS CGPELMEKYR FNAQRFHYTM FLRPFMEQPS
TAIVKAAHNT