Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0154 |
Symbol | |
ID | 7312065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 177659 |
End bp | 180751 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643607083 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_002504522 |
Protein GI | 220927613 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAATAG AAAATTATTT TGAAAACCCT AAAATACTGC ATGTGGGTAC TATGGAGAAA AGGGCTTACT ACGTACCATA CCCACCGGAA GATGTCAATA AAAGCAGTGA CAGGAATAAG TCCGAAAGAC TTCACTTATT AAACGGAGAG TGGGATTTTC TCTATTTAAA AAGTGTATAC AATATAACGG ATGAATTCTA CTTACCCGGA TATGACAGAG CAGGATTTGA CAAGATACCT GTTCCTTCAG TGTGGCAGAA CCATGGGTAT GACAGACATC AGTATACAAA TATAAAATAT CCTTTTCCCT ATGATCCTCC ACATGTACCG GTAGATAACC CTTGTGGTGT TTATGTCAGA GAGTTTTACG CAGACACCAG TTGGAACGGG ATGAGGAAGT ATATCAATTT TGAAGGAGTT GATTCATGCT TCTACCTCTG GATAAATGGT AAATTTACAG GTTACAGTCA GGTTTCCCAT TCTACCAGTG AATTTGATAT CACAGACTAT GTACATGAAG GAGTAAACAC CATAGCAGTT CTTGTGCTAA AATGGTGTGA TGGGAGTTAT TTTGAAGATC AGGACAAGTT CAGGACATCT GGTATATTCA GAGATGTATA TATTCTTATT AGGCCCGAGA ATCATATAAG GGACTTTTTT GTAAGAACTT CACTGAAAGA GGATTATAAA AAGGCCGAAA TAAAAGTGGA TATGGACATC TTCGGAAGTA TTCCTGGCAT AGACTATAAA CTTACAGATA ATAGCACTGG TAATATTGAA GCACAAGGTA ATGCAAATGG AAAAAGTATA AGTATTGAAA TGGATAATCC GAAATTGTGG AGTGCGGAAA TCCCTTATCT TTATACCCTA ATTATTCGAT CGGATTACGA GGCTATTTCT GTACAAGTAG GAATACGTGA AATAAAAGCT GAGGACGGTA TTGTTTATCT TAACGGGCAA AAGATTAAAT TCAAGGGGGT CAACCGACAT GACTCCGACC CAGTAAAGGG GCCCGCTGTG GATGTTAAAG ATATGATAAA AGACTTGATG CTTATGAAAC AGCACAATAT AAATGCCATT CGCACAAGTC ATTACCCAAA CTCACCTTTG TTTACAGAGC TTTGTGATAA ATATGGCTGC TATGTTATTG CAGAAGCAGA TATTGAAGCT CATGGCACAA CAGCGGTTTA TGGAGGTGGT CAAGATGGAA AATCATTTCC AATGCTGGCA CATGACTCTG AATTTGAAGA TGCTGTTATA GACCGCGTTG AGAGTTGTGT TATACGGGAT AAGAATCATC CCTGTGTTGT TATATGGTCA CTTGGGAATG AGTCCGGATA TGGCAAAAAT TTTGAAAGTG CTTTGGACTG GCTTAAAGGT TACGATAACA GCAGACTTAC CCATTATGAA AGCTCACTAT ATCCACCGGA GGGTTATGAT GCTGATTATA GCAGCTTGGA TTTGTATAGC CGTATGTATG CCTCCTGTGA AGATATAATA GAATATTTCC AGGGAGATAA CGTTTCTAAA CCTCTTATTT TATGCGAATA CTCCCATGCC ATGGGTAACG GCCCGGGAGA CCTTGAGGAA TACCATGAGC AGATAGAGAG ATATGATGGC TTGTGCGGAG GATTTGTTTG GGAGTGGTGC GACCATGCTG TGTACATGGG AAGAACTGTA GATGGCAGGA AAAAGTATCT CTATGGCGGA GATTTCGGAG AGTTTCCCCA TGACGGCAAT TTTTGTGTTG ATGGGTTGGT CTATCCTGAC AGAAGAGTTC ATACTGGATT GCTTGAATAT AAAAATGTGC TGAGACCGCT AAGACTTGTA AATGAAAACT CTAAAGATGG CAAATTCACC TTCAAAAACA TGCTGGATTT TATCAATACA AAGGACTTCC TGTTTATTTC CTATGAAGTA ACCAAAAATG GAGAGGTTAT TTTAGACGGG GTTATAGACG AGCCCGGTTT ACTTGACATA CAGCCTCATG AGAAGAAGGA TATATGCCTG ACACTTCACG GTGTGGAAGA AGGGGATTGT TATATAAAAT TTGATTATAT ACAAAAGTAT GATACACCGT TTGTCGGAAG TGGTCATTTA CTTGGTTTTG ATCAGGTTAA GCTGGCTGTT GACAAGGTTG GTTTAAAGGA GGATAAAATA ACCTGTATCC TCGGAGAACC GGATTCAAAG GAAGGAGAAA TTTCGGTAGT AGAATCAGAC AGAAACGCTA TAATAATAGG AAATAATTTC CGTTACACAT TCAATAAAAT GACAGGTGTA TTTGATAAGC TGATATACAA AAACAATGTA ATTCTTGATA AGCCAATGAA CTACAATATA TGGAGAGCAC CGACCGACAA TGATAGAAAC GTTCGGCATA AATGGGAAGA GGCAGGTTAT GACAGAACAC TCTCCAGATC ATATAATACC CAAGTATTCG AGGAAAACGG CAATGTAAGG ATAATAACGG AACTGTCCCT TTTGGCAGTC CACATCCAGA GAATAATGAG TATCAGTACA GAGTGGAATA TAGCAGAGAA TGGCCTTATA TCGGTTAACA TACAGGCTGA AAGAAATATG GAAATGCCGT TTCTGCCACG TTTTGGTCTG AGACTCTTTC TTCCCGAGTA CATCAGAAAT GTTGAATATT TTGGATATGG CCCCCACGAG AGCTATGCTG ATAAGAGAAG GTCATCATAC GTGGGACGTT TTAAGTCAAA TGTGGGAAGG ATGCATGAGG ATTACCTGAA ACCACAGGAA AATGGCAGTC ACTGGGGGTG CCATTATGTA AAGTTGGCCT CGAATTGTGG TCTTGGGCTG CTGGTTACTG GAGATGAAAC TTTTAGTTTT AATGCATCCT ACTATACTCA GGAGGAATTG ACAAGAAAAA GCCACAACTT TGAGTTGGAG AAGTGCGGAA GTACTGTACT TTGCGTAGAC TACGCCCAAA GCGGGATTGG TTCTAACAGC TGCGGACCAG AGCTTATGGA GAAGTACCGT TTCAATGCAC AAAGATTTCA TTATACAATG TTCTTAAGAC CTTTTATGGA GCAGCCTAGT ACTGCTATTG TAAAAGCAGC TCATAATACG TAA
|
Protein sequence | MIIENYFENP KILHVGTMEK RAYYVPYPPE DVNKSSDRNK SERLHLLNGE WDFLYLKSVY NITDEFYLPG YDRAGFDKIP VPSVWQNHGY DRHQYTNIKY PFPYDPPHVP VDNPCGVYVR EFYADTSWNG MRKYINFEGV DSCFYLWING KFTGYSQVSH STSEFDITDY VHEGVNTIAV LVLKWCDGSY FEDQDKFRTS GIFRDVYILI RPENHIRDFF VRTSLKEDYK KAEIKVDMDI FGSIPGIDYK LTDNSTGNIE AQGNANGKSI SIEMDNPKLW SAEIPYLYTL IIRSDYEAIS VQVGIREIKA EDGIVYLNGQ KIKFKGVNRH DSDPVKGPAV DVKDMIKDLM LMKQHNINAI RTSHYPNSPL FTELCDKYGC YVIAEADIEA HGTTAVYGGG QDGKSFPMLA HDSEFEDAVI DRVESCVIRD KNHPCVVIWS LGNESGYGKN FESALDWLKG YDNSRLTHYE SSLYPPEGYD ADYSSLDLYS RMYASCEDII EYFQGDNVSK PLILCEYSHA MGNGPGDLEE YHEQIERYDG LCGGFVWEWC DHAVYMGRTV DGRKKYLYGG DFGEFPHDGN FCVDGLVYPD RRVHTGLLEY KNVLRPLRLV NENSKDGKFT FKNMLDFINT KDFLFISYEV TKNGEVILDG VIDEPGLLDI QPHEKKDICL TLHGVEEGDC YIKFDYIQKY DTPFVGSGHL LGFDQVKLAV DKVGLKEDKI TCILGEPDSK EGEISVVESD RNAIIIGNNF RYTFNKMTGV FDKLIYKNNV ILDKPMNYNI WRAPTDNDRN VRHKWEEAGY DRTLSRSYNT QVFEENGNVR IITELSLLAV HIQRIMSIST EWNIAENGLI SVNIQAERNM EMPFLPRFGL RLFLPEYIRN VEYFGYGPHE SYADKRRSSY VGRFKSNVGR MHEDYLKPQE NGSHWGCHYV KLASNCGLGL LVTGDETFSF NASYYTQEEL TRKSHNFELE KCGSTVLCVD YAQSGIGSNS CGPELMEKYR FNAQRFHYTM FLRPFMEQPS TAIVKAAHNT
|
| |