Gene Information |
Locus tag | Ccel_1013 |
Symbol | |
ID | 7309838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1258840 |
End bp | 1261941 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643607940 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_002505355 |
Protein GI | 220928446 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
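The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Ccel_1013.
start_bp, end_bp = 1258840, 1261941

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3102
print(protein_length(length_bp))  # 1033
```

Both results agree with the Gene Length (3102 bp) and Protein Length (1033 aa) fields.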
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.188379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGAG AATGGGAAAA TCAGTATATA ACACAAATAA ACAGGTATCC GATGCACTCG CCATATGGAG CTTATGAATC TGTTGAACAG GCCATGAGCT GTAACCGCTG GACTTCAAAA TATGTAAAAA GCCTGAGCGG GATATGGAAG TTCAAGCTTG CCCAAAATCC ACAGCAGGCA CCGGAAAATT TCTATGCATT AAATTATGAT GTTTCCGACT GGGATGATAT ACCCGTACCT TCAAACTGGG AGCTGCATGG GTATGGCAAA CCGGTTTATA CCAATATTAT ATATCCATTT AAGAGGGAAG GAGTCGGATC GCACTATGAG ATAGAGGTAG CGGAGGGACA GGTAGAGCTT AATGCACCTC TGGTACCTGA AAAAAACTTG ACAGGCTGTT ACCGAACCGA TTTTGAAGTG CCGGATTATT TTGAAGGAAA GGATGTATTT ATAGAATTCG GAGGTGTAGA GTCCTGTTTT TACCTTTGGG TAAACGGTAC GGAAATAGGC TTTTCCAAGG ATAGCAAGCT GGATGCTTCC TTTGATATCA CACATGCCGT ACATAGTGGA AAAAATGAAC TGGCAGTAAA GGTACTACAA TACTGTGACG CCTCGTACCT TGAGGATCAG GACTACTGGC ACTTGTCAGG AATATACCGG GATGTGAGAA TTTATGCAAA GAACAAACAA CGGCTGCTTG ACTACAAGGT AGAAACTCTG TTTAAGGACA ATAATTTTAA AGAAGCCGAA CTGAGAGTAA TGCTTCAGCC AAATAATAGA GTCAGGGGTT ATGGAGAATC TTATGTGAGA TTGAGTCTTT ATAACGCTGA AAATGAGTTG GTGACTGCTT TTCAGAGTCA ACCCTATGCC AAATGCGGAG TATATCTGGA GTCTAAATTT ACTGCTTTTC CGTCTGCATT TGTGAAAAAT CCCCATTTAT GGTCTGCTGA GGAGCCGTAT TTATATACTT TGGTTTTGGA GACGGTAGAC AAAAACGGAT GTGTCACAGA TATTGAAAGT ACAAAAGTAG GCTTCCGTAA AGTAGAAATA GGCAGGGACG GTGTGTTTTA CCTGAACGGC AGGAGACTAC TGGTGCGGGG AGTGAATTTA CACGAATTCT GTCCCGAAAC TGGAAGATAT GTTTCGATGG AATATATGAG ACAGCAGATT TTATCCATGA AACAGATGAA TTTCAATGCC GTTCGAACCA GCCATTATCC CCATGCAAGT GAATGGTATG AGCTTTGTGA TCAGCTGGGA ATATATGTTG TTGATGAGGC CAATCTTGAA ACCCATGGTT TTGGCGGACA GTTAAGTTCG TCCGCAGAAT GGACTGCGGC ATACTTGGAA CGTGCTACTC GAATGGTACT CAGAGATAAG AATCATCCTT CTATTATCAT CTGGTCGCTG GGGAACGAAT CTGGGGCAGG TGCTAACCAT GCGGCTATGT ATGGTTGGAT AAAAGAATAT GACAAAACAA GGTATGTTCA GTATGAATCC TGCAACCCTG AAAGCAATAT TACCGATATT ATCTGTGCAA TGTATCCAGC AAAAGACTGG GTGGAGCAAA AGATGGCAGA GCATGATGAC CTGAGACCCT TCATAATGTG TGAATATGCC TATGCCAAAA GCAACAGCAA TGGCAATTTT AAACTATATT GGGACTTGGT GGAGAAGTAT CCACGTTTCC AGGGCGGGTT CATATGGGAC TTTCAAGATA AGGCACTTGT AAAGCAGCAG GAAAACGGGA CCTCAAGATA TGTATATGCA GGAGCTTTTA ATGAGGAGGT AGTTGACCCC 
ATAGAGGATA TGTGTTTGAA TGGAGTGGTG CTGCCGGATT TGGACTGGAA ACCAGCTGCA TTTGAAATTA GGAATTGCCA GGCCCCGGTG ACAATCTTCC ATGATGGAAG CACATATCCT GAAGCTGGTG ACTATAAAAT AAAGAATAAC TATATGTTTA AGGATTTGAG TCATTTATAC CTTACCTGGG AGCTACTGTG TGACGGAAAA ATCGTGGATA GAGGAGTTAT AAAGCAGTAT TTTACGGCTC CGGGACAGTC GGATATTCTG GAATATGCCT TGAAACCTGA GAAAATATCT GGTGAAGCTT TTGTAAATAT CACAGTCTCA TTAAATGAGG ATGTACCGTA TGCAAAAAAG GGACATGTTA TTTATGCATA TCAAATCCCA TTGCAGGAGT CGGTGTTAAA AATGCAGAAG GTATGTATTA ACCATAATAA ACTGACAATG CATGAAACTG CCGGAGAAAT ATTGATACTG GGAGAAAATA CTGAGGTTCG TTTTGACAAG GAAAGCTGTA TATTCACAAA AGCTGTTTTT AACGGAATTG AGAGCTTTTG GGGGGGCCGG GATAATTTTT ACCGGGCTTC AACGGGAATT GACGAAGGGT GCAGGAATCC CGGCGGCAAT TACTCAACTG ACTGGAAGGA CTGGGGCTTA AATGATTTGA AAATAAAGGA CATAAAAGTA GATACTGCTG TTTCTGAATC ACAAATATTT ATATTTACAA ACGTGTCATA CAATGATGAT AAATTGATTG TTTCCACACA GTACAGGATT GGCAGTAGAG GTATGGAGGT TGATAAAACC GTCATTAATA ATTGTCCCGG TGAGACAATT CCAAGAATCG GATTCTCATT AATGCTTCCG GAGGATAAAC AGCAAATAAC TTGGTATGGA AGAGGACCTT GGGAGAATTA CGCCGACAGA AAGGAAGCAG CCCTTATAGG GTGTTACAGC AGTACCGTCC CGGAACAGTA TACACACTAT GTGAAGCCCG TGGAATGCGG CGGAAAAGAA GATGTGAGAT ACATCATTGT ACAGGATAAG GGGGGACATG GCCTACGCGT TTCAGGCACC GTACCTTTCC ATTTTGATAT TCATGATTAC TCTGTGGAGG CCTGCGATAA TGCAATGTAT GAGCATGAAC TAATAAAGGA TAAATACGTA CACCTAAATA TAGATCATTT ACATGCCGGC TTGGGCGGAG ACAACGGCTG GTCAAAAAAT ATTCATACGG AGTACCGTAT TGGAAAAGGT TGTTATCATT ATCAGATATT AATAGAAGTG GTTGACAATT AA
|
Protein sequence | MSREWENQYI TQINRYPMHS PYGAYESVEQ AMSCNRWTSK YVKSLSGIWK FKLAQNPQQA PENFYALNYD VSDWDDIPVP SNWELHGYGK PVYTNIIYPF KREGVGSHYE IEVAEGQVEL NAPLVPEKNL TGCYRTDFEV PDYFEGKDVF IEFGGVESCF YLWVNGTEIG FSKDSKLDAS FDITHAVHSG KNELAVKVLQ YCDASYLEDQ DYWHLSGIYR DVRIYAKNKQ RLLDYKVETL FKDNNFKEAE LRVMLQPNNR VRGYGESYVR LSLYNAENEL VTAFQSQPYA KCGVYLESKF TAFPSAFVKN PHLWSAEEPY LYTLVLETVD KNGCVTDIES TKVGFRKVEI GRDGVFYLNG RRLLVRGVNL HEFCPETGRY VSMEYMRQQI LSMKQMNFNA VRTSHYPHAS EWYELCDQLG IYVVDEANLE THGFGGQLSS SAEWTAAYLE RATRMVLRDK NHPSIIIWSL GNESGAGANH AAMYGWIKEY DKTRYVQYES CNPESNITDI ICAMYPAKDW VEQKMAEHDD LRPFIMCEYA YAKSNSNGNF KLYWDLVEKY PRFQGGFIWD FQDKALVKQQ ENGTSRYVYA GAFNEEVVDP IEDMCLNGVV LPDLDWKPAA FEIRNCQAPV TIFHDGSTYP EAGDYKIKNN YMFKDLSHLY LTWELLCDGK IVDRGVIKQY FTAPGQSDIL EYALKPEKIS GEAFVNITVS LNEDVPYAKK GHVIYAYQIP LQESVLKMQK VCINHNKLTM HETAGEILIL GENTEVRFDK ESCIFTKAVF NGIESFWGGR DNFYRASTGI DEGCRNPGGN YSTDWKDWGL NDLKIKDIKV DTAVSESQIF IFTNVSYNDD KLIVSTQYRI GSRGMEVDKT VINNCPGETI PRIGFSLMLP EDKQQITWYG RGPWENYADR KEAALIGCYS STVPEQYTHY VKPVECGGKE DVRYIIVQDK GGHGLRVSGT VPFHFDIHDY SVEACDNAMY EHELIKDKYV HLNIDHLHAG LGGDNGWSKN IHTEYRIGKG CYHYQILIEV VDN
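The gene and protein sequences can be cross-checked by translating the DNA with the amino-acid assignments of translation table 11. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, only the first 30 bases of the gene sequence are used as sample input, and the special handling of alternative start codons in table 11 is ignored (this gene starts with ATG, so that does not matter here). Whitespace is stripped first because the record prints sequences in blocks of ten.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene and first block of the protein.
gene_prefix = "ATGAGCAGAG AATGGGAAAA TCAGTATATA"
protein_prefix = "MSREWENQYI"

print(translate(gene_prefix) == clean(protein_prefix))  # True
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.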
|
| |