Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1972 |
Symbol | |
ID | 7310686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2333201 |
End bp | 2334706 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608906 |
Product | glycoside hydrolase family 43 |
Protein accession | YP_002506300 |
Protein GI | 220929391 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA AATTAACCGC TTTATTTATG GTTATCTTTA CCGTTCTTGT ATTAGGCACT TCTGTAGCCT TTGCCGAAAT TGGCATAATA AATAACGGCA CAAATTTTGG AGGCGTGCAG GCACACGGGG GCAGTATCAT CAAGGTAGGC GAAACATATT ATTGGTATGG AGAAAACCGT GATACAAATA ACCTGTTTGT ATCTGTGAAG GTTTACTCTT CAACTGATCT TGTGAATTGG ACAGATATGG GTAATGCCTT ATCCAGAACA TCTGCGGCAG AGCTTAACTC CTGCAATATA GAAAGACCGA AAGTTATGTA TAATGCATCA ACAAACCAAT ATGTAATGTG GATGCATTAC GAAAACGGCA GGGACTATAC ATTGGCAAGG GCGGCTGTAG CATACAGTAC AAGCCCGACT GGTCCGTTTA CTTATATTGG AAGTTTTAGA CCCAATAATA ACATGTCAAG AGATTGCACC ACCTTTGTGG ATACTGATGG TACAGGATAT TTTATCTCTG CTGCAAACGA AAATGCCGAT TTGGTAGTTT ACAGGCTTAC ATCGGATTAT AAGTCAGTAG CCTCACAAGT GGTAACTCTT TGGCCGGGAC AATATCGAGA AGCTCCATGT ATGTTTAAAC GTGATAATAT CTATTACCTG ATAACGTCAG GATGTACCGG GTGGGGTCCA AATCAGCAGA AATATGCTAC TTCCACATCA ATATCTTCAG GCTGGACAGG TTTAACCAAT CTAGGTAATT CAACATGCTA TGACTCTCAA GGTGCATGCG TTTTTCCGGT TGGTCGAAAC TTTGTATTAC TTACCGATAG ATGGGCAGGT GCGTGGGGAG GAAAAGTAAA TGATTCGTCA TACCTGATGA TGCCTTTACT TATTAACGGA ACCTCAGTAT CTCTAAACTA CACAGATACG GTAAAGGTTG CAGCTGAATT TGGTGCGATT GCAAACGACT ATCACAGGTA CACCTTCGAA GGAGTAAAGA CAAACGCTGA CCCGTACCAA TGGAACATAA GTCAACCTGC ATCAACAGAT ATACAAGTAC TTTCTGTAGA TGGAGACAAG GCTCTTAAAA TGTCAGACAG CACAACATCC GGCTATTGCT ATGCATACAA GGATTTCCCT GCTAAAAAAG GTATTGTTAC ATTTAATGTA GGTTTTAAAT TTGAGAATAC CGGAAGATGG GATAGAATCA GACTTTACAA TGGAAATAAC ATAGGAATAG ACATTATAAA TTTTGAAGAA GGGCTTGGTT GGATTACAGG AACTTCAACA AGAAATGTAT TACAAAAAGT TTCTTCAAAC ACATGGTATG ATTTAAAAAT TGTTGCCGAT ACTTCTAATC AGAAATTTGA TGTATACATT AATAATATGC TGATTAAAAC AGGTTGTGCA TTTACAGGTT CGATTACTGA GTTTGACAGA ATTGCACTGG ATACAGGAAA TAGTTTTACA AATACGGCTG TATATGACGA TATAATAATT AAGTAA
|
Protein sequence | MIKKLTALFM VIFTVLVLGT SVAFAEIGII NNGTNFGGVQ AHGGSIIKVG ETYYWYGENR DTNNLFVSVK VYSSTDLVNW TDMGNALSRT SAAELNSCNI ERPKVMYNAS TNQYVMWMHY ENGRDYTLAR AAVAYSTSPT GPFTYIGSFR PNNNMSRDCT TFVDTDGTGY FISAANENAD LVVYRLTSDY KSVASQVVTL WPGQYREAPC MFKRDNIYYL ITSGCTGWGP NQQKYATSTS ISSGWTGLTN LGNSTCYDSQ GACVFPVGRN FVLLTDRWAG AWGGKVNDSS YLMMPLLING TSVSLNYTDT VKVAAEFGAI ANDYHRYTFE GVKTNADPYQ WNISQPASTD IQVLSVDGDK ALKMSDSTTS GYCYAYKDFP AKKGIVTFNV GFKFENTGRW DRIRLYNGNN IGIDIINFEE GLGWITGTST RNVLQKVSSN TWYDLKIVAD TSNQKFDVYI NNMLIKTGCA FTGSITEFDR IALDTGNSFT NTAVYDDIII K
|
| |