Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4285 |
Symbol | |
ID | 8335639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4859078 |
End bp | 4860496 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957388 |
Product | cellulose-binding family II |
Protein accession | YP_003114990 |
Protein GI | 256393426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.125188 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCAGCC CCAGACCGGC ACCACCGTCC CGCCGCCGCC GCGCCGTCGT GCGCACCGCC GTGGCCGCCG CCGCCGCGCT CGTCGTCGCC CTGACCGCCG CGGTGCTCAC CAGCCCGGCG CAGGCGCTCG GCGTGGGCGG CACGTCGTCC CCGGTGACCG GGAACGCGAC CCACTTCGAC GGGCTCGGGG CGCCCTACGG CGGCTGCGGC GTGCCGCAGG CGAACCTGGA CTCGCAGGAC TTCATCGCCC TCAACGTCTT CAACACCCCG GGGAACTACA ACCAGTTCAC GCGTCCGGTG CCGCCGGCAC AGGCGAGCAT CCTGGGCATG TTCGACAACG GTCTGAACTG CGGCCGCTGG GTGAAGGTCA GCATCGGCGA CCTGTGCACC GGCACCAACG ACGGCGCGCC GAACCAGCCG TTCTGCCGCA ACGGCTCGTG GGTCGCCGAC AAGTACAACG GCGCGACGCT GAACATGCTG GTCGCCGACA GCTGCGCGGA CTCCAACGCC TGGTGCCGTG ACGATCCGTA CCACATCGAC CTGCACACCG ACTCGATCAA CCGCTTCCAG CTCAACGGCT CCGCGGTCGG CGACCTGCTG AACCACTGGA ACAACCGCCA GGTGAGCTGG CAGTTCATCA GCGCTCCGGG CTACAGCGGC GACATCAACA TCGGTTTCAT GCAGGGCGCG CAGGTGTACT GGCCGGCGAT CTCGGTCTCG CACCTGGCGA ACGGGATCCA CGGCGTGCAG TACCTGTCGG CGTCCGGGAC CTGGGTCTCG GCGGCGATGG ACAGCGACAT GGGGCAGTCG TACATCATCG CGCCGACCGC GACCGCCGGC TCCAGCTACC AGATCCGGGT GACCGACGCC TCGGACAATC TGATCAACGG CGGTCAGGTC TACAGCTTCT CGCTGCCGTC CTCCTGCGGG GGCAGCTGCA GCGCGGCCTA CACACAGGTC CCGTACACGA CGACGCCGGG TTCGGGACCG AGCTCGCCGA GTTCGAGTCC CAGCACGACG CCGAGTACCA GCGCCTCGAG TCCGTCCACT CCGCCGAGTT CGCCGAGCTC ATCGGGACCG TCGACCCCGC CGAGCTCGCC CTCCAGCTCC GCACCCGCCT CCGGCTGCTC GGTCACCTCC TCGGTGACAG GTTCCTGGTC CAGCGGCTAC CAACTCGCGT TCACGGTCAC CAATACCGGC AAGGTCGCCT CCTCGCAGTG GGCGGTGCGT TTCTCCTTCG CCGGAAGCCA GACGATCGCC AACTCCTGGA ACGTGACCGC CACGCAATCC GGGCAGGCGG TGACCGCGAA CTCCGTGTCG TACAACGGGT CCCTGGCACC GGGTGCGGCG ACGTCGTGGG GCATGGTGGT CAACGGCGCG AACCAACCGC TCGGCGGCAT TTCCTGTGTC GCGAGCTGA
|
Protein sequence | MRSPRPAPPS RRRRAVVRTA VAAAAALVVA LTAAVLTSPA QALGVGGTSS PVTGNATHFD GLGAPYGGCG VPQANLDSQD FIALNVFNTP GNYNQFTRPV PPAQASILGM FDNGLNCGRW VKVSIGDLCT GTNDGAPNQP FCRNGSWVAD KYNGATLNML VADSCADSNA WCRDDPYHID LHTDSINRFQ LNGSAVGDLL NHWNNRQVSW QFISAPGYSG DINIGFMQGA QVYWPAISVS HLANGIHGVQ YLSASGTWVS AAMDSDMGQS YIIAPTATAG SSYQIRVTDA SDNLINGGQV YSFSLPSSCG GSCSAAYTQV PYTTTPGSGP SSPSSSPSTT PSTSASSPST PPSSPSSSGP STPPSSPSSS APASGCSVTS SVTGSWSSGY QLAFTVTNTG KVASSQWAVR FSFAGSQTIA NSWNVTATQS GQAVTANSVS YNGSLAPGAA TSWGMVVNGA NQPLGGISCV AS
|
| |