Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4288 |
Symbol | |
ID | 8335642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4862872 |
End bp | 4864134 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957391 |
Product | cellulose-binding family II |
Protein accession | YP_003114993 |
Protein GI | 256393429 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.016596 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCGCA ATCGCGCTTT CCTACGCAGC GTCCGCGTTC TCGCGGTCGC GGCTGCGGCT GTGCTGGCCA TGTCGGCCAT GTTCGTACTC GGATCCAGAC CCGCCGAGGG CGCCGGGACC GGCTGCACGG TCGCCTATCA GGTGAATCAG TGGAACACCG GCTTCACCGC CGATATGACG GTCACCAACG GCGGTCCGGC GGTGAGCGCG TGGACGGCGA CCTGGTCGTG GACCGGGAAC CAGCAGGTCA CGTCCGGATG GAACGCGCAG GTCAGTCAGA GCGGTCAGCA GGTCACGGCG ACCAACGAGC CGTACAACGG CTCGGTCGCG GCCGGGGCGA CGGTCGCCTT CGGGTTTCAG GGCACGTATT CCGGCACGAA CAGCGCGCCT GTGAACTTCG CGTTCAACGG CGTGGCGTGC AGTGGCCTCG GCACGCCGAG CTCGCCGTCG AGCACGCCTT CCACGCCGCC TTCCAGCTCG CCCTCTAGCT CACCGTCCAC CTCGCCCTCT ACTTCACCCT CCAGTTCACC GTCCAGCTCG CCGTCCAGTT CGCCGAGCAG CGGACCGTGC CCGGCGACGG CAGTCTTCTG CGACGGCTTG GAGAATCAGA GCTCGACCAC ACCGTCCGGC CGCTGGTCGA TCGCCACCCC GAGCTGTTCG GGGACCGGCA CCGCCGCGAT CACCACCGCG CAGGCGCACG CCGGGCTGAA GTCCCTGGAG ATCGACGGTC GCGGCACGTA CTGCGACCAC GTCTTCGCCA GCGACACCAC CGACATGGCG AGCGCCGCGC CGACCTGGTA CGTCCGATTC TGGATCAAGC ACACCGCGCC GCTGCCGACC AACCACACGA CCTTCCTGGC GTTGAACGAC TCCGCGCACG GCAACACCGA CCTGCGGCTC GGAGCCCAGA ACGGCGCGCT GATGTGGAAC CGGCAGTCCG ACGACGCCAC GCTGCCGGAC CAGAGCCCGA ACGGGGTCGC GCAGAGCGTC ACACTGCCGA CCGGAGCCTG GGAATGTCTG GAGTTCTCCG TGAGTGGCAG CAACGGCCAG ATCCACACCT GGTACAACGG CAGCCCGGTC GCGGGCCTGA CCGAGGACGG CGTGGCGACG CCGGACACCG ACGACCAGTG GCTGGCTGGG AGCGGAGCGT CCTGGCGTCC GCAGCTCACC GATCTGAAGC TCGGCTGGGA GAACTACAGC AACGGCGACG ACACGCTGTG GTTCGACGAC GTGGTGTTGA GCACGAGCCG GATCGGATGT TGA
|
Protein sequence | MYRNRAFLRS VRVLAVAAAA VLAMSAMFVL GSRPAEGAGT GCTVAYQVNQ WNTGFTADMT VTNGGPAVSA WTATWSWTGN QQVTSGWNAQ VSQSGQQVTA TNEPYNGSVA AGATVAFGFQ GTYSGTNSAP VNFAFNGVAC SGLGTPSSPS STPSTPPSSS PSSSPSTSPS TSPSSSPSSS PSSSPSSGPC PATAVFCDGL ENQSSTTPSG RWSIATPSCS GTGTAAITTA QAHAGLKSLE IDGRGTYCDH VFASDTTDMA SAAPTWYVRF WIKHTAPLPT NHTTFLALND SAHGNTDLRL GAQNGALMWN RQSDDATLPD QSPNGVAQSV TLPTGAWECL EFSVSGSNGQ IHTWYNGSPV AGLTEDGVAT PDTDDQWLAG SGASWRPQLT DLKLGWENYS NGDDTLWFDD VVLSTSRIGC
|
| |