Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3604 |
Symbol | |
ID | 8334957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4025343 |
End bp | 4028264 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644956746 |
Product | glycoside hydrolase family 48 |
Protein accession | YP_003114349 |
Protein GI | 256392785 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAAA AGTTGTCGCG CAGACAATTC GCGACCGCGG CCGGCGGAGC CGTCCTGGCG TCGGCCGTCG CACCGTCCAT GTCGCGGGCG GCGAGTGTCG CCCCGGCCGC GGCCTCGGCC GCCACGGATG CCTACACCCA GCAGTTCCTG ACCCAGTACA AGAAGATCAA GGACCCGGCG AACGGCTACT TCAGCGCGCA GGGGATCCCG TACCACAGTG TGGAGACGCT GATCGTCGAG GCGCCGGACT ACGGGCACCA GACGACCTCG GAGGCGTTCA GCTTCTGGAT GTGGCTGGAG GCGACGTACG GCCGGGTGAC CGGTGACTGG ACGGCGTTCA ACAACGCCTG GACGACGGCC GAGCACTACA TCATCCCGCA GCACGTCGAT CAGCCGAGCA ACAGCTCCTA CAACCCCAAC TCGCCGGCGA CCTACGCCCC GGAGTGGCCG GACCCCAGCA GCTACCCCAG CCCGCTCAAC ACCTCGGTGT CGGTCGGCCA GGACCCGCTG GCCAACGAGC TGACCTCGAC GTACGGCACG TCGGACATCT ACGGCATGCA CTGGCTGATG GACGTCGACA ACAAGTACGG CTACGGGAAC ACGCCCGGTA CCGGCGGGGA GGCCGGTCCC AGCGCGACCG GGCCGTCGTA CATCAACAGC TACCAGCGCG GCGCGGCGGA GTCGGTGTGG AAGACGATTC CGCAGCCGAC CACGGACCTG TTCAACTACG GCGGTCCGAA CGGATACCTC GACCTGTTCG TGGCGCAGTC CGGCTCCTAC TCCAAGCAGT GGAAGTACAC GACTGCCCCC GACGCCGACG CCCGCGCCAT CCAGGCCGCG TACTGGGCCT ACCGCTGGGC CTCGGCGCAG GGCGCGCAGG GCCAGATCGC CGCCTCGGTC GCGAAGGCCG CGAAGATGGG CGACTTCCTG CGCTACTCGC TGTTCGACAA GTACTTCAAG CAGATCGGGA ACTGCACGAA CGCCAGCTCC TGCGCGGCCG GCACCGGCCG TGGCTCCGAG CACTACCTGC TGGCCTGGTA CTACGCCTGG GGCGGGGCGG AGCCCGGCGG CGGCTGGGCC TGGCGGATCG GCGACGGCGC CGCGCACCAG GGGTATCAGA ACCCGCTGGC GGTGTGGGCC ATGACGAACA TCGCCGCGCT GACCCCCATG TCTCCGACCG CCAAGAGCGA CTGGACCGCC AGCCTGACCC GGCAGATGGA GTTCTACCAG TGGCTGCAGT CCGCCGAGGG CGCCATCGCC GGCGGCTGTA CGAACAGCTG GAACGGCTCG TACAGCGTGC CGCCGTCGGG TGACTCGACG TTCTACGGCA TGGCCTACGA CTGGGAGCCG GTCTACCACG ACCCGCCGAG CAACAACTGG TTCGGCATGC AGGCCTGGTC GATGGAGCGG CTCGCGGAGT TCTACTACGT CACCGGCAAC GCCACCGCCA AGACGATCCT GGGCAAGTGG ATCACCTGGG CTTCCTCGAA GACCACGGTC ACCGCCACCA ACTTCCAGAT CCCCTCCACG CTCGGCTGGA CCGGACAGCC GGACACCTGG AACCCGACGA GTCCGGGCGG CAACTCCGGG CTGCACGTGA CGGTCGCCGA CTACGGCAAC GACGTCGGCG TCGCGGCGGC GTACGTCAAG ACCCTGACGT ACTACTCCGC CAAGTCCGGC GACACCGCCT CCGGCGCCCT CGCCAAGAGC CTGCTCGACG CGATGGCGAC CTTCGCCGAC ACCGCCGGCA TCGCCACGCC CGAGACGCGC ACCGACTACA GCCAGTTCAA CGACACGGTG TACGTGCCCT CCGGCTGGTC CGGCAAGATG CCCAACGGCG ACCCGATCGC CCCCGGCGCC ACCTTCTTGT CGATCCGCTC GTGGTACAAG AACGACCCGG CCTGGCCGAA GGTGCAGGCC TACCTCAACG GCGGATCCGC CCCGACGTTC ACCTACCACC GCTTCTGGGC CCAGGCGGAC ATCGCCATGG CCTACGCGGT GTACGGCGAG CTGATCGCCG GCGGCGGCGG TACCGGCGGC GACACGACGC CGCCGAGCGT GCCGACCGGT CTGACCGTCA CGGGGACCAC CAGCAGCACC GCCTCGCTGT CGTGGACGGC TTCGACCGAC AACATCGGGG TGGCCGGCTA CACCGTGTAC CGGGGCACCA CCGTGGCCGG TTCCGCGACC ACGCCGACGT TCACCGACTC CGGACTGGCC GCCTCGACGC AGTACAGCTA CACGGTCACG GCCCACGACG CCGCCGGCAA CGTCTCCGCC GCCTCCGCCG CCGTCAAGGC CACCACCACC GCCGGGACCG GCGGCGGCGA CACGACGCCG CCGAGCGTGC CGACCAACCT GGCGGTCACC GCCACGACCA GCAGCAGCGT CTCGCTGTCG TGGACGGCCT CGACCGACAA CGTCGCGGTG ACCGGCTACA CCGTGTACCG CGGCACCACG GTGGCGGGCA CCACGACCTC GCCGACCTTC ACCGACTCCG GACTGACCGC CTCGACGCAG TACAGCTACA CGGTCACGGC CCACGACGCC GCCGGCAACG TCTCGGCAGC CTCCGCCGCC GTCAAGGGCA CCACCTCCGG AACCGGCGGG GGCACCGGCC CGACCTGCAC CGCTACCTAC AGCGTCACCA GCGACTGGGG CAACGGCTTC AACGGCAACG TCACCATCAC CAACACCGGG ACGACCGCGA CCAAGTCCTG GAAGGTCACC TGGACCTGGG GAGGCAACCA GACCATCACC AACACCTGGA ACGCCACCGA AACCCAGTCC GGCAAGGCCG TGACCGCGAC CAACGCCCCC TACAACAACG TCATCGCCCC CGGCGCCAGC ACCAGCTTCG GCTTCAACGC CAGCTACTCC GGCACCAACG GCGCGCCGAC GGTCACCGTC ACCGCTACGT GA
|
Protein sequence | MTKKLSRRQF ATAAGGAVLA SAVAPSMSRA ASVAPAAASA ATDAYTQQFL TQYKKIKDPA NGYFSAQGIP YHSVETLIVE APDYGHQTTS EAFSFWMWLE ATYGRVTGDW TAFNNAWTTA EHYIIPQHVD QPSNSSYNPN SPATYAPEWP DPSSYPSPLN TSVSVGQDPL ANELTSTYGT SDIYGMHWLM DVDNKYGYGN TPGTGGEAGP SATGPSYINS YQRGAAESVW KTIPQPTTDL FNYGGPNGYL DLFVAQSGSY SKQWKYTTAP DADARAIQAA YWAYRWASAQ GAQGQIAASV AKAAKMGDFL RYSLFDKYFK QIGNCTNASS CAAGTGRGSE HYLLAWYYAW GGAEPGGGWA WRIGDGAAHQ GYQNPLAVWA MTNIAALTPM SPTAKSDWTA SLTRQMEFYQ WLQSAEGAIA GGCTNSWNGS YSVPPSGDST FYGMAYDWEP VYHDPPSNNW FGMQAWSMER LAEFYYVTGN ATAKTILGKW ITWASSKTTV TATNFQIPST LGWTGQPDTW NPTSPGGNSG LHVTVADYGN DVGVAAAYVK TLTYYSAKSG DTASGALAKS LLDAMATFAD TAGIATPETR TDYSQFNDTV YVPSGWSGKM PNGDPIAPGA TFLSIRSWYK NDPAWPKVQA YLNGGSAPTF TYHRFWAQAD IAMAYAVYGE LIAGGGGTGG DTTPPSVPTG LTVTGTTSST ASLSWTASTD NIGVAGYTVY RGTTVAGSAT TPTFTDSGLA ASTQYSYTVT AHDAAGNVSA ASAAVKATTT AGTGGGDTTP PSVPTNLAVT ATTSSSVSLS WTASTDNVAV TGYTVYRGTT VAGTTTSPTF TDSGLTASTQ YSYTVTAHDA AGNVSAASAA VKGTTSGTGG GTGPTCTATY SVTSDWGNGF NGNVTITNTG TTATKSWKVT WTWGGNQTIT NTWNATETQS GKAVTATNAP YNNVIAPGAS TSFGFNASYS GTNGAPTVTV TAT
|
| |