Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6682 |
Symbol | |
ID | 8338046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7696971 |
End bp | 7699778 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644959776 |
Product | cellulose-binding family II |
Protein accession | YP_003117369 |
Protein GI | 256395805 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.224663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGAA GCTGGAAGAA CGCGGTAAGC GCGGTGCTCG GCGCCGTGCT CGCCGTCGCC GGGCTCGGCG TCGTCAGTGT GTCGTTGAGC GGCAGTACCG CCGCCGCGGC CTCGGCAGCC TCGGCGGCCT CCGGTTCCGG TTATACGTGG CAGAACGTGC CGATCGTCGG CGGCGGCTTC GTGCCGGACG TCGTCTTCAA CACCGGCGCG AAGAACGTCG CCTATGCCCG CACCGACATC GGTGGGATGT ACCGCTGGAA TCAGAGCTCG AGCAGTTGGA CGCCGCTGTT GGACTGGGTC GGCTTCACCA ACTGGAACGA GCAGGGCGTC GTCTCGGTCG CCGCGGATCC GGTGCAGACC AACCGCGTCT ACGCGGCGGT CGGGATGTAC ACGAACAGCT GGGACCCGAA CAACGGCGCG ATCCTGCGCT CCACCGACCA GGGGAGTACG TGGTCTGCGA CGCCGCTGCC GTTCAAGCTC GGCGGGAACA TGCCGGGGCG CGGAATGGGG GAGCGGCTGA TGGTCGACCC CAACGACGAC GCCGTGCTGT ACATGGGAAT GCCCAGCGGG CACGGACTGT GGAAGAGCAC TGACTACGGC GTGACGTGGG CGCAGGTGAC GGCGTTCCCG AACGTCGGCA ACTACGTGCA GGACTCGACC GACACCACCG GCTACCTCTC GGACAACCAG GGCGTCGCCT GGGTCGCCTT CGACAAGGCC TCCGGGACGA CCGGAGCCGC CGGCACGCCG ACCAAGACCA TCTTCGTCGG CGTCGCGGAC GTGCAGAACA CGGTCTACGA GTCGACCAAC GGCGGCGCGA CGTGGAGCCG CGTCGCCGGG CAGCCGACCG GCTTCCTGGC ACACAAGGGC CTGGTGGATC CGACCGGCGC CTACCTGTAC ATCACCACCA GCGACAAGGG TGGTCCCTAC GACGGCGCTT CCGGGGACGT GTGGAAGTAC GCGATCGCCT CCGGGACCTG GACGCAGATC AGCCCGGTCA AGTCCAGCGA CACGTCGAAC GACTTCTACG GTTACAGCGG GCTGACCATC GACCAGCAGC ACCCGAACAC CATCATGGTC ACCGGCTACA GCTCGTGGTG GCCGGACACC TTTATCTACC GCTCCACCGA CGGCGGCGCG ACGTGGACGA ACGCTTGGTC CTACAACGGG TATCCGACCC GCGTCGACAA GTACACGCTG GACGTCTCCG CCTCGCCGTG GCTGTCGTGG GGCAACTACC CCTCGCCGCC GGAGGAGACG CCGAAGCTCG GGTGGATGAG CGAGGGGTTG GCGATCGACC CCTTCAACAG CGACCGCATG ATGTTCGGCA CCGGCGCGAC GCTGTACGGC ACGACGAACC TGACGGCGTG GGACTCCAGC ACGGCGAAGG TGAAGCTCTC CGTGATGGCG CAAGGGATCG AGGAGACCGC GGTCACCGAC CTGGTGTCGC CGCCGTCCGG CGCGCCGCTG CTGTCCGCGC TCGGTGATGT GGACGGCTTC CGGCACACGG ACCTCACCAA GGTCCCGGCG ATGATGTATC AGAGCCCGAA CTGGAGCACG TCCACGAGTA TCGACTACGC CGAGGCGAAC CCGAACGACG TGGTCCGCGT CGGCAACGGC AGCTCGACCG TGAACTCGGC GGCGTTCTCC TCGGACGACG GCGCCGACTG GTACGCGGCG TCGTCGCAGC CCTCGGGCGT CACCGCCGGC GGCACGGTGG CGATGGCGGC CGACGGCAGC GCGGCGGTGT GGTCGCCGAC CAGTGCCGCG GTCAGCTACA CGACCACGAC CGGGAGTTCG TGGACTGCTT CGACCGGCAT ACCCGCCGGC GCGCAGGTGC GCAGCGACCG GGTCAACCCG AAGAAGTTCT ACGGCTTCAA CTACGCCACC GGCACGTTCT ACCTCAGCAC CGACGGCGGC AAGACCTTCA CCGCCTCCGC TGCCACGAAC CTGCCCTCGG GCACGGGAGG CGCCTACGTC CACGCAGTCC CCGGCACCGA GGGCGACCTC TGGCTGGCCG GCGGCTCCAC CTCCGGCGCC TACGGTCTGT GGCACTCCAC GGACTCCGGC GCCACCTGGA CGAAGCTGGC GAACGTCCAG CAGGCCGACA ACATCGGCTT CGGCGCTCCC GCACCCGGCC GCACGAACAA GGCGCTGTAC ACGATCGCCG AGATCGGCGG CGTGCGCGGC ATCTTCCGCT CCGACGACGA CGGCAGCACT TGGACCCGCA TCAACGACGA CCAGCACCAG TGGGGCAACA TCGGCGCCGC GATCACCGGC GACCCGCGGG TGTACGGACG GGTTTATGTC GGGACCAACG GACGAGGGAT CGTCTACGGG ACTATGACGG GGACGGGGAG TTCGTCTTCC TCTTCTCCTT CGACGACACC GAGCACCACG CCGTCCACCA CTCCCTCGAC GACACCCTCG ACGACGCCTT CCACCACCCC GAGCGCGACC CCGTCCACGA CGCCGTCGTC CAGTGCCCCG GCGGGCACGT GCCACGTGAC CTACACCAAG GCGAGCGAAT GGGGTGGCGG CTTCACGGCG AACATCACCG TCGCCAACAC CGGCGCGAGC CCGTGGACCG CCTGGACGGT CGCCTGGACC TATCCCGGCG ACCAGAAGGT GACCAACGGT TGGAACGCCA CCGTGACCCA GACCGGAGCC AAGGTCAGTG CCACGAACGT GGCCTACAAC GGTTCTGTGG CCTCGGGATC GTCGGCGTCG TTCGGGTTCC AGGGCACATG GACGTCGAAC GACACCAGTC CGACAGCGTT CTCGGTGAAC GGCGCGGCCT GTTCGTGA
|
Protein sequence | MRRSWKNAVS AVLGAVLAVA GLGVVSVSLS GSTAAAASAA SAASGSGYTW QNVPIVGGGF VPDVVFNTGA KNVAYARTDI GGMYRWNQSS SSWTPLLDWV GFTNWNEQGV VSVAADPVQT NRVYAAVGMY TNSWDPNNGA ILRSTDQGST WSATPLPFKL GGNMPGRGMG ERLMVDPNDD AVLYMGMPSG HGLWKSTDYG VTWAQVTAFP NVGNYVQDST DTTGYLSDNQ GVAWVAFDKA SGTTGAAGTP TKTIFVGVAD VQNTVYESTN GGATWSRVAG QPTGFLAHKG LVDPTGAYLY ITTSDKGGPY DGASGDVWKY AIASGTWTQI SPVKSSDTSN DFYGYSGLTI DQQHPNTIMV TGYSSWWPDT FIYRSTDGGA TWTNAWSYNG YPTRVDKYTL DVSASPWLSW GNYPSPPEET PKLGWMSEGL AIDPFNSDRM MFGTGATLYG TTNLTAWDSS TAKVKLSVMA QGIEETAVTD LVSPPSGAPL LSALGDVDGF RHTDLTKVPA MMYQSPNWST STSIDYAEAN PNDVVRVGNG SSTVNSAAFS SDDGADWYAA SSQPSGVTAG GTVAMAADGS AAVWSPTSAA VSYTTTTGSS WTASTGIPAG AQVRSDRVNP KKFYGFNYAT GTFYLSTDGG KTFTASAATN LPSGTGGAYV HAVPGTEGDL WLAGGSTSGA YGLWHSTDSG ATWTKLANVQ QADNIGFGAP APGRTNKALY TIAEIGGVRG IFRSDDDGST WTRINDDQHQ WGNIGAAITG DPRVYGRVYV GTNGRGIVYG TMTGTGSSSS SSPSTTPSTT PSTTPSTTPS TTPSTTPSAT PSTTPSSSAP AGTCHVTYTK ASEWGGGFTA NITVANTGAS PWTAWTVAWT YPGDQKVTNG WNATVTQTGA KVSATNVAYN GSVASGSSAS FGFQGTWTSN DTSPTAFSVN GAACS
|
| |