Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3718 |
Symbol | |
ID | 8335071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4183999 |
End bp | 4185576 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644956858 |
Product | cellulose-binding family II |
Protein accession | YP_003114461 |
Protein GI | 256392897 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.154432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACATC CGTTCTCCCT CTTCAGATCT GCCCGCAGCC GGGCGGTTTC GGCCGTGGCC GCGGTCGCGG TGATCACTAT CGCGGGCGCC GGGGCTGTGG TGGCGCTTCC GCGAGCCGCG CAGGCGGCCG GGGCGACGCA GTGTCAGGTT TTGTACTCGG TGGCCAATGA CTGGGGCAGT GGCTTCAGCA CGAACGTCAG CATCACGAAC CTCGGTGCGC CGTGGACGAG CTGGACTCTC GGCTACTCCT ACGCCGGGAA CCAGACGCTG TCCTCGGGCT GGAACGGGTC CTGGACCCAG TCCGGCAAGG CCGTGACGGT CACGAGCATG TCGTGGAACG GCGCCGTGGC CACCAACGGC ACCGTCACCC CGGCGGCGAA CTTCACCTAC AGCGGTGCGA ACGCCGCCCC GACGGCGTTC ACCGTCAACG GAGTCCTGTG CGGCGGGCCC GGCTCGCCGC CTCCGACGTC CACCCCCAGC ACCTCGCCCT CAACCTCGCC GAGCAGCACG CCGAGCACGC CGCCGAGCAG CCCGCCGCCG GGGACGCCCG CCCCGCAGCT TCACGTCTCC GGAAACCACC TGGTCACCTC GGCCGGCGCG ACCTACCGTC TCCTGGGCGT CAACCGCTCC AGCGGCGAGT TCGCCTGCGT CCAGGGCAAG GGCATGTGGG ACGGGCCGGC GGACCAGGCC ACGATCGATG CGATGAAGAC CTGGAACATC CACGTTGTGC GCATCCCGCT GAACGAGGAG TGCTGGCTGG GCAACAGCGA CGTCCCCGCG GGCGGTACCG TCGGCGCCGC GTATCAGAAG GCGGTCAAGG ACTACACCGA TCTGTTGGTG GCCAACGGCA TCAACGTGAT CCTGGACCTG CACTGGACCT ACGGCCAGTA CACCGGCCCG AGCTCGGCGT GCGCCGACGC GCTGGCCGCG TGCCAGAAGC CGATGCCGGA CGCGCAGTAC ACCCCGACGT TCTGGAAGCA GGTCGCCACC GCGTTCAAGG GTGACAACGC AGTGCTCTTC GACCTGTTCA ACGAGCCCTA TCCGGACGCC GCGAACAACT TCTCCAACGC CACCGAGGCC TGGACGTGCC TGCGCGACGG CGGAACCTGC ACCGGCATCA CCTACCCGGT CGCCGGCATG CAATCGCTGG TCGACGCGGT CCGCGCCACC GGTGCCACCA ACGTCGTCAT GACCGGCGGC CTGACCTGGA CCAACGACCT GAGCCAGTGG CTGGCCTACG AGCCGAAGGA TCCCACCGGC AACCTGGTCG CCTCCTGGCA CTCCTACAAC TTCAACGGCT GCATCACCAC CTCCTGCTGG AACTCCACTA TCGGCGCCGT GGCCGCGAAG GTGCCGGTCC AGGCCGGCGA GATCGGCCAG AACAACTGCA ACCACGACTA CATCGACCAG GTGATGGCCT GGGCGGACGC CAACGGCGTC GGCTACTCGG CGTGGACGTG GAACCCCTGG GGCGTCTGCA ACAGCAACGG CAACGACCTG ATCACCGACT GGAGCGGCAC ACCCACCGCC ACCTACGGCC AGGGATACCA AGCGCATCTG CTCACCCAAA AGCCCTGA
|
Protein sequence | MRHPFSLFRS ARSRAVSAVA AVAVITIAGA GAVVALPRAA QAAGATQCQV LYSVANDWGS GFSTNVSITN LGAPWTSWTL GYSYAGNQTL SSGWNGSWTQ SGKAVTVTSM SWNGAVATNG TVTPAANFTY SGANAAPTAF TVNGVLCGGP GSPPPTSTPS TSPSTSPSST PSTPPSSPPP GTPAPQLHVS GNHLVTSAGA TYRLLGVNRS SGEFACVQGK GMWDGPADQA TIDAMKTWNI HVVRIPLNEE CWLGNSDVPA GGTVGAAYQK AVKDYTDLLV ANGINVILDL HWTYGQYTGP SSACADALAA CQKPMPDAQY TPTFWKQVAT AFKGDNAVLF DLFNEPYPDA ANNFSNATEA WTCLRDGGTC TGITYPVAGM QSLVDAVRAT GATNVVMTGG LTWTNDLSQW LAYEPKDPTG NLVASWHSYN FNGCITTSCW NSTIGAVAAK VPVQAGEIGQ NNCNHDYIDQ VMAWADANGV GYSAWTWNPW GVCNSNGNDL ITDWSGTPTA TYGQGYQAHL LTQKP
|
| |