Gene Information |
Locus tag | Caci_6973 |
Symbol | |
ID | 8338339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 8063552 |
End bp | 8066575 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644960053 |
Product | cellulose-binding family II |
Protein accession | YP_003117644 |
Protein GI | 256396080 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
| |
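The start, end, gene length, and protein length listed above agree with each other, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon; both are assumptions, although they match the 3024 bp and 1007 aa reported in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caci_6973.
length = gene_length_bp(8063552, 8066575)
print(length)                     # 3024
print(protein_length_aa(length))  # 1007
```

The strand ("-") does not affect the length calculation; it only determines which strand of the replicon is read.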
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACATCA GGACACGACG AAGAATCCCG GCGCGCGATC ACCGGCCGAG AGCGGTTCGA CCCAGCCGCC ACCGGGCCGC GTTGGCCGTC GCGGCGCTGG CTGCCAGCGT GATCGGGTGG ATGCCAGGCG TCAGTTCGGC CGCACCGGCC GCAGCCTCGG GCACCTCGGG CGACCCGGGG GTCGCTCGCA CCCAGCAGGT ACTCGCCTCG ATCACGACCG ACAAGTCGCG GTACGCACCC GGCGACACCG TCTCGCTGAC CGTCAACGCA GCCAACAAGA CCGGCAGTGC CATCAGCGGC GGCGCCGTCA CCTTGTACTT CATGACGATG CAGAACGCCG CCAGCGCTTC GCAGGCCCAG ACTCTGAACC TCGGCTCTGG CGCGTCCTCG ACACTGGCCT TCCACTGGAC CGCACCGTCG ACCGACTACA CCGGCTACAT GGTCAGCGCG GTCGCCACCG ACTCCTCCGG CAAGGCGCTG GACTCGATCA ACGTCGCGGT CGACGTCTCC TCGGACTGGA GCCGGTTCCC GCGCTACGGC TACATGACCA ACAACTCCTT CGGGAACCAA GGCCTGTCGA CGTCCCAGGC CGCCTCGATC ATGAGCTCGA TGTCGAACTA CCACATCGAC GGCCTGCAGT TCTACGACTG GCAGTACAAC CACGACCAGC CGCTGTGCGG CACGGTCAGC GCGCCGTGTT CCTCGTGGAC CGACGACGGC AACCAGAAGA CGGTGTACGC CTCGGCGGTG AAGGATCTGG TCACCGCCGC GCACAACAGC AACATCGTCG CGATGCCGTA CAACGCCATC TTCTCGGCGG ACAACGGCTC GTGCTGCGGT GCGCCGGACT ACCACACGCA AGGGCTCGGA GTCAGTCCGT CGTGGGGCGT CTACCAGGAC ACCAACCACA GCAAGCCGTT GGCGTTCTTC CAGTGGGACT ACATGGATCC CAGCAACCCC GGCTGGCAGC AGTACCTGAT GGGCCAGCAG AACGCCGCGA TCCAAGCGTT CGACTTCGAC GGCTTCCACG GTGACACCTT CGGCGACCCC GACACCGTCG ACTACAACTA CAACGGCCAG CCGGCCGGCG TGGCCAGCGA CTCGTGCACC ACCGACACCG ACGGCGCCCA CAGCACGACG CCCGTTCACA ACGTGGCCGG CTCGGCGACG TGGCTCAGCG GGACCTTCCC GTCGTTCCTC AGCTATGCCA AGAGCGCCCT GGGCAGTGGC AAGTACCTGA TGTTCAACCC GGTCACCTAC GACCACGCCC ACTGCGAGGC CAACACCAGC GCGGTGGACC TGCTCTATTC CGAGCTGTGG CCGAACGACC GGGACCAGTA CTGGGACTAC GGCAGCCTGA AGACGGCCAT CGACCAAGGC TTCAGCGAGA GCGCGTCGGC CAGCCCGACC GGCCGCGGCA AGTCGCTGAC GGTCGCGGCG TACACCGACT TCGCCAACGG CGGCGGCGGC ACGTTCAACA CCCCGGACGT GCTGCTGCTG GACTCCACGC TGTTCGCCAG CGGCGGCAGC CACGAAGAGC TCGGCGACAA CGGCCTGATG CTGGACTATC AGGAGTACCG GGCCGGCGCG ACACCCATGA GTGCCTCGCT GTCCCAGTCG GTGCAGAACT ACTACGACTT CATGACCGCC TACGAGAACC TGCTGCGGGA CGGCCAGACG GCGACCAATC AGACCGTCGC CGTCTCCGGC CAGACGGTCA GCAGCCAGGC GACGCCGGGC GACGTCTGGG CGTTCACCAA GCAGGACGCG GACCACGAGG TCATCCAGCT CATCAACATG 
GTCGGGCAGT CCAGCAACCT CTGGCAGACC GGCGCGTGCG ACATGTGCTC GCACATCACC ACGCCGCACC CGGCGCCGAC ACAGCTGACC AATGTGCCGG TGAAGTACTA CTTCAAGAAC ACGCCCAAGG CCGTCATGTT CGCCTCGCCG GACTACAACA ACGGCACCAC CTACTCGGTG CCGTTCACCA CCGGGACCGA CTCCGGCGGC TCGTACGTGT CGTTCACCGT GCCCAGCCTC AACTACTGGG ACATGGTGTA CACCAGCCAG ACCGGACCGG GCGACGCGCC GGTTCTGCCC GGCAGCGGCG GTACGCCGAC CGCCCCGGGC GCGCCCGGGA CCCCGGTCGC GTCCAACATC ACCGCCAACT CCGCGACCCT GACCTGGACC GCTGCCACGG CCGGCAGCAA CCCGGTCGCC GGGTACGACG TGTACCGCGT CGGCTCCCCG GACGCCGTCG TGGCCTCCTC GACCGGTCTG TCGGCGAACG TCAGTGGTCT GTCGCCGTCG ACCAGCTACC AGTTCTACGT CAAGGCGAAG GACTCCACAG GGCTGACCGG TTCGGCGTCC GGCACCACGT CGGTGACCAC TGCCTCCGGC GGTTCCACAC CACCGGGTGC TCCAGGTACT CCGGCGGCGT CGAACGTCAC CACCAGCGCG GCCACGCTGA CCTGGACCGC GGCAGCCGCG GGCAGCAACC CGATCTCCGG CTACCAGGTC TTCCAGGTCG GCAACCCTGA CAAGGTCGTG GCGTCCACCG GTGCCGGCAC TCTGAGCGCC ACCATCACCG GGCTGTCGCC ATCCACTGCC TACCAGTTCT ACGTCAAGGC GAAGGACTCC ACAGGCCTGA CCGGTTCGGC GTCCGGCACA ACGGCGGTGA CCACCGCAGG CGCACCACCG CCGAGTACGG CCAAGGTGAC GTACGCGGTG CAAAGCGACT GGGGATCGGG GATGTCCGTC GCGGTGACGA TCACCAACAC CGGCAGCACC GCGATCAACG GCTGGACGCT GGGCTTCGCC TTCCCCGGCA ACCAGCAGGT CGGCAGCGGC TGGAACGCCA ACTGGTCGCA GAACGGCCAT AACGTCACCG CCACCAACCA GTCCTTCAAC GGCGCCATCG CGCCGGGAGC CTCGATATCG ATCGGCTTCA GCGGCACCTA CAGCGGTGCC GATGCGAAGC CCTCCGCGTT CACCGTCAAC GGCCTACCGG CCACCGTCGG ATGA
|
Protein sequence | MHIRTRRRIP ARDHRPRAVR PSRHRAALAV AALAASVIGW MPGVSSAAPA AASGTSGDPG VARTQQVLAS ITTDKSRYAP GDTVSLTVNA ANKTGSAISG GAVTLYFMTM QNAASASQAQ TLNLGSGASS TLAFHWTAPS TDYTGYMVSA VATDSSGKAL DSINVAVDVS SDWSRFPRYG YMTNNSFGNQ GLSTSQAASI MSSMSNYHID GLQFYDWQYN HDQPLCGTVS APCSSWTDDG NQKTVYASAV KDLVTAAHNS NIVAMPYNAI FSADNGSCCG APDYHTQGLG VSPSWGVYQD TNHSKPLAFF QWDYMDPSNP GWQQYLMGQQ NAAIQAFDFD GFHGDTFGDP DTVDYNYNGQ PAGVASDSCT TDTDGAHSTT PVHNVAGSAT WLSGTFPSFL SYAKSALGSG KYLMFNPVTY DHAHCEANTS AVDLLYSELW PNDRDQYWDY GSLKTAIDQG FSESASASPT GRGKSLTVAA YTDFANGGGG TFNTPDVLLL DSTLFASGGS HEELGDNGLM LDYQEYRAGA TPMSASLSQS VQNYYDFMTA YENLLRDGQT ATNQTVAVSG QTVSSQATPG DVWAFTKQDA DHEVIQLINM VGQSSNLWQT GACDMCSHIT TPHPAPTQLT NVPVKYYFKN TPKAVMFASP DYNNGTTYSV PFTTGTDSGG SYVSFTVPSL NYWDMVYTSQ TGPGDAPVLP GSGGTPTAPG APGTPVASNI TANSATLTWT AATAGSNPVA GYDVYRVGSP DAVVASSTGL SANVSGLSPS TSYQFYVKAK DSTGLTGSAS GTTSVTTASG GSTPPGAPGT PAASNVTTSA ATLTWTAAAA GSNPISGYQV FQVGNPDKVV ASTGAGTLSA TITGLSPSTA YQFYVKAKDS TGLTGSASGT TAVTTAGAPP PSTAKVTYAV QSDWGSGMSV AVTITNTGST AINGWTLGFA FPGNQQVGSG WNANWSQNGH NVTATNQSFN GAIAPGASIS IGFSGTYSGA DAKPSAFTVN GLPATVG
|
| |
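The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be cleaned before it can be analyzed. The sketch below shows one possible way to strip the whitespace, compute a GC fraction, and check that a CDS ends in a stop codon of translation table 11. The helper names are hypothetical, and the toy sequence is a made-up 12-base example, not data from this record.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Made-up toy input, formatted in blocks like the record's sequence field.
toy = clean_sequence("ATGGCC GGTTGA")
print(toy)                         # ATGGCCGGTTGA
print(round(gc_fraction(toy), 2))  # 0.58
print(ends_with_stop(toy))         # True
```

Applied to the full gene sequence from the record, the same functions could be used to compare the computed GC fraction against the listed 67% and to confirm the terminal TGA.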