Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6974 |
Symbol | |
ID | 8338340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 8066673 |
End bp | 8069489 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644960054 |
Product | cellulose-binding family II |
Protein accession | YP_003117645 |
Protein GI | 256396081 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATCT CACGGCGAGA TCTGCTCAAG GTCGGCGGAA CGGTGGTGGT CGGCGGGTCT TTGCTGTCGG CCGGTTCCGG TTCGGCGGCC GCTGCGTCGC GTGGCAGGGG CGCCCTGGTG GGCTCGACGG CTGTGGCTGT GTCCGCGGTG GGGACGGACT CAGTCGATCG TGACAGCGTG TTCACCCGGG GCGGTTCCGG GCCGCTGTAC TGGTCCACGT ACGGCTACGA CTACCCGAAC AACAACGCTC AGTCGCAGGC CAGCTGGCAG GCGAACGTCA CCTGGGTGGC GAAGAACCTC AAGCCCTACG GCATCGACAT GGCCTGTACC GACGGCTGGG TCGACTACAC CCAGGCGACC AACTCCAACG GCTACATCCT GAACTACCAG GACAGCTGGG CCATGGGCTG GGCCGGGATG TCGTCGTACC TGTCCGGGCT CGGGCTGAAG ATGGGGGTCT ACTACAACCC CCTGTGGGTG ACCAAGAGTG CGTACAACGA CCCGTCCAAG ACGGTGGTCG GGCGTCCGGA CATCCCGATC TCGTCGATCG TCACCGCGGG GGACTTCTTC GGCGGGAACA AGGGCACCCA GGAGATCTAC TGGGTGGACG TGACCAAGGA CGGCGCCAAG GAGTTCGTCC AGGGCTACGT CAACTACTTC AAGCAGCTGG GCGCGGTCTA CCTGCGCGGC GACTTCTGGG CGTGGTACGA GACCGGCTAC GACCAGAACG AGGGCACGGT CGGCGTGGCG CACGGCAGCG CCAACTACGC CACCGCCCTG GGCTGGATCA GCGAGGCCGC CGGGGACGGG CTGGAAGTCA GTGTCGTGAT GCCCAACTTG CTCAACCACG GCCAGAACGA GCGGCGCTAC GGCGACCTGA TCCGCATCGA CGACGACTGC GGCAGCGGCG GCTGGAACTT CCTGGACGGC GGCCGGTCGA GCTGGCAGAA CTACTGGACC CAGTGGCACA CCCCGTTCCT GGGCTTCACC GGCTTCTCCG ATATCTCCGG GCGCGGTCAG ATGATCCTGG ACGGCGACGT CCTGGAGATG AACAGCTTCG GCAGCGATGA CGAGCGCCGC ACGGCACTGA CACTGTTCGC CATGGCCGGC TCGCCGCTGA TCGTCGGCGA CCGGTCCGAC AACATCGGCT CCTACCTCAG CTTCTGGCAG AACAACGACA TCCTGAACAT CAACAAGGCC GGGTTCGTCG GCAAGCCCTA CTACCACAAC GCCAATCCGT TCTCGTCGGA TCCCACGAGC CGGGACCCGG AGACGTGGAC GGGCCAGCTG CCCGACGGCA CCTGGCTGGT CGCGTTGTTC AACACCACCT ACTCCAGCGT CACCAAGTCG ATCGACTTCG CCGGCGCTCT GGGGCTCGCG GCCGGCGGCA CGGTCCACGA CGTGTGGAAC AACACCAACC TGGGCCAGAT GACGTCGTAT TCGGCGTCCC TGCCGATGCA CGGGGTGTCC CTGATCAAGA TCACGCCGGC CGGGTCCGGG GCGCCGGTCT ACCAGTCTCA GGTGGCTGCC TGGGGCGGCG GGGCGATGTT CGACAACGCC GCGTCCGGCT TCAGCGGCAA CGGCTACGTG GACGGACTGG GTAGCGTCGG CGCCCGGGTC GTCTTCGGCG TCACCGGTGC GGGCGGCACG ACTCCGGTCA CCATCCGATA CGCCAACTCC GGCAGCGCCG CGAGCCTGAC CATATCGGCC AAGAACGTGG CGGGCACGGT CTCGGGCAGC ACCTCGGTCA GCCTCCCGGG CACCGGCGGC GCCGGTACCT GGAGCACGGT CACCGTCAAC CTCGCGCTGG CCGCGGGGAC GAACCTGATC ACCCTGGAGC GCACCTCCAC CGATTCCGGA TCAGTGAATC TGGACTCGAT CCAAGTCGGC GCCTCCTCAG GAGGCTCCAC CCCGCCCGGC GCCCCCGGCA CACCGGCGGC TTCGGCCATC ACCTCCAACG CGGTGACCCT GACGTGGAGC GCGGCAGCCG CGGGCAGCAA CCCGATCGCC GGATACCAGG TCTACCAGGT CGGATCACCC GACACCGTGG TCGCTTCCAC CGCCGCCGGA ACCCTCACCG CGACCATCAG CGGCCTGACT GCGGCGACGA GCTACAGCTT CTACGTCAAG GCCAAGGACA GCGCGGGAAC CGTCGGCGCC GCATCCGGAA CGACGGCAGT CACCACTGCC GGCTCCGGCG GCTCGACCCC GCCCGGCGCA CCGGGCACAC CGGCAGCGTC CACCATCACC GCGACCGCGG TGACCCTGAC GTGGAGCGCG GCAGCCGCTG GCAGCAACGC GATCGCCGGA TACCAGGTCT ACGAGGTCGG ATCGCCCGAC ACCGTGGTCG CTTCCACCGC TGCCGGAACC CTCACCGCGA CCATCAGTGG CCTGATGAGC GCCACGCAGT ACGGCTTCTA CGTCAAGGCC AAGGACAGCG CCGGAGCTCT CGGCGCCGCT TCAGCCACGA AAACCGTGAC GACGGCTGCG GTTTCGGCTG GAGCCGCGGT CTCCTACGCC GTCCAAAGCG ACTGGGGCTC GGGCTTCAGC GCCTTGGTGA CGATCACCAA CACCGGTACC AGCGCGATCA ACAACTGGAC CCTCGGATTC ACCTTCGCGG GCAACCAGCA CGTCACCAAC GGCTGGAACG CCACCTGGTC CCAGAGCGGC GCGAACGTCA CCGCCTCCAG CGAGTCCTTC AACGGCGCGA TCGCCCCGGG CGCCTCGGTC CAGATCGGCT TCACCGGTAC CTACAGCGGT GCCAACGCCA AGCCGACCGC CTTCACGATC AACGGGCAGC CCGCCACCAC GCAGTGA
|
Protein sequence | MAISRRDLLK VGGTVVVGGS LLSAGSGSAA AASRGRGALV GSTAVAVSAV GTDSVDRDSV FTRGGSGPLY WSTYGYDYPN NNAQSQASWQ ANVTWVAKNL KPYGIDMACT DGWVDYTQAT NSNGYILNYQ DSWAMGWAGM SSYLSGLGLK MGVYYNPLWV TKSAYNDPSK TVVGRPDIPI SSIVTAGDFF GGNKGTQEIY WVDVTKDGAK EFVQGYVNYF KQLGAVYLRG DFWAWYETGY DQNEGTVGVA HGSANYATAL GWISEAAGDG LEVSVVMPNL LNHGQNERRY GDLIRIDDDC GSGGWNFLDG GRSSWQNYWT QWHTPFLGFT GFSDISGRGQ MILDGDVLEM NSFGSDDERR TALTLFAMAG SPLIVGDRSD NIGSYLSFWQ NNDILNINKA GFVGKPYYHN ANPFSSDPTS RDPETWTGQL PDGTWLVALF NTTYSSVTKS IDFAGALGLA AGGTVHDVWN NTNLGQMTSY SASLPMHGVS LIKITPAGSG APVYQSQVAA WGGGAMFDNA ASGFSGNGYV DGLGSVGARV VFGVTGAGGT TPVTIRYANS GSAASLTISA KNVAGTVSGS TSVSLPGTGG AGTWSTVTVN LALAAGTNLI TLERTSTDSG SVNLDSIQVG ASSGGSTPPG APGTPAASAI TSNAVTLTWS AAAAGSNPIA GYQVYQVGSP DTVVASTAAG TLTATISGLT AATSYSFYVK AKDSAGTVGA ASGTTAVTTA GSGGSTPPGA PGTPAASTIT ATAVTLTWSA AAAGSNAIAG YQVYEVGSPD TVVASTAAGT LTATISGLMS ATQYGFYVKA KDSAGALGAA SATKTVTTAA VSAGAAVSYA VQSDWGSGFS ALVTITNTGT SAINNWTLGF TFAGNQHVTN GWNATWSQSG ANVTASSESF NGAIAPGASV QIGFTGTYSG ANAKPTAFTI NGQPATTQ
|
| |