Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6683 |
Symbol | |
ID | 8338047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7699889 |
End bp | 7701790 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644959777 |
Product | Cellulose 1,4-beta-cellobiosidase |
Protein accession | YP_003117370 |
Protein GI | 256395806 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.918543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.93944 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAAA GCTCACCCAG CGCCACCGGC AGAGCCAGAC GCCTCACCAG TGCCTTCGTG GCAGCCGGTG TGACGCTTGG GGTGGCCGGC GGGCTCGCGG CGCTGACCAC GACCAGCTCG AACGCGGCGA CCGCCGCCGG ATGCACCGCG GTGTACTCCA CGACGTGGGA CTCCGGGAGC GGCTTCGGCG CGCAGGTGGT CATCACCGAC AACGGTCCGG CGTGGACGAA CTGGACGCTC AGCTACTCCT ACGCCGGCAA CCAGACGCTG CAGAACGGCT GGAACGGCAC CTGGAACCAG TCCGGCAAGA CGGTCACCGT CACCAACGCG TCCTATAACG GCGCGGTGGC CTCCGGCGGA ACCGTGACCC CGGCGGGGAA CTTCGGCTAC TCCGGGACCA ACGCGGCGCC GACGTCGTTC TCGGTCAACG GGATGACGTG CAGCGGGACC ACTCCGCCGC CGACGACGCC GACGACCACG CCGTCCACGC CGTCCACGAC GCCGACGACC ACGCCGAGCA CGACCCCGTC CACGACCCCG TCTACGACGC CCTCCACCAC GCCGTCCACC ACGCCCTCGA CGACCCCGTC GACCGGCGGC GGCGGAGGCG GGCACGTGGC CAACCCGTTC GTGGGCGCCT CGCAGTACCT GAGCCCGGAC TACGCCGGCG AGGTCAACGC CCAGGCGGCC GCCGACCAGT CGTCGAACCC GGCTCTGGCA GCCTCCGAGT CGAAGATGGC CGGCTACGCG ACCGCGGTCT GGATGGACCG GATCGCGGCC ATCACCGGCA CCGGCGACAG CGTGCACCAC GGTCTGCAGT GGCATCTGGA TCAGGCGCTG AGCCAGCAGA AGGCGGGGAC TCCGATCACC TTCGAGGTCG TCATCTACGA CCTGCCGGGG CGCGACTGCG CCGCACTGGC CTCCAACGGC GAGATCCCGG CGACCGCCGC CGGCCTGACC GAGTACGAGT CGCAGTACAT CGACCCGATC TCGGCGATCC TGGCCGACCC GAAGTACTCC GGCATCCGGA TCGTCGCGAT CGTCGAACCG GACTCGCTGC CGAACGCGGT GACCAACCAG AGCAAGTCCG CGTGCGCGAC GGCGACGCCG TTCTACGAGT CCGGGGTCGA ATACGCCCTG AACAAGCTGC ACGCGATCTC CAACGTCTAC AACTACGTGG ACATCGCGCA CTCGGCGTGG CTGGGCTGGT CCTCCAACAT GGGTCCGGCG GCGCAGGAGT TCGCCAAGGT GGCCCGGGCC ACGACGGCCG GGTTCGCCAG CGTCGACGGC TTCATCTCCA ACACGGCCAA CTACACCCCG ACCACCGAGC CGTTCCTGCC GAACTCGACC CTGCAGGTCG GCGGCAACCC GCTGGACTCG GCGAAGTTCT ACCAGTACAA CCCGTACTTC GACGAGTACG ACTACGACCA GGCGATGTAC AGCCAACTGG TCGGCCAGGG CTTCTCGGCC AACATCGGGA TGCTCATCGA CACCTCGCGC AACGGCTGGG GCGGCCCGAA CCGCCCGACC GCGCTGAACT CCTCGCCGAC GACCGTGGAC ACCTACGTCG CGGCCAACAA GGTGGACCAG CGCTCCTTCC GCGGTGACTG GTGCAACCAG AACGGCGCGG GGGTCGGCTC GCGGCCGACG GTACAGCCGT ACGGCGCGTC CAACCACATC ATCGCCTACG TGTGGATCAA GCCTCCGGGG GAGTCCGACG GCGACTACCC GAGCGCCTCG CACAGCCACG GCGACCCGCA CTGCGACCCG GCCGGGACCA ACACCGACGG CAACGGCGGG ACCTACTCGA CCGGGTCGAT CCCCGGCTAC GACGTGCCGG CCGGACAGTG GTTCGCCGCT GAATTCCAGC AGGAGGTGGA GAACGCCTAT CCCGCGATGT AG
|
Protein sequence | MGQSSPSATG RARRLTSAFV AAGVTLGVAG GLAALTTTSS NAATAAGCTA VYSTTWDSGS GFGAQVVITD NGPAWTNWTL SYSYAGNQTL QNGWNGTWNQ SGKTVTVTNA SYNGAVASGG TVTPAGNFGY SGTNAAPTSF SVNGMTCSGT TPPPTTPTTT PSTPSTTPTT TPSTTPSTTP STTPSTTPST TPSTTPSTGG GGGGHVANPF VGASQYLSPD YAGEVNAQAA ADQSSNPALA ASESKMAGYA TAVWMDRIAA ITGTGDSVHH GLQWHLDQAL SQQKAGTPIT FEVVIYDLPG RDCAALASNG EIPATAAGLT EYESQYIDPI SAILADPKYS GIRIVAIVEP DSLPNAVTNQ SKSACATATP FYESGVEYAL NKLHAISNVY NYVDIAHSAW LGWSSNMGPA AQEFAKVARA TTAGFASVDG FISNTANYTP TTEPFLPNST LQVGGNPLDS AKFYQYNPYF DEYDYDQAMY SQLVGQGFSA NIGMLIDTSR NGWGGPNRPT ALNSSPTTVD TYVAANKVDQ RSFRGDWCNQ NGAGVGSRPT VQPYGASNHI IAYVWIKPPG ESDGDYPSAS HSHGDPHCDP AGTNTDGNGG TYSTGSIPGY DVPAGQWFAA EFQQEVENAY PAM
|
| |