Gene Information |
Locus tag | Caci_4876 |
Symbol | |
ID | 8336230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5548775 |
End bp | 5550493 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644957975 |
Product | Cellulase |
Protein accession | YP_003115577 |
Protein GI | 256394013 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
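
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function and variable names are illustrative and do not come from the record.

```python
# Values copied from the record above; names are illustrative only.
start_bp = 5548775
end_bp = 5550493


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1719 572
```

Both results match the Gene Length (1719 bp) and Protein Length (572 aa) fields.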
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.168386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.620598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATCGCT CTCCGCTACG CAAGGTTCGA AGGCGTACCG GTCCAAGACT CGCCGGGTTC GCAAGCCTGC TGCTGGCGAT CACCGGCATG CTGCTCCCCG CGACAGCGCA CGCCACCGCT GGCGCCCCGG CAGCCGCCGG ACCGATGCGG TTCCTCGCGG CCATGCAGCC CGGCTGGGGT CTGAGCAACA CCTTCGACGC CATCCCCGAC CCGACGTCCT GGGGCAACCC GCCGGTCACC AAGGCCCTGA TCGACCAAGT ACGCTCCGAC GGCTTCCACA GCATCCGCAT CCCGGTCACC TGGGGCGGCC ACGAAGGCGC CGCGCCGAGC TACACCATCG ACCCGGCCTT CATGAACCAG GTCAAGCAAG CAGTGGACTG GTCACTGTCC GACGGGCTCT ACGTCGTCCT CGACGTCCAC CACGACTCCT GGCAGTGGAT CAGCAACATG GCCGGCGACC CGACCGGCGT CCTGGCCCGA TTCGACGCCA CCTGGACCCA GATCGCCGAC ACCTTCAAGG CAGAATCCGA CAAACTCGTC TTCGAATCCG TCAACGAACC CCAATTCACC AACACCACCG ATACCCAAGC CGAGGCACTG CTCGATCAAC TCAACACCGC CTTCAGCCAC CTCATACGCG CCACCGGCAG CACCAACACA CACCGCTACC TGCTCCTGCC CACCCTCGGC GACACCCCCA CCAAGCCACT GATGGACAGC CTGCTCAACA CCATCGAGAC GCTCCACGAC CCCGATGTGA TCGCCAGCTT CCACTACTAC GGCTACTGGC CCTTCGCGGT GAACATCGCC GGCGCGACCA CCTTCGACAC CACCGCCCAA CAGGACATGA CCACCGACTA CCAGCTCGCC CACGACGAGT TCATCACCAA GGGCATCCCC GTCTGCGCCT GCGAGGTCGG ACTCCTCGGC TACGACTACA CCAAACCCGG CGTCATCGAG CGCGGAGAAA CACTCAAATA CTTCGAAGCC CTCGGCGACG AATCACGGAG CACCGGCATC CCCACCGACT ACTGGGATTC CTACATCAAC CGCGCCACAC TCCAGCCGCG CGACCCCGAT CTGTTCGCCC AAATCCAGTC GAGTTGGCGG GCGTCTTCAG GAACCGCCTC ATCCGACATG GTGTTCCTGC CACAAGCCAG TCCGATAGCG GCTCACAGCC TCACACTGAA CCTCAACGGC GACACCTTCA CCGGCCTCAC CCAGGGCGAT GTCAGACTTG CCCAAGGCAG GGACTACACC GTCTCCGGCG ACCAGCTCAC CCTCACCGCA TCCCTGCTGA CACGGCTGGC CGGCAACGGG CCCACCGGAC CCAACGCGAC GCTGCAGGCA CACTTCTCCC ACGGCCTGCC ATGGCAGATC AGCGTCATCG TCAGCAGCCA GCCCACCCTG GCCGCCGCCA CCGGAAGCAC CAGCTCCTTC GCGGTCCCCA CGCGGTTCAA CGGCGACATC CTGTCCACCA TGCAGGCGCA GTACGCCGAC GGCAGCAATG CCGGGCCGGC CAACTGGACC TCATACCTGG AATACTCCAC GGCGTTCGCC GCGGACTACG CCAACGGCAA CACCACATTG ACATCAGCGT TCTTCGACTC GCTGACCGAC GGCGCGAAGG TCACCCTCAC GTTCCACTTC TGGAGCGGAG CCACCGCCAC CTACTACGTC ACCAAGAACG GCACCGCCGT CACCGGCACG ACCTCCTGA
|
Protein sequence | MHRSPLRKVR RRTGPRLAGF ASLLLAITGM LLPATAHATA GAPAAAGPMR FLAAMQPGWG LSNTFDAIPD PTSWGNPPVT KALIDQVRSD GFHSIRIPVT WGGHEGAAPS YTIDPAFMNQ VKQAVDWSLS DGLYVVLDVH HDSWQWISNM AGDPTGVLAR FDATWTQIAD TFKAESDKLV FESVNEPQFT NTTDTQAEAL LDQLNTAFSH LIRATGSTNT HRYLLLPTLG DTPTKPLMDS LLNTIETLHD PDVIASFHYY GYWPFAVNIA GATTFDTTAQ QDMTTDYQLA HDEFITKGIP VCACEVGLLG YDYTKPGVIE RGETLKYFEA LGDESRSTGI PTDYWDSYIN RATLQPRDPD LFAQIQSSWR ASSGTASSDM VFLPQASPIA AHSLTLNLNG DTFTGLTQGD VRLAQGRDYT VSGDQLTLTA SLLTRLAGNG PTGPNATLQA HFSHGLPWQI SVIVSSQPTL AAATGSTSSF AVPTRFNGDI LSTMQAQYAD GSNAGPANWT SYLEYSTAFA ADYANGNTTL TSAFFDSLTD GAKVTLTFHF WSGATATYYV TKNGTAVTGT TS
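
The gene sequence begins with GTG while the protein begins with M, because translation table 11 accepts GTG as an alternative start codon. The sketch below shows one way to strip the 10-character grouping, compute a GC fraction, and translate a CDS under that convention. The helper names are hypothetical, and the start-codon set is an assumption limited to the common bacterial initiators rather than the full table 11 list.

```python
# Sketch only: helper names are illustrative and not part of the record.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order
# (tables 1 and 11 share these assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the common bacterial initiators are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as M."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
prefix = "GTGCATCGCT CTCCGCTACG CAAGGTTCGA"
print(translate(prefix))    # MHRSPLRKVR
print(gc_fraction(prefix))  # 0.6
```

The translated prefix matches the first ten residues of the protein sequence. The 0.6 figure covers only these 30 bases; the record reports 65% for the whole gene.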