Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4946 |
Symbol | |
ID | 8336300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5646243 |
End bp | 5648012 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958045 |
Product | Cellulase |
Protein accession | YP_003115647 |
Protein GI | 256394083 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0257714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCTCG TTGTCTTGTC CAGAACCCGG TGGCGGTCCC GTACGGTCGT CGCCGGAGTC GTCGGCGTGC TCTTCGCCGG TGTCTCCGGG GCGGTGCTCC CGGCGAACGC CGCGACAGCC GGTCCGGCCG CCGTCGCCGC GCCGAGTGTC GCCGCGCCGA CGAACGCCGC GACGAAGTGC GTCGCCGCCA TGCAGCCCGG CTTCAACATC GGCAACTCCC TCGACGCGAT CCCCGACGAG ACCTCGTGGG GCAACCCGCC GATCACCCAG GCGCTGCTCC AGAAGATCAA GTCTCTGGGA TACAAGAGCG TGCGCCTTCC GGTGACCTGG AGCGGACACG AGGGCGCCGC CCCCGATTAC CTGATCGACC CGGCCTGGAT GGCCCGCGTC AAGCAGGTGG TCGACTGGGC CCGGGCCGAC GGCCTGTCCG TGGTGGTCAA TGTTCACCAC GACTCGTGGC AGTGGATCAC GAACATGCCC ACCGACCCGA CGGTGCAGCC CCACTACGAC GCGATCTGGA CCCAGATCGC GAACGCGCTC AAGGACGAGC CCCGCTCGGT GGTCTTCGAA GCCGACAACG AGCAGGAGTT CACCGGCGTC ACCGACGACC AGGGCGAAGC GCTGCTCAAC ACGCTCCAGA CGGACTTCTT CCACATCGTG CGCGGCTCAG GAGGCGCGAA CGCCACACGC TTCCTGATGC TGTCGACGCT GGGCGACTCC GCCCAGAAGG CGTCAGAGGA CGCCCTCTCC TCCGAGATCG CCTCGCTGCA CGACCCGAAC CTGATCGCCT CGTTCCACTA CTACGGCTAC TGGCCGTTCG GGGTGAACAT CGCCGGCGTC GACACCTTCG ACGCCACCTC GCAGCAGGAC GTCCTGAACG CCTTCACCCT GATGCACGAC GAGTTCGTGG CCAAGGGCAT ACCGGTCTAC GCCGGTGAGG TCGGTCTCTA CAACGACTTC AGAGGGTTCG GCGGCCTGGA GACCGGCGAG ATGCTGAAGT ACTACGAGCT GCTGGGCTAC GAGGCGCGCA CCACCGGCAT CACCCTGAGC TACTGGGACG ACGGCGGCCG CATCCTGGAC CGCACCAGCC TGCAGCTGAT CGAGCCAACC ACGTTCGCCG CGGCGGCGTC GAGCTGGAAG ACCCGCTCGG GTACCGCGTC CAACGACACG CTGTACGTGC CCAAGACGAG CCCGATCGCG GACGAGAGTC TGACGCTCAG CCCGAACGGT CTGCACTTCA CCGGGCTCTA CGACGGGAAC CGGCGGCTGC AAGAAGGCTG TGACTACACC GTCAGCGGCA CCAAGCTCAC CCTCAAGGCC GCCCTGCTGA CCAAGCTGGT CGGCGCCCAG AACTACGGGG TGAACGCCAC GCTGTCAGCG CACTTCTCGG CGGGACTGCC GTGGCAGATC AACGTCGTGA CCAACGCCCA GCCTGTGCTG TCCGCGGCGA CCGGCACGGC GACCGATCCG CTCGCCGTCC CCACGCAGTT CAACGGCGAC AGGGTCTTCA TGATGCAGTC CGTCTACGCG GACGGCACCA ACGCCGGTAC CGCCGCCTGG ACCGCGTACC AGGCCTACGG TCCGCCGACG GTGTCGGGCT CGGCGTTCTC CGGTGACTAC GCGAACAACG CAATCGTCTT GACCCCGGCC TACTTCGCCG CGCTCACCGA CGGCGCGCGC GTGACGCTCA CCTTCCACTT CTGGAGCGGC GCCACCGCGA CGTACTACGT GACCAAGTCC GGCAGCACCG TCACCGGAAC GCTGTCCTGA
|
Protein sequence | MRLVVLSRTR WRSRTVVAGV VGVLFAGVSG AVLPANAATA GPAAVAAPSV AAPTNAATKC VAAMQPGFNI GNSLDAIPDE TSWGNPPITQ ALLQKIKSLG YKSVRLPVTW SGHEGAAPDY LIDPAWMARV KQVVDWARAD GLSVVVNVHH DSWQWITNMP TDPTVQPHYD AIWTQIANAL KDEPRSVVFE ADNEQEFTGV TDDQGEALLN TLQTDFFHIV RGSGGANATR FLMLSTLGDS AQKASEDALS SEIASLHDPN LIASFHYYGY WPFGVNIAGV DTFDATSQQD VLNAFTLMHD EFVAKGIPVY AGEVGLYNDF RGFGGLETGE MLKYYELLGY EARTTGITLS YWDDGGRILD RTSLQLIEPT TFAAAASSWK TRSGTASNDT LYVPKTSPIA DESLTLSPNG LHFTGLYDGN RRLQEGCDYT VSGTKLTLKA ALLTKLVGAQ NYGVNATLSA HFSAGLPWQI NVVTNAQPVL SAATGTATDP LAVPTQFNGD RVFMMQSVYA DGTNAGTAAW TAYQAYGPPT VSGSAFSGDY ANNAIVLTPA YFAALTDGAR VTLTFHFWSG ATATYYVTKS GSTVTGTLS
|
| |