Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4289 |
Symbol | |
ID | 8335643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4864183 |
End bp | 4866282 |
Gene Length | 2100 bp |
Protein Length | 699 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957392 |
Product | cellulose-binding family II |
Protein accession | YP_003114994 |
Protein GI | 256393430 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00315108 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAGAGTGG GGAGAGTAGG TACGCTCGCC ACGGCCGGGC TGCTCGCCGT GAGCGGCGCC GTGGCCGCCG TGCCGGGCGG CAGTGCCGCG GCGTCCGCGG TCACGGCGGC GAGCTCGACC GCTGCCGCCT CGTCCGTCAA CGTCAGCGTC AACACTTTGG AAGGCCTCGG CACGATCCCC GCCACCGGCT ACGGGTTGAA CAGCGCGGTC TGGGACAGCC AGATGAACAC CCCGTCGGTG CAGGGTCTGC TCGGCCAAGC CGGGATCGGG ATGCTGCGCT ACCCCGGTGG TTCCTATGGC GATATGTACC ATTGGCAGAC CAATACCGCT CCCGGCGGCT ACGTGGCGCC GGGCACCGAC TTCGACTCCT TCATGGGCAC CGTCAAGACC ATCGGCGCGC AGCCCATCCT GATCGCCAAC TACGGCACCG GCACGCCGGA GGAAGCCGCC GCCTGGGTCC AGTACGCCAA TGTCACCAAG GGCTATGGCG ACAAGTACTG GGAAGTCGGC AACGAGAACT ACGGCAACGG CTACTACGGC AGCGGCTGGG AGGCCGATGA CCACAGCAGC AAGAGCCCGA CCACTTATGC GCAGAACGTC ATCCAGTACA GCAAGGCGAT GAAGGCGGTC GACCCCACGA TCAAGGTCGG CGCCGTGCTG ACCATGCCCG GCAACTGGCC CGACGGCTCC GTGGCGACCG GCGATTCCGC TGACTGGAAC CGCACGGTCC TGTCGATCGC GGGCTCGGCG GTCGACTTCG TCATCGTGCA CTGGTATCCG AACGGCACCG GTGCCGCCAC CGCGCTGACC GAACCGGACC AGCTGCCCGG GGAACTGTCG CAGCTGCGCA GCGAGATCAA TGAGTACGGC GGAGCCAACG CCTCCCGTCT CGGCGTGGCG CTGACTGAGG TGAACGCCGG CGTCGACGAG GACACGCAGC CTGATGCGCT GTTCGGCGCG GACACCTACT TCACGGCGCT GGAGCAGGGC GTGTTCACGG TCGACTGGTG GGACACCCAC AACGGACCGA CGCAGATCAG CACCGCACCC GACGGCGCCA CGGATTACGA CGACTGGGGT GTGCTGTCCA GCGGCACGTG CGTGGGCTCG GTCTGCGAAC CGGCGATGAA CACCCCGTTC CCGAGCTACT ACGCCATCAG CATGCTGAGC AAGCTCGGAC ATCCGGGAGA CCAGATGGTC CGAGCCGGCA CCGACCAGCA ACTCGTCGCG GCGCACGCCG TGAAGCAGGC CAACGGGAAC CTGGCCGTGA TGCTGGTGAA CAAGGACCCT GCGAACGCGT ACACGGTGAA CCTGCACTAC TCCGGCTACA CGCCGAGCAC CGCGACGCCG ACCGTCTACA CCTACGGCGA CGAAGCCAGC TCGATCACCT CGGCGGCGCA GGGCAGTAGC GCGGTCCAGA CCTTGCCGCC CTACTCGATC GAGACTGTCG TCCTGACTCC GTCCGGCAAC CATGTCAGTA CTCTGAAGGC ACCGGGCTCA CCGGTTGTCT CCCAGGTCAC CGATACGCAG GCGACGGTGA GCTGGACTCC CTCCAGCGGC GGCACCGCCA CCCGCTACGA GATCTACCGG CAGTTCGGCA CGACCAGCGA ACTGCTGGCC GAGTCGACGT CCACCTCGGC GACGATCGCC AACCTGGTTC CCGGCACCGG CTATACGTTT AACGTCCTCG CCACGGACCA GAGCGGCAAC CTGTCTCCGC CCTCGGATCC GCTCACCTTC ACCACCGGCA CGCCGGCCGC CAGCACCTGC GCGGTGGACT ACCAGGTCAC CTCGGGCTGG GGGAGCGGCT ACGTCACCGC GATCACCGTC ACCGACACCG GGCCCGCCCC GATCAACGGC TGGTCGCTGA CCTTCACCTT CCCCAGTACC AGCGAAACGC TGTCCTCCGG CTGGAACGCC ACCTGGACCG GGACCGGCCA GAACATCGAG GCCACCAGCC TGAGCTGGAA CGCGAACCTG GCGGCGAACG GCGGCAACTC CGCCAGCATC GGGTTCGTCG GGAACAACAC CGGCGCTTAT CCCTCGCCGG CGGCGATCAG CTTGAACGGG ACGGTCTGTA GTACCACCTA CAGCTCCTGA
|
Protein sequence | MRVGRVGTLA TAGLLAVSGA VAAVPGGSAA ASAVTAASST AAASSVNVSV NTLEGLGTIP ATGYGLNSAV WDSQMNTPSV QGLLGQAGIG MLRYPGGSYG DMYHWQTNTA PGGYVAPGTD FDSFMGTVKT IGAQPILIAN YGTGTPEEAA AWVQYANVTK GYGDKYWEVG NENYGNGYYG SGWEADDHSS KSPTTYAQNV IQYSKAMKAV DPTIKVGAVL TMPGNWPDGS VATGDSADWN RTVLSIAGSA VDFVIVHWYP NGTGAATALT EPDQLPGELS QLRSEINEYG GANASRLGVA LTEVNAGVDE DTQPDALFGA DTYFTALEQG VFTVDWWDTH NGPTQISTAP DGATDYDDWG VLSSGTCVGS VCEPAMNTPF PSYYAISMLS KLGHPGDQMV RAGTDQQLVA AHAVKQANGN LAVMLVNKDP ANAYTVNLHY SGYTPSTATP TVYTYGDEAS SITSAAQGSS AVQTLPPYSI ETVVLTPSGN HVSTLKAPGS PVVSQVTDTQ ATVSWTPSSG GTATRYEIYR QFGTTSELLA ESTSTSATIA NLVPGTGYTF NVLATDQSGN LSPPSDPLTF TTGTPAASTC AVDYQVTSGW GSGYVTAITV TDTGPAPING WSLTFTFPST SETLSSGWNA TWTGTGQNIE ATSLSWNANL AANGGNSASI GFVGNNTGAY PSPAAISLNG TVCSTTYSS
|
| |