Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3464 |
Symbol | |
ID | 8334817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3844552 |
End bp | 3846501 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956608 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_003114211 |
Protein GI | 256392647 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.235288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0024319 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCCTCC ATCGAAGCAG GCGCGCACGA CGCCGCATCC CCCACCTCGC GGCCGCGATA GCCGCGACCG GCGTCGCGGC CCTCGCCCTC GCGGGCGTCC AAGGCACAGC CCTGGCCGCG TCGGCCGCCC CCCACCCCGC AGCCGCCAAG GCCGCCGCCG CGGTCTTCAA CGTCAAGGAC TACGGAGCCA CCGGCAACGG CTCCACCAAC GACTCTCCAG CCATCAATAA GGCGGTCGCC GCCGCGAACA CCGCCGGCGG CGGCATCGTG GAGTTCCCGT CCGGCAGCTA CAAGTCCGCG AACACCGTCC ACCTCAAGAG CAACGTCACC ATCCAGCTCG ACGCCGGCTC CAAAGTCCTC GGCTCCAGCG CCAAGACCTA CGACGCCGCC GAGTCGAATC CCAACGACAA GTACCAGGAC TACGGCCACA GCCACTTCCA CGACGCGATG TTCTCCGGCG ACAAACTCTC CAACATCGGC TTCACCGGCT CCGGCACCAT CGACGGCGGC GGCAACCTGA TCACCGGCAA CCCCGGCGCC GGGCAGGCCG ACAAGATCCT GTCACTGACC CGCTGCACCA ACCTCACGCT CAGCGGCATC ACCCTCACCC GCGGCGGGCA CTTCGGCGCG CTGATCAACG GCTGCGACGG CGTGGTGTCC GACCACCTCA CGATCGCGAC CTCCAGCGAC CGCGACGGCT GGAACATCAT CTCCACGACG CACGTCACCA TCACCAACGC GAACATCTCC TCCAACGACG ACGCGCTGGT GTTCAAGAGC GACTGGGCCC TGGGCCAGAC GCTGCCCAGC GGCCACGTCA CCGTCACGAA CAGCACGCTG CAGGCCAAGT GTTGCAATGC CCTGATGTTC GGCTCGGAGA CCTGCGGATC CTTCACCGAC TACCGGTTCC AGCAGATCAC CATCCTCGGC GCCGGCAAAT CCGGGCTGGG CATCGTGAGC ATGGACGGCG CAGACATCTC CGACGTGCAC TACCAAGACG TCACCATGAC CGGTGTGCAC TCGCCGATCA TGGAGAAGAT CGGGACCCGA TTGCGATGCG GCGGCAGTCC GAAGGTCGGG CATATCAGTA ACGTGACATT CACCAACGTC ACGGGCACCG GCGTGGCGAG TACGGACTAC AGCCCGACGA TCTGGGGCGC CGACAGCAGC CACCAGGTCA GCGACGAGAC CTTCACCAAC GTCAACCTGA CCGTCCCCGG CGGCCACGGC ACGATGTCCA CCGGCGTCCC CAGCGACAAC GGCGACTACA ACCCGAACAG CATCGGTACG CGGCCGGCGT ACGGCTGGTA CCTGCACAAC GTCTCCGGCA TCCACTTCAC CGGCGGCTCG GTCAAGGTCG CCAAGACCGA CGGCCGCCCC GCGGTCATCG CCAACGCCGG CAGCGCCATC ACCTTCGACG GACTGACCGC GCAGACCGGC AGCTCCAGCC CGTTCGACGT CGGCTTCCAG AACATCACCG GATACTGCCT CAGCAACAGC CACAACACCT CCGGCGGCGC GCTGCGCGTC TCGGCCAGCG GCTCCACCCA AAGCTGCGGC TCCTCGGCGA CACGCTATGA GGCGGAGAAC GCGACGCTGT CCACCGGCGA CACCGTGGCC ACCAACCACA CCGGTTTTTC CGGCAGCGGC TTCGTCGACA CGACCAACGC CGTCGGTGCG TACGTCGAGT GGACCGTGAC CGCGCCGGCC GCCGGCACGT ACACAGCCAC CGTCGGCTAC GCGAACGGAA CCACCACCGA CCGGCCGATG GACGTCGCCG TGAACGGCAC GACCGCGGAC GCCGCAGCGT CGTTCCCCAC GACCGCGAGC TGGAACACCT GGGCCGGCAA GGCATTCAGC GTTCCGTTGA ACGCCGGCGC CAACACCATC CGGGTAGCCG CGAGCACCGC GAACGGCTGC CCGAACCTCG ACTACCTCGA CCTCGGCTGA
|
Protein sequence | MFLHRSRRAR RRIPHLAAAI AATGVAALAL AGVQGTALAA SAAPHPAAAK AAAAVFNVKD YGATGNGSTN DSPAINKAVA AANTAGGGIV EFPSGSYKSA NTVHLKSNVT IQLDAGSKVL GSSAKTYDAA ESNPNDKYQD YGHSHFHDAM FSGDKLSNIG FTGSGTIDGG GNLITGNPGA GQADKILSLT RCTNLTLSGI TLTRGGHFGA LINGCDGVVS DHLTIATSSD RDGWNIISTT HVTITNANIS SNDDALVFKS DWALGQTLPS GHVTVTNSTL QAKCCNALMF GSETCGSFTD YRFQQITILG AGKSGLGIVS MDGADISDVH YQDVTMTGVH SPIMEKIGTR LRCGGSPKVG HISNVTFTNV TGTGVASTDY SPTIWGADSS HQVSDETFTN VNLTVPGGHG TMSTGVPSDN GDYNPNSIGT RPAYGWYLHN VSGIHFTGGS VKVAKTDGRP AVIANAGSAI TFDGLTAQTG SSSPFDVGFQ NITGYCLSNS HNTSGGALRV SASGSTQSCG SSATRYEAEN ATLSTGDTVA TNHTGFSGSG FVDTTNAVGA YVEWTVTAPA AGTYTATVGY ANGTTTDRPM DVAVNGTTAD AAASFPTTAS WNTWAGKAFS VPLNAGANTI RVAASTANGC PNLDYLDLG
|
| |