Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4157 |
Symbol | |
ID | 8335511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4699982 |
End bp | 4702141 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644957260 |
Product | cellulose-binding family II |
Protein accession | YP_003114862 |
Protein GI | 256393298 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.894149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.131999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTCG ATCGCAGGCT CAGGGCGAGA TCGGTCGCCC TGACCGCCGC CACGGCTTTG TCCGCGCTGT CCATCGTCGG AGCGGTCGCG GCTCCGCAGG CGTCCGCCGC CACCGCGGTG TCGGTGACCG TCAACGGCAC AGCCGGGCTC GGTACCATTC CCGGCGGCGC GATCGGCCTG AACACCGCCG TCTACGACAG CTATATGAAC GACACCCCGA TCCCGGGTCT GCTCAAGGCC GCGGGAATCA ATGCTCTGCG CTACCCGGGC GGTTCGTACT CTGACATCTA CAACTGGCAG ACCAACGTCG CGCAGGGCGG CTACGACGCG CCGAACACGA GCTTCGCGGA CTTCATGGGG ACCGCGAAGG CTGCCTCGGC CAGCCCGATC ATCACCGTGA ACTACGGCAC CGGGACACCG GCGTTGGCCG CGTCCTGGGT GCAGAACGCC GCCGTCACCA ACAAGGACGG CGTCGCGTAC TGGGAGGTCG GCAACGAGGT CTACGGCAAC GGGACCTACG GCGCGAACTG GGAGACCGAC GCGCACTGCC AGACGTCCTC CGGAACGCCG GTCACCGTCG GCAGCGAGCC TTCGCAGACC TACGGTTGCG GTCCCTCGGT CTACGCCAAC AATGTCCTGA GTTACATGTC CTCGATGAAG GCGGTCAGCT CGAACGCCCA CGTCTGCGCG ATCCTGACCA CGCCGGGGTT CTGGCCCGAC AACGTCACCA ACGCCACGAC CAGCCCGCTT CCCTGGAACC AGACCGTGCT CACGGCGCTC GGCGCCAAGA CCGACTGCGT CATCGTGCAC TACTATCCCG GCGGCTCGAA CGCGGCCGGG ATGCTGACCG ACACCAGCGA CATCTCCGGG ATCATCTCGA CGCTGCACTC CCAGATCAGC CAGTACGCCA AGGTGAACCC GGCGAACGTG CCGATCCTGG TGACCGAGAC CAACTCCAAC GTGGACATGG ACACCCAGCC CAACGCGCTG TTCGCCGCCG ACATGTACAT GACCTGGCTG GAGAACGGCG TCGCGAACGT CGACTGGTGG GACGAGCACA ACGGCCCGGG GACCAACCCG CCGAGCGTCG TCAACGGCGC GCAGGACTAC GGCGACTACG GCATCTTCTC CACCGGCGGC AACAACAGCG GCGTGACCGA GCCGGCCGCC GAGACCCCGT TCGGGCCGTA CTACGGCATC GCGATGCTGT CCAAGCTCGG CGGACCCGGC GACACGATGG TGAACAGCAC GTCCTCCAAC GCGCTGGTCC GCGTCCACGC GGTGCGGCGG GCCGGCGGGA ACCTCGATCT GCTGATCGAC AACGAGGATC CCACCACCTC CTACTCGGTG AACCTGGCTT ACAACGGGTT CACGCCAGCC GGTAGCCCGA CGGTCTTCAC CTTCGCGAAC AACGGGAATT CGATCACCAG TGCGACGCAG AGCTCTGCGT CGTCGGTCAC GGTCGCTCCG TACACGCTCA CGGTCGTACA GGTCCCGGGC AGCGGCGGGG GAGGTGTGAC AGCACCGGGA GCGCCGGGGC AGCCGGTCGT CTCCGGGCTG GCGTCGAGCA CGTCCGGCAA CACCACCGGC GTGGCGACGC TGACCTGGCC AGCAGCCACG GCCGGCACGT ACCCGGTCGC GTCCTACCAG GTCTACCGGC AGAACAGCGG CGGCGGGACA ACCCTCGCCG GCACGACCAC CACGACGACG CTGAATCTCA GTGGCCTGAC GATCGGCGCG GGCTACACCT ATGACGTGGT CGCGGTGGAC TCCCACGGCA ACCCGTCGCT GCCCTCGCCA CCGGTGACGT TCACCGTGCC ACCCCCGGCG ACCGCGAGCT GCGCGGTGCA CTACGCGGTC AGCTCCTCCT GGTCCGGAGG CTTCGGTGCC GCGATCACGA TCACGAACCG CAGTGCGACC GCCATCAGTG CCTGGACCCT GAAATTCACC TGGCCCGACC CCGGCGAGGC GGTGCAGAGC GGCTGGAACG GCACCTGGAG CCAGAGCGGC TCGGCGGTGA CCGTGGTGAA CGCCGCATGG AACGGCACGA TCGCAGCCAA CGGCGGCACG GTGAGCCTCG GCTTCAACGG CGCGGACACC GGCCAGGACC CGGCGCCGAC CGTGTTCTCG CTCAACGGGA CGGTGTGCGC GAACAACTGA
|
Protein sequence | MPLDRRLRAR SVALTAATAL SALSIVGAVA APQASAATAV SVTVNGTAGL GTIPGGAIGL NTAVYDSYMN DTPIPGLLKA AGINALRYPG GSYSDIYNWQ TNVAQGGYDA PNTSFADFMG TAKAASASPI ITVNYGTGTP ALAASWVQNA AVTNKDGVAY WEVGNEVYGN GTYGANWETD AHCQTSSGTP VTVGSEPSQT YGCGPSVYAN NVLSYMSSMK AVSSNAHVCA ILTTPGFWPD NVTNATTSPL PWNQTVLTAL GAKTDCVIVH YYPGGSNAAG MLTDTSDISG IISTLHSQIS QYAKVNPANV PILVTETNSN VDMDTQPNAL FAADMYMTWL ENGVANVDWW DEHNGPGTNP PSVVNGAQDY GDYGIFSTGG NNSGVTEPAA ETPFGPYYGI AMLSKLGGPG DTMVNSTSSN ALVRVHAVRR AGGNLDLLID NEDPTTSYSV NLAYNGFTPA GSPTVFTFAN NGNSITSATQ SSASSVTVAP YTLTVVQVPG SGGGGVTAPG APGQPVVSGL ASSTSGNTTG VATLTWPAAT AGTYPVASYQ VYRQNSGGGT TLAGTTTTTT LNLSGLTIGA GYTYDVVAVD SHGNPSLPSP PVTFTVPPPA TASCAVHYAV SSSWSGGFGA AITITNRSAT AISAWTLKFT WPDPGEAVQS GWNGTWSQSG SAVTVVNAAW NGTIAANGGT VSLGFNGADT GQDPAPTVFS LNGTVCANN
|
| |