Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0432 |
Symbol | |
ID | 8331759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 480710 |
End bp | 482398 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953598 |
Product | cellulose-binding family II |
Protein accession | YP_003111225 |
Protein GI | 256389661 |
COG category | [R] General function prediction only |
COG ID | [COG3979] Uncharacterized protein contain chitin-binding domain type 3 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.302151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGGG CTCTCCGAGC AAGCCTGATC GCTGCTGGCG CCGGCGCTCT GCTGGTCAGC GGCCTGACAA TCGCCTCCCC GACAGCTTCG GCCGCCACGA CCCCAACCCC GACCGCCACC GCGCCCACCA CCCCGACCGC CACCGCGCCC ACCACCCCGA CCCCGTTGCC CACCGCCGTC TACAGCAAGA CCTCCGACTG GGGCAGCGGC TTCCAGGCGC AGTACGTGAT CACCAACCCG ATGTCCGTCC CGCTGAACAC CTGGTCGCTC CAGTTCAGCC TGCCGTCCAC CGAGAAGATC ACCAGCATGT GGGGCGGGAC CGACACCGCG AGCGGTACCA CGCACACCGT CCTCGCGCAG TCCTACGACA CCACGATCGC GGCCGGCGCG TCGATCACCA TCGGTTTCGA CGGCAGCTAC TCGGGGACGT ACGCGGACCC GTCCGCCTGC AAGCTCAACT CAGACCCCTG TGACGGCAGC GCGGACGCCC AAGCCCCGAC CGCGCCGACG GCACTGACCA CGCTGAGCAC CACCTCGAGC GCCGCGTCGC TGTCCTGGAC CGGCTCCGCC GACAACGTCA CGGTCGCCGG GTACAACGTG TATTCGGGTT CGACGATGGT CGCCACGTCC CCGGGCACGG CGGCGACCGT CACCGGTCTG GCGCCCTCGA CGTCCTACTC GTTCACCGTC CGCGCCGTCG ACGAAGCCGG GAACCTGTCG GCGCCGAGCG CAGCGGTCAG CGCCACGACG CTGGCAAGCA CCGGCGGCGG ACACCTGCCC GGCGTCGCCG CGCCTTTCGT CGACATCGGC GCCTGGCCCA CCCCGAACCT GACCCAGATC GCCGAGACGA CAGGCCTGCG CCAGTTCTCC CTCGGCTTCA TCGTCAACGG CACGGCCACC TGCACCCCGA GCTGGTTCAA CGCCTACGCC ATGTCCGCCG GCTTCGAGCA GTCGGACATC GCCACCCTGC GCGCGATCGG CGGCGACGTG AAGCCGTCCT TCGGCGGCGA GGCGGGCACC GAACTGGCGC AGTCGTGCAC CGACGTCCCG TCCCTGACAG CCGCCTACCA ATCGGCCATC ACCGCCTACA ACCTCACCCA GATCGACTTC GACATCGAGG GCTCCGCAGT CGCCGACCCC GCCTCCATCG ACCGCCGCTC CCAAGCCATA GCCGCCCTGC AGAAGAACGC AGCCGCAGCC GGCAAGCCCC TCACCGTCAC CCTGACCCTC CCGATCCTCC CCTCCGGCCT CACCACCGAC GGCCTCTACG TAGTCCAGTC AGCAGTGAAG TACGGCGCCA AGATCACCAC AGTCAACGGC ATGGCCATGG ACTTCGGCGA CATAGAAGCC CCCAACCCCA GCGGCAAGAT GGGCACCTAC GCCATCGACA CAGCCCAATC ACTCCACACC CAACTGACGC CCCTCTACCC AGCCCTCACC GCAACCCAGC TGTGGAACAT GATCGGCGTC ACCCCCATGA TCGGCCAGAA CGACAACTCC TCCGAAGTCT TCTACCAAAC CGACATGCAC CAACTCCTCA CCTTCGCCCA ACAACAACAC CTCGGCGAAC TAGCCTTCTG GGACGTCACC CGCGACGCCA ACGCCTGCAC CGGCTCCCTG TCAAAGTGCA CGGACATCCC ACAGACCCCC TACGAGTTCT CCAAGATGAT CGCGCCCTAT CAAGGCTGA
|
Protein sequence | MLRALRASLI AAGAGALLVS GLTIASPTAS AATTPTPTAT APTTPTATAP TTPTPLPTAV YSKTSDWGSG FQAQYVITNP MSVPLNTWSL QFSLPSTEKI TSMWGGTDTA SGTTHTVLAQ SYDTTIAAGA SITIGFDGSY SGTYADPSAC KLNSDPCDGS ADAQAPTAPT ALTTLSTTSS AASLSWTGSA DNVTVAGYNV YSGSTMVATS PGTAATVTGL APSTSYSFTV RAVDEAGNLS APSAAVSATT LASTGGGHLP GVAAPFVDIG AWPTPNLTQI AETTGLRQFS LGFIVNGTAT CTPSWFNAYA MSAGFEQSDI ATLRAIGGDV KPSFGGEAGT ELAQSCTDVP SLTAAYQSAI TAYNLTQIDF DIEGSAVADP ASIDRRSQAI AALQKNAAAA GKPLTVTLTL PILPSGLTTD GLYVVQSAVK YGAKITTVNG MAMDFGDIEA PNPSGKMGTY AIDTAQSLHT QLTPLYPALT ATQLWNMIGV TPMIGQNDNS SEVFYQTDMH QLLTFAQQQH LGELAFWDVT RDANACTGSL SKCTDIPQTP YEFSKMIAPY QG
|
| |