Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2517 |
Symbol | |
ID | 8333866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2846896 |
End bp | 2848284 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644955670 |
Product | beta-galactosidase |
Protein accession | YP_003113276 |
Protein GI | 256391712 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.447454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.232688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTGG AGCTACCCCG CGACTTCCGG TGGGGCGTCG CGACCTCGGC GTACCAGATC GAGGGGGCGG TGGGTGAGGA CGGCCGGACC CCGTCGATCT GGGACACCTT CTGCCGCGTT CCGGGCGCGA TCGACAACGG CGAGGACGGC GACGTCGCCT GCGACCACTA CCACCGGATG CCGGAGGACG TGGGGCTGAT CAAGTCCCTC GGCGTCGACA CCTACCGCTT CTCGGTGTCC TGGCCGCGCG TCCAGCCCGG GGGCAGCGGC CCGGCGAACG CCGCCGGCCT GGCGTTCTAC GACCGCCTCG TGGACGAGCT GCACAGCGCC GGCATCACTC CTTGGCTCAC GCTGTACCAC TGGGACCTGC CGCAGGAGTT GGAGGACGCC GGCGGCTGGC CGAACCGCGA CACCGCCTAC CGGTTCGCCG ACTACGCCGA GATCGTCTAC GACCGCCTCG GCGACCGCGT GGAGCACTGG ACGACGCTGA ACGAGGCGTG GTGCTCGGCC TGGCTGGGCT ACGTCGAGGG CGTCCACGCC CCGGGCCGCA AGGACTTCGC CGACGGCGTC GCCTCCATCC ACCACCTGCT GCTCGGGCAC GGCCTGGCGA CCCAGCGGCT GCGCGCCGCC GCGGCCGCCG CCGAGCGCCC GATCGACCTG GCGATCACCC ACATCCTGGG CAACTCGGTC CCGGCCTCGG ACTCCGAGGT CGACGTCGAG GCCGCCCGCC GCGGCGACGC CCTGCACTTC CGCGCCTACA TGGACCCGAT CTTCAAGGGC GCCTACCCCG AGGACCTGCT GGCCGACCTG GCCGCGATCG ACGTCCCGAT CCCGGTCCAG GACGGCGACC TGGAGATCAT CGCCGCGCCC CTGGACACCC TCGGCGTCAA CTACTACCGC TCGATGAAGA CCACCGGCTT CGGCGAAGAC GGCGGCACCA CCGACGCCGA CGGCCGCCCG GTGACCCGCA CAATCGACTT CGGCGGCCTC CCCAAGACCT ACATCGACTG GGAGGTCATG CCCGAGGACT TCGCCGACCT CCTGGTCCGC ATCAGCGAGG ACTACCCGGG CACGCCCCTG GTGATCACCG AGAACGGCGC GGCCTACAAC GACGTCCCCG ACGCCGACGG CTTCGTCGAC GACCAGGACC GCACGGACTA CATCGCCACG CACCTGGCCG CCGTGGCCCA GGCCCGCCAG CGAGGCGCGG ACATCCGCGG CTACTTCGCC TGGTCCCTGA TGGACAACTT CGAGTGGGCC TACGGCTACG ACAAGCGCTT CGGCATCATC CGCACCGACT ACCAGACCCA GACCCGCACC CCCAAGCGCA GCGCCCTCTG GCTGCGCGAC ACGATCGCCC GCGCCCGACG CCCCCGCGGC GACGACTGA
|
Protein sequence | MSLELPRDFR WGVATSAYQI EGAVGEDGRT PSIWDTFCRV PGAIDNGEDG DVACDHYHRM PEDVGLIKSL GVDTYRFSVS WPRVQPGGSG PANAAGLAFY DRLVDELHSA GITPWLTLYH WDLPQELEDA GGWPNRDTAY RFADYAEIVY DRLGDRVEHW TTLNEAWCSA WLGYVEGVHA PGRKDFADGV ASIHHLLLGH GLATQRLRAA AAAAERPIDL AITHILGNSV PASDSEVDVE AARRGDALHF RAYMDPIFKG AYPEDLLADL AAIDVPIPVQ DGDLEIIAAP LDTLGVNYYR SMKTTGFGED GGTTDADGRP VTRTIDFGGL PKTYIDWEVM PEDFADLLVR ISEDYPGTPL VITENGAAYN DVPDADGFVD DQDRTDYIAT HLAAVAQARQ RGADIRGYFA WSLMDNFEWA YGYDKRFGII RTDYQTQTRT PKRSALWLRD TIARARRPRG DD
|
| |