Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4512 |
Symbol | |
ID | 8335866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5137782 |
End bp | 5139230 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957614 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003115216 |
Protein GI | 256393652 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.625596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGC TCGATGTGGT GATCGCCCGC GTGCTGCAGC CGGGGTTCGT GGGAACGTCA CCGCCGGACT GGCTGCGGCG GCGGCTGGCC GGCGGGCTGG GATCGGTGAT TCTGTTCGCG GCGAACATGG AGTCGCCGGA GCAGGCGCGG ACGCTGACCA ACGAGCTGCG CGCTGAGAAT CCGGACGTGT TCGTGGCTGT GGACGAGGAA GCCGGGGACA TCACCCGGCT GGAAGCCGCC ACCGGCTCCT CCTACCCCGG GAACCTGGCA CTCGGCGCGA TCGACGACAT CGCGCTGACC GAGGCCACGG CCCGCTCGAT CGGCGCCCTG GTGCACTCGG CGGGGATCGA CCTGACCTAC GCACCGGTCG CCGACGTGAA CACCGAGCCG CGCAACCCGG TCATCGGCGC ACGCTCCTTC GGCGCCGACC CCGCACTGGT CGCACGGCAC GTCGCGGCGA CAGTGCGGGG ACTGCACACG GCGGGCGTCG CGGCATGCGC GAAGCACTTC CCCGGCCACG GCGACACCGT GATCGACTCG CACTTCGGAC TCCCGACCGT CACCGCGACA CTGGAGCAAC TCCGCCACGA CACCCTGCCG CCGTTCAGCG CGGCGGTGGC CGCCGGCGTG CGGGCGGTGA TGGTCGCGCA CCTGCTGATG CCGGAGTTCG ACACCGAGCA CCCCGCATCG GTGAGCCCGG CGGTGATCGG CGGACTGCTG CGAACCGAGC TCGGCTTCGA CGGCCTGGCA GTCAGCGACG CGATCGGCAT GGCGGCGGTC CGCGAGCGCT ACGGGCTCGC CAGCGCGGCG GTGCGCGCAC TGGTCGCGGG GATCGACATG GTCTGCGTGG ACAGCGACTC CACCGACGCC GACCTGGCTG CGATCACGGA CGCGATCACC GCGGCGCTGC GAGACGGGAC GCTGAGCGAG GCGCGGTTGG TGGAGGCTGC TGAGAAGGTG GCGGGTTTCG CCGCGTGGCG ACGGGAGGCT CGGGACGCTT CGGTTTCTGT GTTTGATGCG CCACTCGGCG ACGGACTCGC AGCAGCACGC CGCGCGGTCG CGGTGCTGCG TGCGGACGAC GGTGCGCTGC CACTGAAATC AGCGCCGCAT GTGGTGGAGG TGGATCTCCC CCGAAGCATG GCCGACCACC TGGCCACGCT GCTGCCCGGA ACCACCAACT CAAAGCTCTC CGACACGATC GACGCCGCCG CTGCGACCGG CAGCTTTGCT ATCCCCGCCG ACCAACCCTT GGTCGTCGTC GTCCGCGGCA TCCAGCGCTC CCCCGAGGAC CTGGACCGCG TGGCCCGCCT CGTGAAGGAG CGCCCCGACG CGATCGTCGT CGACCTCGGC GTCGCGCACA TCGACCCTGG CGGCGCGGCG TGGGTCGCAG GGTACGGCGT CTCGCGCGTG AGCCTGCAGG CTGTGGCGGA GGTCCTTGCC GGCAGGTAG
|
Protein sequence | MSELDVVIAR VLQPGFVGTS PPDWLRRRLA GGLGSVILFA ANMESPEQAR TLTNELRAEN PDVFVAVDEE AGDITRLEAA TGSSYPGNLA LGAIDDIALT EATARSIGAL VHSAGIDLTY APVADVNTEP RNPVIGARSF GADPALVARH VAATVRGLHT AGVAACAKHF PGHGDTVIDS HFGLPTVTAT LEQLRHDTLP PFSAAVAAGV RAVMVAHLLM PEFDTEHPAS VSPAVIGGLL RTELGFDGLA VSDAIGMAAV RERYGLASAA VRALVAGIDM VCVDSDSTDA DLAAITDAIT AALRDGTLSE ARLVEAAEKV AGFAAWRREA RDASVSVFDA PLGDGLAAAR RAVAVLRADD GALPLKSAPH VVEVDLPRSM ADHLATLLPG TTNSKLSDTI DAAAATGSFA IPADQPLVVV VRGIQRSPED LDRVARLVKE RPDAIVVDLG VAHIDPGGAA WVAGYGVSRV SLQAVAEVLA GR
|
| |