Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7349 |
Symbol | |
ID | 8338719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8532511 |
End bp | 8533779 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644960430 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003118017 |
Protein GI | 256396453 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.197133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG CCGTCGTAGG CGGAGGATCG ACGTATACGC CGGAGCTGGT GGACGGCTTC GCACGGCTGC GCGACACCCT GCCGCTCACC GAGCTCGCCT TGATCGACCC GGCCGCGGAC CGGGTGGAGC TGATCGGCGG GCTGGCCCGC CGGATCTTCG CCAAGCAGGG GCATCCCGGC ACCGTCACCA CGCACACCGA GCTGGAGTCC GGCATCGAGG GTGCGGACGC GGTGCTCATC CAGCTGCGGG TCGGCGGTCA GACGATCCGG AACGTCGATG AGACGTTCCC GCTGGAGTTC TGCTGCGTCG GCCAGGAGAC CACCGGCGCC GGCGGCTTCG CCAAGGCGCT GCGGACCGTG CCGGTGGTGC TGGACATCGC CGAGCGCGTC CGGCGAATAG CCCCGCAGGC CTGGATCATC GACTTCACCA ACCCGGTCGG TATCGTCACC CGCGCGCTGC TGGACGCCGG GCACCGCGCC GTCGGGCTGT GCAACGTGGC CATCGGCTTC CAGCGCCGCG CCGCCGCGCA CCTGGGCGTG CAGCCCTCGC GCATCAAGCT CGACCACGTC GGTCTCAACC ACCTGACCTG GGAGCGCGGC TTCTACCTCG ACGGCGAGGA CTTCCTGCCG AAGTACCTCA GCGAGTCGCT CGAGGAGATC TCGCACGACA TCGAGCTGCC GGCCGAGCTG ATCCAGCGGC TCGCCGCGAT CCCCTCCTAC TACCTGCGCT ACTTCTACGC CCACGACATC GTGGTGAAGG AGCAGATCGA CCAGGTCGCC AAGGGCGAGA ACCGCGCCAA GGCGGTCGCC GCGGTCGAGG CCGAACTCCT GGCGCAGTAC GCGGACCCGA CGCTGGACAC CAAGCCGGAG GCGCTGAGCA AGCGCGGCGG GGCGTTCTAC TCCGAGGCGG CAGTCGAGCT GCTGGCCTCG CTGCACGGCG ACCTCGGCGA AGAATTGGTC GTCAACGTCC GCAACGCGGG CACCTTCCCC TTCCTGGCCG ACGACGCGGT GATCGAGGTC CCGGCGATCG TCGACGCCTC CGGGGTGCGC CCGGCGCCGT TGCGCGCGCC GATCGAGCCG CTGTACCGCG GGCTCATCGG ACACGTTTCC GCCTACGAGG AGCTGGCCGT GGAGGCCGCG ATCAAGGGCG GCGTGGAGCG GGTCCGCACC GCCCTGCTCG CGCACCCCCT GATCGGTCAG GCGGACCTGG CGGACAAGCT GGCCGACTCC CTGGTCGCCA AGAACCGCAG CTTCCTGCCG TGGGCGTGA
|
Protein sequence | MKLAVVGGGS TYTPELVDGF ARLRDTLPLT ELALIDPAAD RVELIGGLAR RIFAKQGHPG TVTTHTELES GIEGADAVLI QLRVGGQTIR NVDETFPLEF CCVGQETTGA GGFAKALRTV PVVLDIAERV RRIAPQAWII DFTNPVGIVT RALLDAGHRA VGLCNVAIGF QRRAAAHLGV QPSRIKLDHV GLNHLTWERG FYLDGEDFLP KYLSESLEEI SHDIELPAEL IQRLAAIPSY YLRYFYAHDI VVKEQIDQVA KGENRAKAVA AVEAELLAQY ADPTLDTKPE ALSKRGGAFY SEAAVELLAS LHGDLGEELV VNVRNAGTFP FLADDAVIEV PAIVDASGVR PAPLRAPIEP LYRGLIGHVS AYEELAVEAA IKGGVERVRT ALLAHPLIGQ ADLADKLADS LVAKNRSFLP WA
|
| |