Gene Information |
Locus tag | Caci_4693 |
Symbol | |
ID | 8336047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5349195 |
End bp | 5350526 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957793 |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003115395 |
Protein GI | 256393831 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
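The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the reported gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5349195, 5350526)
print(length)                     # 1332
print(protein_length_aa(length))  # 443
```

Both results agree with the Gene Length (1332 bp) and Protein Length (443 aa) fields.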
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0209787 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.517895 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGATCCGCA TCGCCTTCGT CGGCGCCGGC TCGGTGGTCT TCACCCGGCA GCTGCTGCAC GACATCCTGT CCTACCCCGA GCTGTCCGGC GCGGCCATCG CGCTGCACGA CATCGACCCC GAGCGGCTGC AGGTCGCCGC CGCGCTGGCC GACCACGCCG CGCGCACCCT CGGCGCCGAG GTGTCCGTCA CGGCCACCAC CGACCGCCGC GCCGCCCTGG CCGGCGCCGA CGCCGTCGTC AACATGATCG CCGTCGGCGG CCACCAGGCC ACCGTCACCG ACTTCGAGAT CCCCGCCGCC GCCGGCCTGC GCCAGACCAT CGGCGACACC CTCGGCGTCG GCGGCATCTT CCGCGCCCTG CGCACCTTCC CGGTCCTGCG ATCCCTGGCG CAGGACATGG CCGAGGTCTG CCCCGACGCC TGGCTGCTGA ACTACACCAA CCCCATGGCC ATGAACATCC AGTACCTGAG CACGATCGCG CCGAAGCTGA AGGTCGCAGG CCTGTGCCAC TCGGTGTACT GGACCGTCCG CGGCCTGTGC GACATCATCG GCATCCCCCA CGACGACGTC GACGTCCTGT CCGCCGGCGT GAACCACCAA GCCTGGATCC TGCGCTGGCA GCACCAAGGC CGCGACCTCT ACCCGGCCCT GGACGCCGCC ATAGCAGCCA GCCCGGACCT GGCCCGCCGC GTCCGCGTCG ACATGTACCA GCGCCTCGGC TACTACCCGA CCGAGACCAG CGAGCACTCC TCCGAATACG TCCCCTGGTA CCTCGGCCAC GACACCGAAA TCACCCGCCT CCGCATCCCC GTAGGCGACT ACATCGACAT CAGCGCCGAA AACCTCGCCG AATACCGCGA ACTCCGCAAA GTCATCACCG ACGGCGGCGA CCCCGCCAAC GGCTGGGAAA GCGACGCCGC CGAATACGCC CCCCAAGTCA TCCACAGCCT AGCCACCGGC ACCCCCCGCA CCATCCAAGT CACCACCCCC AACACCGGCC TGATCAGCAA CCTCCCCGAA GCAGCCGCCG TCGAAGTCCC AGCAACCCTC GACCGCCTCG GCATCCACCC CCACCACGTA GGAGCACTCC CACCCCAACT AGCAGCCCCC AACCGCCACT TCCTCAACGT AGTCGACCTA GTAGTAGCCG CCGCCGTAGA AGGCGACCCC CGCCACATCC GCCACGCCGC AATGGCAGAC CCCGCCACAG CCGCAACCCT GACCGTCGAC CAGATCTGGA ACCTCTGCGA CGCCATGGTC ACCGCCCACG GAGACGCACT GCCGGAGCCA TTGCGGCGGA GCCCGCTCGC GGCGCGGGGC GGTGGGGGGT GA
Protein sequence | MIRIAFVGAG SVVFTRQLLH DILSYPELSG AAIALHDIDP ERLQVAAALA DHAARTLGAE VSVTATTDRR AALAGADAVV NMIAVGGHQA TVTDFEIPAA AGLRQTIGDT LGVGGIFRAL RTFPVLRSLA QDMAEVCPDA WLLNYTNPMA MNIQYLSTIA PKLKVAGLCH SVYWTVRGLC DIIGIPHDDV DVLSAGVNHQ AWILRWQHQG RDLYPALDAA IAASPDLARR VRVDMYQRLG YYPTETSEHS SEYVPWYLGH DTEITRLRIP VGDYIDISAE NLAEYRELRK VITDGGDPAN GWESDAAEYA PQVIHSLATG TPRTIQVTTP NTGLISNLPE AAAVEVPATL DRLGIHPHHV GALPPQLAAP NRHFLNVVDL VVAAAVEGDP RHIRHAAMAD PATAATLTVD QIWNLCDAMV TAHGDALPEP LRRSPLAARG GGG
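The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows how such a block-formatted gene sequence could be cleaned, its GC percentage computed, and its translation checked against the protein sequence. The helper names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code; this sketch simply reads the first codon as methionine, which is a simplification of how alternative start codons such as GTG are handled.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (standard assignments, shared by table 11).
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(ch in "GC" for ch in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Simplification (assumption): the first codon is always read as M.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "GTGATCCGCA TCGCCTTCGT CGGCGCCGGC"
print(translate_cds(prefix))  # MIRIAFVGAG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate_cds` would allow the GC content and Protein Length fields to be checked in the same way.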