Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5333 |
Symbol | |
ID | 8336687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6147428 |
End bp | 6148840 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644958431 |
Product | beta-galactosidase |
Protein accession | YP_003116033 |
Protein GI | 256394469 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.622206 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG GGGTCTTTCC AGAAAACTTC CTGTGGGGCG CGGCCACGGC GGCGTACCAG ATCGAGGGCG CGGCCGCCGA GGGCGGACGC GGACCGTCGA TCTGGGACAC GTTCAGCCGC ACCCCCGGCA AGGTCCTGGC CGGCGACACC GGCGATGTGG CCGCCGACCA CTACCACCGG TTCCGCGAGG ACGTCGCCCT GATGGGCAAG CTGGGCCTGG GCGCCTACCG GTTCTCCACC GCCTGGCCGC GCGTGCAGCC GGCGGGGCGC GGACCGGCCA ACGCCGAAGG GCTGGCCTTC TACGACGAGC TGGTGGACGA GCTGCTCGGC GCCGGCATCG AGCCGGTGCT GACCCTCTAC CACTGGGACC TGCCGCAGGC GGTGGAGGAC GACGGAGGCT GGGGCGCACG CGACACCGCC TACCGGTTCG CCGAATACGC GCGCCTGGTG GCCGAGCGCT TCGCCGACCG GGTGAAGCAG TGGACCACCC TGAACGAGCC GTTCTGCTCG GCGTTCCTGG GCTACGCCTC CGGCGTGCAC GCCCCGGGCC GGCACGAGCC GGAGGTCGCG CTGCGTGCGG CGCACCACCT GCTGCTCGGG CACGGCCTGG CGCTGCGCGC GCTGCGCGAG ACGCTCCCGG CCGAGGCGCA GGTCTCGATC ACGCTGAACG CGACCGAGTT CCGGCCGCTG ACCGACTCCC CGGAGGACGC CGACGCCCAG CGCCGGGTCG ACGCGATCCA GAACCGCGTC TTCCTGGACC CGGTGTTCCG CGGCGCCTAC CCGGAGGACC TGATCCGCGA CACGGCGGCG GTGACCGACT GGTCCTTCGT CGAGCCCGGG GATCTGGAGC TGATCAGCGC CAAGGTGGAC CAGCTGGGGA TCAACTTCTA CAACCCCTCG CTGGTCGCCG CGCCGCTGCC GCCGGGCGCC GAGGCCGGCC CGCGCGACGA CGGCCACGGC CAGTCGGAGT ACTCGCCGTG GGTGGGCAGC GAGGGCGCGG TGCGCTTCGC CCGGCAGGAC GGCGAGCGGA CCGCGATGGA CTGGGTCGTG GACCCCTCCG GCCTGGTCGA CCTGCTGCTG CGGATCCACA ACGACTACGG CCCGATACCG ATCGCGGTGA CCGAGAACGG CGCGGCGTTC GAGGACGTCC CCGGACCCGA CGGGGAGGTG GACGACCCGC GCCGGATCGC CTATCTGCAG GCCCACATCG CGGCCGTTCG CGACGCCCTG GCGGCCGGCG TGGACATGCG CGGGTATTTC GTCTGGTCGC TGCTGGATAA TTTCGAGTGG AGCTACGGAT ACTCCAAGCG GTTCGGCATC GTGCGTGTCG ATTTCGCGAC CGGGAAGCGC GTCGTGAAGG CCTCTGGACA GTGGTACCGC CGGATCGTCG AGGGCAACGG GAGCAGTTTG TGA
|
Protein sequence | MSDGVFPENF LWGAATAAYQ IEGAAAEGGR GPSIWDTFSR TPGKVLAGDT GDVAADHYHR FREDVALMGK LGLGAYRFST AWPRVQPAGR GPANAEGLAF YDELVDELLG AGIEPVLTLY HWDLPQAVED DGGWGARDTA YRFAEYARLV AERFADRVKQ WTTLNEPFCS AFLGYASGVH APGRHEPEVA LRAAHHLLLG HGLALRALRE TLPAEAQVSI TLNATEFRPL TDSPEDADAQ RRVDAIQNRV FLDPVFRGAY PEDLIRDTAA VTDWSFVEPG DLELISAKVD QLGINFYNPS LVAAPLPPGA EAGPRDDGHG QSEYSPWVGS EGAVRFARQD GERTAMDWVV DPSGLVDLLL RIHNDYGPIP IAVTENGAAF EDVPGPDGEV DDPRRIAYLQ AHIAAVRDAL AAGVDMRGYF VWSLLDNFEW SYGYSKRFGI VRVDFATGKR VVKASGQWYR RIVEGNGSSL
|
| |