Gene Caci_5333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5333 
Symbol 
ID8336687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6147428 
End bp6148840 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content72% 
IMG OID644958431 
Productbeta-galactosidase 
Protein accessionYP_003116033 
Protein GI256394469 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.622206 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG GGGTCTTTCC AGAAAACTTC CTGTGGGGCG CGGCCACGGC GGCGTACCAG 
ATCGAGGGCG CGGCCGCCGA GGGCGGACGC GGACCGTCGA TCTGGGACAC GTTCAGCCGC
ACCCCCGGCA AGGTCCTGGC CGGCGACACC GGCGATGTGG CCGCCGACCA CTACCACCGG
TTCCGCGAGG ACGTCGCCCT GATGGGCAAG CTGGGCCTGG GCGCCTACCG GTTCTCCACC
GCCTGGCCGC GCGTGCAGCC GGCGGGGCGC GGACCGGCCA ACGCCGAAGG GCTGGCCTTC
TACGACGAGC TGGTGGACGA GCTGCTCGGC GCCGGCATCG AGCCGGTGCT GACCCTCTAC
CACTGGGACC TGCCGCAGGC GGTGGAGGAC GACGGAGGCT GGGGCGCACG CGACACCGCC
TACCGGTTCG CCGAATACGC GCGCCTGGTG GCCGAGCGCT TCGCCGACCG GGTGAAGCAG
TGGACCACCC TGAACGAGCC GTTCTGCTCG GCGTTCCTGG GCTACGCCTC CGGCGTGCAC
GCCCCGGGCC GGCACGAGCC GGAGGTCGCG CTGCGTGCGG CGCACCACCT GCTGCTCGGG
CACGGCCTGG CGCTGCGCGC GCTGCGCGAG ACGCTCCCGG CCGAGGCGCA GGTCTCGATC
ACGCTGAACG CGACCGAGTT CCGGCCGCTG ACCGACTCCC CGGAGGACGC CGACGCCCAG
CGCCGGGTCG ACGCGATCCA GAACCGCGTC TTCCTGGACC CGGTGTTCCG CGGCGCCTAC
CCGGAGGACC TGATCCGCGA CACGGCGGCG GTGACCGACT GGTCCTTCGT CGAGCCCGGG
GATCTGGAGC TGATCAGCGC CAAGGTGGAC CAGCTGGGGA TCAACTTCTA CAACCCCTCG
CTGGTCGCCG CGCCGCTGCC GCCGGGCGCC GAGGCCGGCC CGCGCGACGA CGGCCACGGC
CAGTCGGAGT ACTCGCCGTG GGTGGGCAGC GAGGGCGCGG TGCGCTTCGC CCGGCAGGAC
GGCGAGCGGA CCGCGATGGA CTGGGTCGTG GACCCCTCCG GCCTGGTCGA CCTGCTGCTG
CGGATCCACA ACGACTACGG CCCGATACCG ATCGCGGTGA CCGAGAACGG CGCGGCGTTC
GAGGACGTCC CCGGACCCGA CGGGGAGGTG GACGACCCGC GCCGGATCGC CTATCTGCAG
GCCCACATCG CGGCCGTTCG CGACGCCCTG GCGGCCGGCG TGGACATGCG CGGGTATTTC
GTCTGGTCGC TGCTGGATAA TTTCGAGTGG AGCTACGGAT ACTCCAAGCG GTTCGGCATC
GTGCGTGTCG ATTTCGCGAC CGGGAAGCGC GTCGTGAAGG CCTCTGGACA GTGGTACCGC
CGGATCGTCG AGGGCAACGG GAGCAGTTTG TGA
 
Protein sequence
MSDGVFPENF LWGAATAAYQ IEGAAAEGGR GPSIWDTFSR TPGKVLAGDT GDVAADHYHR 
FREDVALMGK LGLGAYRFST AWPRVQPAGR GPANAEGLAF YDELVDELLG AGIEPVLTLY
HWDLPQAVED DGGWGARDTA YRFAEYARLV AERFADRVKQ WTTLNEPFCS AFLGYASGVH
APGRHEPEVA LRAAHHLLLG HGLALRALRE TLPAEAQVSI TLNATEFRPL TDSPEDADAQ
RRVDAIQNRV FLDPVFRGAY PEDLIRDTAA VTDWSFVEPG DLELISAKVD QLGINFYNPS
LVAAPLPPGA EAGPRDDGHG QSEYSPWVGS EGAVRFARQD GERTAMDWVV DPSGLVDLLL
RIHNDYGPIP IAVTENGAAF EDVPGPDGEV DDPRRIAYLQ AHIAAVRDAL AAGVDMRGYF
VWSLLDNFEW SYGYSKRFGI VRVDFATGKR VVKASGQWYR RIVEGNGSSL