Gene Caci_1349 details

Gene Information

Locus tag: Caci_1349
Symbol:
ID: 8332687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1537007
End bp: 1538368
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 71%
IMG OID: 644954497
Product: beta-galactosidase
Protein accession: YP_003112113
Protein GI: 256390549
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
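
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Neither is stated in the record, but both are consistent with the values shown. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1537007, 1538368)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```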


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.135254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCCC ATGCCCCGCT GCCGCCCTTC CCCGCTTCGT TCCACTGGGG CACGTCCACC 
TCCGCCTACC AGATCGAGGG CGCGCCGGCC GAGGACGGCA AGGGGGTCTC GATCTGGGAC
ACCTTCGTCC GCCGCCCCGG CGCCGTCCGC GACGGCCAGA CCGGCGACCT CGCCTGCGAC
CACTACCACC GCGGCGCCGA AGACACAGCC CTGATGGCGG ACCTCGGCGT CAACGCCTAC
CGCTTTTCCA TCGCCTGGAC ACGCGTCCAG CCCGACGGCT CCGGCCCGGC GAACCCGGCC
GGCCTCGCGT ACTACGAACA GCTCGTGGAT TCCTTGCTGG AGAAGGGGAT CACCCCCTTC
CCTACCCTGT TCCACTGGGA TCTTCCGCAG GCGCTCGAGG ACCGAGACGG CTGGCTCCAC
CGCGACACCG CCCACCGCTT CGCCGACTAC GCCGCGCTCG TCGCCGACCG CCTCGCCGAC
CGCGTCGAGC ACTGGATCAC GCTGAACGAA CCGTTCATCC ACCTGGCGTA CGGCTACGCC
TTCGGCATCC ACGCCCCGGG CCGCGCGCTG ATGACCGACG CCATCCCGGT CGCGCACCAC
CAACTCCTCG CCCACGGCAT GGCTGTCAAA GCCCTGCGAT CCGCGGGCGC GCGCAAGGTG
ATGATCGCCA ACAACTGCAC CCCGGTCTGG TCCGCGAGCG ACGCCCCCGA CGACAAAACG
GCTGCGGAGG CCTACGACAC CCTCCACAAC CACCTGTTCA ACGACCCGAT CCTGCTGGGC
ACCTATCCGG ACCTGTCCGC CTACGGCGCC GGCCCGGACC TGAACGGCGT CGTCCGCGAC
GGCGACCTGG ACGTCATCGC CGCGCCTCTG GACGGCCTCG GCGTCAACTA CTACAACCCG
ACCCGCGTCG CCGCCCCCGG TCCCGAACAC GGCCTGCCGT TCCAGGACCT GCCGATCGAA
GGCGTCCCGC GCACCGCCTT CGACTGGCCG GTGGTCCCCG ACGGCCTGCG CGAAGTCCTC
GTCGGCCTCG CCGACCGGTA CGGCGACGCC CTCCCGCCGA TCTACATCAC CGAGAACGGC
ACCTCGGTCG ACGACAAGGT GGTCGACGGC CGCGTCGCCG ACCCCGAGCG CATCGCCTTC
CTCGACGGCC ACATCAGGGC CCTGTCGCAG GCGATGGCCG CCGGCGTCGA CGTCCGCGGC
TACCTGACCT GGACCCTGCT CGACAACTTC GAGTGGGCCG AGGGCTTCCA CCAGCGCTTC
GGGCTGGTGC ACGTCGACCA CCAGACCCAG ACGCGCACGC CGAAGGACTC CTACTACTGG
CTGCGCGACC GACTCGCCGA GCTCGCTGCG ACGGAAGGCT GA
 
Protein sequence
MTAHAPLPPF PASFHWGTST SAYQIEGAPA EDGKGVSIWD TFVRRPGAVR DGQTGDLACD 
HYHRGAEDTA LMADLGVNAY RFSIAWTRVQ PDGSGPANPA GLAYYEQLVD SLLEKGITPF
PTLFHWDLPQ ALEDRDGWLH RDTAHRFADY AALVADRLAD RVEHWITLNE PFIHLAYGYA
FGIHAPGRAL MTDAIPVAHH QLLAHGMAVK ALRSAGARKV MIANNCTPVW SASDAPDDKT
AAEAYDTLHN HLFNDPILLG TYPDLSAYGA GPDLNGVVRD GDLDVIAAPL DGLGVNYYNP
TRVAAPGPEH GLPFQDLPIE GVPRTAFDWP VVPDGLREVL VGLADRYGDA LPPIYITENG
TSVDDKVVDG RVADPERIAF LDGHIRALSQ AMAAGVDVRG YLTWTLLDNF EWAEGFHQRF
GLVHVDHQTQ TRTPKDSYYW LRDRLAELAA TEG
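
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of such a translation in plain Python, not the method used to generate this record. The helper name `translate_cds` is hypothetical, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G). Table 11 uses the
# standard amino-acid assignments; it differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":             # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCGCCC ATGCCCCGCT GCCGCCCTTC"))  # MTAHAPLPPF
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed in the record, provided the sequence is pasted without the section labels.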