Gene Caci_4940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4940 
Symbol 
ID8336294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5638589 
End bp5641024 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content70% 
IMG OID644958039 
ProductBeta-glucosidase 
Protein accessionYP_003115641 
Protein GI256394077 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0446999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGACG TTCCCCATGC CGCCGTGAAG GCCGCCGCCT TACTGGCGAA ACTGTCCGCG 
GCTGAGAAGA TCGCGTTCCT GCACCAGCAG CAGCCCGCGG TCGAGCGCCT CGGTCTGGCC
GCGTTCCACA CCGGCTGCGA AGCCCTGCAC GGCGTGGCCT GGATCGGACG CGCCACCGTG
TTCCCCCAGG CGGTCGGGCT CGGCGCCACC TGGGACCGGG ACCTGCTGCG GCGGGTGGGA
GAAGCGGTGT CGACCGAGGT GCGGGCGTTC CGTCAGGACA CGGAGAACGC CTTCCGGGCC
AACGACGTGG CCGACAAGCA CCCGATGGTC TCGCTCAACG TCTGGGCGCC GGTGGTGAAC
CTGCTGCGCG ACCCGCGCTG GGGACGCAAC GAGGAGGGCT ACAGCGAGGA CCCGCACGCC
ACCGCCGAAA TGGCCACCGC GTACTGCCGC GGGCTGCGCG GCGACGACCC GGCCGTCTGG
CGCACGGCTC CGGTGCTCAA GCACTTCCTG GCGTACAACG TCGAGACCGA ACGCGACATC
ATCGACATCA AAGTCCCGCC GCGCGTCCTG CACGAATACG AGCTGCCGGC CTTCCGCGGT
CCGATCCTGG CCGGAGTGGC CGCCGGGGTG ATGCCGGGCT ACAACCTCAC CAACGGCGTC
CCCAACCATG TCCACTCGCT GCTCAAGGAC GCGCTGCGAG CGTGGAATCC GGAACTGGTC
GTCTGCTCCG ACGCCCAGGC TCCGTCGAAC CTGGTCGACC GCGAGAAGTA CTTCGCCACG
CATGAGGAGT CACATGCGGC AGCCCTCAAG GCCGGCGTGG ACAGTTTCAC CGACGGCGGT
CCGGACTCGC GGCTGACCGT CGAGCGTTTC ACCGGTGCGC TGTGGCAAGG GCTGATCACC
GAGGCCGACA TCGACGCGGC GGTCGGACGG GTGCTGGCGA TGCGCGCCGC GACCGGCGAG
TTCGACCCCG CCGTCGACCC CTACGCCGGC ATCCGCGCCG ACGTCATCAG CTGCCGGGCG
CACAACGACC TGGCGCTGGA AGCCGCCCGC GCGGCGATCG TCCTGCTCAA GAACGAGAAC
GAGGCGCTGC CGCTGGTCGT GCCCGAGTCC GGCGCGAGCG AGGAGGGTCT GGCCGTCGCG
GTGATCGGAC ACCTGGGAAG CAGGGTACTG ACGGACTGGT ACAGCGGCGA ACTGCCCTAT
GCCGTCAGCA TCGCCGACGG CGTGCGCACC GCCTTCGGCG ACCGGGCGGT GACCGCGGTG
GACGGCGCGG ACGTGGTCAA CCTGCGGGCC GGCGAGGCGG AGTTCGGGCC GTTCGCCCGC
CAGGACTGGG GCACGAGCGT CCAGTGCCCC GTCCCTGTGC ACACGCTCCA AGCAATCGAG
AACGGCAGGT ATCTGACACT CGCCGGCGAC GGCGACACCG ATGTGCTCGC CGACGCCGCG
ACACCGGACG GCTGGGTGGT CAAGGAGCTG TGGGAGTTCC ACCAGACCGA CGACGGGCAG
CGGCTCGTAC GCTCCAACGC CACGGGCCGC TACCTGCGCG TCGGGGACGA CGGCCGCCTG
GTCGCGGACG CCGACTCCGC CGAGGAGGCC ACGGCTTTCG TGATCGAGAC GGTGACCTCC
GGCATCCGCC AGGCGGTGGC GGCGGCTGCC TCGGCACAGC GCGCGGTCGT GGTGCTGGGG
AACGACCCGC ACATCAACGG ACGCGAGACC ATCGACCGCG ACGGGCTCGC GCTTCCGCCG
GACCAGGAGG CGCTGCTGCG CGCGGTGCTG CAGGCGAACC CGGACACGAC GCTCGTCCTG
GTATCGAGCT ATCCCTACGC GATCGGCTGG GCCGCCGAGC ATGTGCCCTC GATCCTGTGG
ACCGCACACG GCGGGCAGGA GGCGGGCAAC GCCGTCGCCG AGGTGCTGAC CGGAGCGTTC
AACCCGGCGG GACGGCTGCC GCAGACGTGG TACGCCCCCG ACGCTGATCT GCCCGCGCCG
GACGACTACG ACATCATCGG ATCGGGCTGG ACATATCAGT ATTCGCAGCG GGAGCACCTG
TACGCGTTCG GCCATGGCTT GTCGTACACG GATTTCGAGT ACGGAGAGCT CGTCCTGACG
GTCCGGGAAG ATCAGCGGGG ATCGTTCACG ATAGCGGCCG AGACGCTGAT CACCAACACC
GGAGCCGTCG CAGGTCAGGA GGTCGTGCAG TGCTATTCGC AGGCCCTTGA CGCCTCGGTA
CCCAGTCCCC TACACCGCCT GCAAGGCTTC GAACGGATCG AGCTGGCTCC TGGCGAATCA
CGCACCATCG CCTTCGAGGT ACCCGCGGAC CGGCTGTCGC ATTGGTCCGA GGAACTCGGC
GCTTTCCGTC TGGAGAGCGG GGACTACGAG TTCACCGTCG GCCGGTCCAG CGCCGATCTG
CCGTCTCGGG CGACGGTGAG ACTGGGTGCT CCTTGA
 
Protein sequence
MTDVPHAAVK AAALLAKLSA AEKIAFLHQQ QPAVERLGLA AFHTGCEALH GVAWIGRATV 
FPQAVGLGAT WDRDLLRRVG EAVSTEVRAF RQDTENAFRA NDVADKHPMV SLNVWAPVVN
LLRDPRWGRN EEGYSEDPHA TAEMATAYCR GLRGDDPAVW RTAPVLKHFL AYNVETERDI
IDIKVPPRVL HEYELPAFRG PILAGVAAGV MPGYNLTNGV PNHVHSLLKD ALRAWNPELV
VCSDAQAPSN LVDREKYFAT HEESHAAALK AGVDSFTDGG PDSRLTVERF TGALWQGLIT
EADIDAAVGR VLAMRAATGE FDPAVDPYAG IRADVISCRA HNDLALEAAR AAIVLLKNEN
EALPLVVPES GASEEGLAVA VIGHLGSRVL TDWYSGELPY AVSIADGVRT AFGDRAVTAV
DGADVVNLRA GEAEFGPFAR QDWGTSVQCP VPVHTLQAIE NGRYLTLAGD GDTDVLADAA
TPDGWVVKEL WEFHQTDDGQ RLVRSNATGR YLRVGDDGRL VADADSAEEA TAFVIETVTS
GIRQAVAAAA SAQRAVVVLG NDPHINGRET IDRDGLALPP DQEALLRAVL QANPDTTLVL
VSSYPYAIGW AAEHVPSILW TAHGGQEAGN AVAEVLTGAF NPAGRLPQTW YAPDADLPAP
DDYDIIGSGW TYQYSQREHL YAFGHGLSYT DFEYGELVLT VREDQRGSFT IAAETLITNT
GAVAGQEVVQ CYSQALDASV PSPLHRLQGF ERIELAPGES RTIAFEVPAD RLSHWSEELG
AFRLESGDYE FTVGRSSADL PSRATVRLGA P