Gene Caci_4288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4288 
Symbol 
ID8335642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4862872 
End bp4864134 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content68% 
IMG OID644957391 
Productcellulose-binding family II 
Protein accessionYP_003114993 
Protein GI256393429 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.016596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCGCA ATCGCGCTTT CCTACGCAGC GTCCGCGTTC TCGCGGTCGC GGCTGCGGCT 
GTGCTGGCCA TGTCGGCCAT GTTCGTACTC GGATCCAGAC CCGCCGAGGG CGCCGGGACC
GGCTGCACGG TCGCCTATCA GGTGAATCAG TGGAACACCG GCTTCACCGC CGATATGACG
GTCACCAACG GCGGTCCGGC GGTGAGCGCG TGGACGGCGA CCTGGTCGTG GACCGGGAAC
CAGCAGGTCA CGTCCGGATG GAACGCGCAG GTCAGTCAGA GCGGTCAGCA GGTCACGGCG
ACCAACGAGC CGTACAACGG CTCGGTCGCG GCCGGGGCGA CGGTCGCCTT CGGGTTTCAG
GGCACGTATT CCGGCACGAA CAGCGCGCCT GTGAACTTCG CGTTCAACGG CGTGGCGTGC
AGTGGCCTCG GCACGCCGAG CTCGCCGTCG AGCACGCCTT CCACGCCGCC TTCCAGCTCG
CCCTCTAGCT CACCGTCCAC CTCGCCCTCT ACTTCACCCT CCAGTTCACC GTCCAGCTCG
CCGTCCAGTT CGCCGAGCAG CGGACCGTGC CCGGCGACGG CAGTCTTCTG CGACGGCTTG
GAGAATCAGA GCTCGACCAC ACCGTCCGGC CGCTGGTCGA TCGCCACCCC GAGCTGTTCG
GGGACCGGCA CCGCCGCGAT CACCACCGCG CAGGCGCACG CCGGGCTGAA GTCCCTGGAG
ATCGACGGTC GCGGCACGTA CTGCGACCAC GTCTTCGCCA GCGACACCAC CGACATGGCG
AGCGCCGCGC CGACCTGGTA CGTCCGATTC TGGATCAAGC ACACCGCGCC GCTGCCGACC
AACCACACGA CCTTCCTGGC GTTGAACGAC TCCGCGCACG GCAACACCGA CCTGCGGCTC
GGAGCCCAGA ACGGCGCGCT GATGTGGAAC CGGCAGTCCG ACGACGCCAC GCTGCCGGAC
CAGAGCCCGA ACGGGGTCGC GCAGAGCGTC ACACTGCCGA CCGGAGCCTG GGAATGTCTG
GAGTTCTCCG TGAGTGGCAG CAACGGCCAG ATCCACACCT GGTACAACGG CAGCCCGGTC
GCGGGCCTGA CCGAGGACGG CGTGGCGACG CCGGACACCG ACGACCAGTG GCTGGCTGGG
AGCGGAGCGT CCTGGCGTCC GCAGCTCACC GATCTGAAGC TCGGCTGGGA GAACTACAGC
AACGGCGACG ACACGCTGTG GTTCGACGAC GTGGTGTTGA GCACGAGCCG GATCGGATGT
TGA
 
Protein sequence
MYRNRAFLRS VRVLAVAAAA VLAMSAMFVL GSRPAEGAGT GCTVAYQVNQ WNTGFTADMT 
VTNGGPAVSA WTATWSWTGN QQVTSGWNAQ VSQSGQQVTA TNEPYNGSVA AGATVAFGFQ
GTYSGTNSAP VNFAFNGVAC SGLGTPSSPS STPSTPPSSS PSSSPSTSPS TSPSSSPSSS
PSSSPSSGPC PATAVFCDGL ENQSSTTPSG RWSIATPSCS GTGTAAITTA QAHAGLKSLE
IDGRGTYCDH VFASDTTDMA SAAPTWYVRF WIKHTAPLPT NHTTFLALND SAHGNTDLRL
GAQNGALMWN RQSDDATLPD QSPNGVAQSV TLPTGAWECL EFSVSGSNGQ IHTWYNGSPV
AGLTEDGVAT PDTDDQWLAG SGASWRPQLT DLKLGWENYS NGDDTLWFDD VVLSTSRIGC