Gene Caci_3604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3604 
Symbol 
ID8334957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4025343 
End bp4028264 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content69% 
IMG OID644956746 
Productglycoside hydrolase family 48 
Protein accessionYP_003114349 
Protein GI256392785 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAA AGTTGTCGCG CAGACAATTC GCGACCGCGG CCGGCGGAGC CGTCCTGGCG 
TCGGCCGTCG CACCGTCCAT GTCGCGGGCG GCGAGTGTCG CCCCGGCCGC GGCCTCGGCC
GCCACGGATG CCTACACCCA GCAGTTCCTG ACCCAGTACA AGAAGATCAA GGACCCGGCG
AACGGCTACT TCAGCGCGCA GGGGATCCCG TACCACAGTG TGGAGACGCT GATCGTCGAG
GCGCCGGACT ACGGGCACCA GACGACCTCG GAGGCGTTCA GCTTCTGGAT GTGGCTGGAG
GCGACGTACG GCCGGGTGAC CGGTGACTGG ACGGCGTTCA ACAACGCCTG GACGACGGCC
GAGCACTACA TCATCCCGCA GCACGTCGAT CAGCCGAGCA ACAGCTCCTA CAACCCCAAC
TCGCCGGCGA CCTACGCCCC GGAGTGGCCG GACCCCAGCA GCTACCCCAG CCCGCTCAAC
ACCTCGGTGT CGGTCGGCCA GGACCCGCTG GCCAACGAGC TGACCTCGAC GTACGGCACG
TCGGACATCT ACGGCATGCA CTGGCTGATG GACGTCGACA ACAAGTACGG CTACGGGAAC
ACGCCCGGTA CCGGCGGGGA GGCCGGTCCC AGCGCGACCG GGCCGTCGTA CATCAACAGC
TACCAGCGCG GCGCGGCGGA GTCGGTGTGG AAGACGATTC CGCAGCCGAC CACGGACCTG
TTCAACTACG GCGGTCCGAA CGGATACCTC GACCTGTTCG TGGCGCAGTC CGGCTCCTAC
TCCAAGCAGT GGAAGTACAC GACTGCCCCC GACGCCGACG CCCGCGCCAT CCAGGCCGCG
TACTGGGCCT ACCGCTGGGC CTCGGCGCAG GGCGCGCAGG GCCAGATCGC CGCCTCGGTC
GCGAAGGCCG CGAAGATGGG CGACTTCCTG CGCTACTCGC TGTTCGACAA GTACTTCAAG
CAGATCGGGA ACTGCACGAA CGCCAGCTCC TGCGCGGCCG GCACCGGCCG TGGCTCCGAG
CACTACCTGC TGGCCTGGTA CTACGCCTGG GGCGGGGCGG AGCCCGGCGG CGGCTGGGCC
TGGCGGATCG GCGACGGCGC CGCGCACCAG GGGTATCAGA ACCCGCTGGC GGTGTGGGCC
ATGACGAACA TCGCCGCGCT GACCCCCATG TCTCCGACCG CCAAGAGCGA CTGGACCGCC
AGCCTGACCC GGCAGATGGA GTTCTACCAG TGGCTGCAGT CCGCCGAGGG CGCCATCGCC
GGCGGCTGTA CGAACAGCTG GAACGGCTCG TACAGCGTGC CGCCGTCGGG TGACTCGACG
TTCTACGGCA TGGCCTACGA CTGGGAGCCG GTCTACCACG ACCCGCCGAG CAACAACTGG
TTCGGCATGC AGGCCTGGTC GATGGAGCGG CTCGCGGAGT TCTACTACGT CACCGGCAAC
GCCACCGCCA AGACGATCCT GGGCAAGTGG ATCACCTGGG CTTCCTCGAA GACCACGGTC
ACCGCCACCA ACTTCCAGAT CCCCTCCACG CTCGGCTGGA CCGGACAGCC GGACACCTGG
AACCCGACGA GTCCGGGCGG CAACTCCGGG CTGCACGTGA CGGTCGCCGA CTACGGCAAC
GACGTCGGCG TCGCGGCGGC GTACGTCAAG ACCCTGACGT ACTACTCCGC CAAGTCCGGC
GACACCGCCT CCGGCGCCCT CGCCAAGAGC CTGCTCGACG CGATGGCGAC CTTCGCCGAC
ACCGCCGGCA TCGCCACGCC CGAGACGCGC ACCGACTACA GCCAGTTCAA CGACACGGTG
TACGTGCCCT CCGGCTGGTC CGGCAAGATG CCCAACGGCG ACCCGATCGC CCCCGGCGCC
ACCTTCTTGT CGATCCGCTC GTGGTACAAG AACGACCCGG CCTGGCCGAA GGTGCAGGCC
TACCTCAACG GCGGATCCGC CCCGACGTTC ACCTACCACC GCTTCTGGGC CCAGGCGGAC
ATCGCCATGG CCTACGCGGT GTACGGCGAG CTGATCGCCG GCGGCGGCGG TACCGGCGGC
GACACGACGC CGCCGAGCGT GCCGACCGGT CTGACCGTCA CGGGGACCAC CAGCAGCACC
GCCTCGCTGT CGTGGACGGC TTCGACCGAC AACATCGGGG TGGCCGGCTA CACCGTGTAC
CGGGGCACCA CCGTGGCCGG TTCCGCGACC ACGCCGACGT TCACCGACTC CGGACTGGCC
GCCTCGACGC AGTACAGCTA CACGGTCACG GCCCACGACG CCGCCGGCAA CGTCTCCGCC
GCCTCCGCCG CCGTCAAGGC CACCACCACC GCCGGGACCG GCGGCGGCGA CACGACGCCG
CCGAGCGTGC CGACCAACCT GGCGGTCACC GCCACGACCA GCAGCAGCGT CTCGCTGTCG
TGGACGGCCT CGACCGACAA CGTCGCGGTG ACCGGCTACA CCGTGTACCG CGGCACCACG
GTGGCGGGCA CCACGACCTC GCCGACCTTC ACCGACTCCG GACTGACCGC CTCGACGCAG
TACAGCTACA CGGTCACGGC CCACGACGCC GCCGGCAACG TCTCGGCAGC CTCCGCCGCC
GTCAAGGGCA CCACCTCCGG AACCGGCGGG GGCACCGGCC CGACCTGCAC CGCTACCTAC
AGCGTCACCA GCGACTGGGG CAACGGCTTC AACGGCAACG TCACCATCAC CAACACCGGG
ACGACCGCGA CCAAGTCCTG GAAGGTCACC TGGACCTGGG GAGGCAACCA GACCATCACC
AACACCTGGA ACGCCACCGA AACCCAGTCC GGCAAGGCCG TGACCGCGAC CAACGCCCCC
TACAACAACG TCATCGCCCC CGGCGCCAGC ACCAGCTTCG GCTTCAACGC CAGCTACTCC
GGCACCAACG GCGCGCCGAC GGTCACCGTC ACCGCTACGT GA
 
Protein sequence
MTKKLSRRQF ATAAGGAVLA SAVAPSMSRA ASVAPAAASA ATDAYTQQFL TQYKKIKDPA 
NGYFSAQGIP YHSVETLIVE APDYGHQTTS EAFSFWMWLE ATYGRVTGDW TAFNNAWTTA
EHYIIPQHVD QPSNSSYNPN SPATYAPEWP DPSSYPSPLN TSVSVGQDPL ANELTSTYGT
SDIYGMHWLM DVDNKYGYGN TPGTGGEAGP SATGPSYINS YQRGAAESVW KTIPQPTTDL
FNYGGPNGYL DLFVAQSGSY SKQWKYTTAP DADARAIQAA YWAYRWASAQ GAQGQIAASV
AKAAKMGDFL RYSLFDKYFK QIGNCTNASS CAAGTGRGSE HYLLAWYYAW GGAEPGGGWA
WRIGDGAAHQ GYQNPLAVWA MTNIAALTPM SPTAKSDWTA SLTRQMEFYQ WLQSAEGAIA
GGCTNSWNGS YSVPPSGDST FYGMAYDWEP VYHDPPSNNW FGMQAWSMER LAEFYYVTGN
ATAKTILGKW ITWASSKTTV TATNFQIPST LGWTGQPDTW NPTSPGGNSG LHVTVADYGN
DVGVAAAYVK TLTYYSAKSG DTASGALAKS LLDAMATFAD TAGIATPETR TDYSQFNDTV
YVPSGWSGKM PNGDPIAPGA TFLSIRSWYK NDPAWPKVQA YLNGGSAPTF TYHRFWAQAD
IAMAYAVYGE LIAGGGGTGG DTTPPSVPTG LTVTGTTSST ASLSWTASTD NIGVAGYTVY
RGTTVAGSAT TPTFTDSGLA ASTQYSYTVT AHDAAGNVSA ASAAVKATTT AGTGGGDTTP
PSVPTNLAVT ATTSSSVSLS WTASTDNVAV TGYTVYRGTT VAGTTTSPTF TDSGLTASTQ
YSYTVTAHDA AGNVSAASAA VKGTTSGTGG GTGPTCTATY SVTSDWGNGF NGNVTITNTG
TTATKSWKVT WTWGGNQTIT NTWNATETQS GKAVTATNAP YNNVIAPGAS TSFGFNASYS
GTNGAPTVTV TAT