Gene Caci_4285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4285 
Symbol 
ID8335639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4859078 
End bp4860496 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content69% 
IMG OID644957388 
Productcellulose-binding family II 
Protein accessionYP_003114990 
Protein GI256393426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.125188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAGCC CCAGACCGGC ACCACCGTCC CGCCGCCGCC GCGCCGTCGT GCGCACCGCC 
GTGGCCGCCG CCGCCGCGCT CGTCGTCGCC CTGACCGCCG CGGTGCTCAC CAGCCCGGCG
CAGGCGCTCG GCGTGGGCGG CACGTCGTCC CCGGTGACCG GGAACGCGAC CCACTTCGAC
GGGCTCGGGG CGCCCTACGG CGGCTGCGGC GTGCCGCAGG CGAACCTGGA CTCGCAGGAC
TTCATCGCCC TCAACGTCTT CAACACCCCG GGGAACTACA ACCAGTTCAC GCGTCCGGTG
CCGCCGGCAC AGGCGAGCAT CCTGGGCATG TTCGACAACG GTCTGAACTG CGGCCGCTGG
GTGAAGGTCA GCATCGGCGA CCTGTGCACC GGCACCAACG ACGGCGCGCC GAACCAGCCG
TTCTGCCGCA ACGGCTCGTG GGTCGCCGAC AAGTACAACG GCGCGACGCT GAACATGCTG
GTCGCCGACA GCTGCGCGGA CTCCAACGCC TGGTGCCGTG ACGATCCGTA CCACATCGAC
CTGCACACCG ACTCGATCAA CCGCTTCCAG CTCAACGGCT CCGCGGTCGG CGACCTGCTG
AACCACTGGA ACAACCGCCA GGTGAGCTGG CAGTTCATCA GCGCTCCGGG CTACAGCGGC
GACATCAACA TCGGTTTCAT GCAGGGCGCG CAGGTGTACT GGCCGGCGAT CTCGGTCTCG
CACCTGGCGA ACGGGATCCA CGGCGTGCAG TACCTGTCGG CGTCCGGGAC CTGGGTCTCG
GCGGCGATGG ACAGCGACAT GGGGCAGTCG TACATCATCG CGCCGACCGC GACCGCCGGC
TCCAGCTACC AGATCCGGGT GACCGACGCC TCGGACAATC TGATCAACGG CGGTCAGGTC
TACAGCTTCT CGCTGCCGTC CTCCTGCGGG GGCAGCTGCA GCGCGGCCTA CACACAGGTC
CCGTACACGA CGACGCCGGG TTCGGGACCG AGCTCGCCGA GTTCGAGTCC CAGCACGACG
CCGAGTACCA GCGCCTCGAG TCCGTCCACT CCGCCGAGTT CGCCGAGCTC ATCGGGACCG
TCGACCCCGC CGAGCTCGCC CTCCAGCTCC GCACCCGCCT CCGGCTGCTC GGTCACCTCC
TCGGTGACAG GTTCCTGGTC CAGCGGCTAC CAACTCGCGT TCACGGTCAC CAATACCGGC
AAGGTCGCCT CCTCGCAGTG GGCGGTGCGT TTCTCCTTCG CCGGAAGCCA GACGATCGCC
AACTCCTGGA ACGTGACCGC CACGCAATCC GGGCAGGCGG TGACCGCGAA CTCCGTGTCG
TACAACGGGT CCCTGGCACC GGGTGCGGCG ACGTCGTGGG GCATGGTGGT CAACGGCGCG
AACCAACCGC TCGGCGGCAT TTCCTGTGTC GCGAGCTGA
 
Protein sequence
MRSPRPAPPS RRRRAVVRTA VAAAAALVVA LTAAVLTSPA QALGVGGTSS PVTGNATHFD 
GLGAPYGGCG VPQANLDSQD FIALNVFNTP GNYNQFTRPV PPAQASILGM FDNGLNCGRW
VKVSIGDLCT GTNDGAPNQP FCRNGSWVAD KYNGATLNML VADSCADSNA WCRDDPYHID
LHTDSINRFQ LNGSAVGDLL NHWNNRQVSW QFISAPGYSG DINIGFMQGA QVYWPAISVS
HLANGIHGVQ YLSASGTWVS AAMDSDMGQS YIIAPTATAG SSYQIRVTDA SDNLINGGQV
YSFSLPSSCG GSCSAAYTQV PYTTTPGSGP SSPSSSPSTT PSTSASSPST PPSSPSSSGP
STPPSSPSSS APASGCSVTS SVTGSWSSGY QLAFTVTNTG KVASSQWAVR FSFAGSQTIA
NSWNVTATQS GQAVTANSVS YNGSLAPGAA TSWGMVVNGA NQPLGGISCV AS