Gene Caci_3602 details

Gene Information

Locus tag: Caci_3602
Symbol:
ID: 8334955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4021849
End bp: 4023153
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 71%
IMG OID: 644956744
Product: glycoside hydrolase family 6
Protein accession: YP_003114347
Protein GI: 256392783
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
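
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. The variable names are hypothetical.

```python
# Values copied from the record.
start_bp = 4021849
end_bp = 4023153
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not count toward the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span (bp):", span_bp, "matches record:", span_bp == gene_length_bp)
print("coding codons:", coding_codons, "matches record:", coding_codons == protein_length_aa)
```

Under these assumptions the span works out to 1305 bp (435 codons), which leaves 434 coding codons, in agreement with the Gene Length and Protein Length fields.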


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0359346
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTGTCCA GAACCGCGAG ACTCCTGCTC GCCGCAGGTC TTTCCGCATC GGCCATCGCC 
TTGGGTCCGG TCGCCACGGC GGCCACCGCG ACCACGGCGC ACGCCCCGAG CGCCGGGCAC
ACGCTGCCGG CCGACACGCG CTTCGCCGTC ACGCCCGACA ACGAGGCCCA GCGCCAAGCA
CTGACCGATC TGCAGCACCA CGACCTCGCC GGCGCCGCGG CCATGGCGAA GCTGGCCAGC
TGGCCCGAGG CGACCTGGTT CACCAGCGGC ACGCCGGCTC AGGTGCGCGA CCAGGTCCGC
GCGACCGTGC GGACCGCCGC GGCCGAGCGT GCCGTTCCGG TGCTGGTCGC CTACGACATC
CCGCTGCGGG ACTGCAGTCA GTACTCCGCC GGCGGCGCGG CGTCCGATGC CGCCTACCAG
CAGTGGATAT CAGCGTTCGC ACAAGGGGTC GGCTCGAGCC GGGCCGTGGT GATCGTCGAG
CCGGACGCGC TGGCGAACCT GCCCTCGGAT TGCAATGCCA CCACCGACCC GACCGGGACG
CTGACCGCCG GGCGCATCGC CGACATCAAG TACGCGGTGT CCGCCCTCGA AGCCCAGCCG
CAGACGGTCG TCTACCTCGA CGCCGGAAAC AGCCAGTGGC ACTCTGTCGG CGATATGGCG
CAGCGCCTGA TCCAGGCAGG CGTCGCTCAG TCCCAGGGCT TCTTCCTCAA CGTGTCCAAC
TACCAGCCGA CCGACCAGAC CGACCAGTAC GGCACCTGGA TCTCCAAGTG CCTGTGGTTC
GCCACCGACG GTCCGGCATG GGCAGCCGGA CACACCGACT ACTGCGCCAG CCAGTACTAC
TCCTCGGCGG CGCCGAACGA CGGAGCGCCC GGCGACGCGG TGTCCCCGAC CGATGCGAGC
ACCTGGCACT GGACGGACGC CTGGTTCGAC CAGAACGTCG GCACTCCCCC GCCCGCGCAG
CTGACCCACT TCGTCGTGGA CACCAGCCGC AACGGTAAGG GCGCATGGAC CCCGGCGCCC
GGCAAGTACA CCGGCGACCC CCAGACCTGG TGCAACCCTC CGGGTCGCGG CATCGGCGCC
ACGCCGACCG CCGCCACCGG CGTCCCGCTC GTCGACGCCG ACCTGTTCAT CAAGACGATC
GGCGAGTCCG ACGGCAGCTG CACGCGCAGC ACCGCGGGTC CCGGCGACCC CGAATACGGC
GGCACGGTGG ACCCGGCGGC CGGCGCGTGG TGGCCGGCCC AGGCACTCGG CCTCGTCCAG
GACGCCGTCC CGACGCTGAC CTTCAATCCG CGTCTGCTTC CCTGA
 
Protein sequence
MLSRTARLLL AAGLSASAIA LGPVATAATA TTAHAPSAGH TLPADTRFAV TPDNEAQRQA 
LTDLQHHDLA GAAAMAKLAS WPEATWFTSG TPAQVRDQVR ATVRTAAAER AVPVLVAYDI
PLRDCSQYSA GGAASDAAYQ QWISAFAQGV GSSRAVVIVE PDALANLPSD CNATTDPTGT
LTAGRIADIK YAVSALEAQP QTVVYLDAGN SQWHSVGDMA QRLIQAGVAQ SQGFFLNVSN
YQPTDQTDQY GTWISKCLWF ATDGPAWAAG HTDYCASQYY SSAAPNDGAP GDAVSPTDAS
TWHWTDAWFD QNVGTPPPAQ LTHFVVDTSR NGKGAWTPAP GKYTGDPQTW CNPPGRGIGA
TPTAATGVPL VDADLFIKTI GESDGSCTRS TAGPGDPEYG GTVDPAAGAW WPAQALGLVQ
DAVPTLTFNP RLLP