Gene Caci_6683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6683 
Symbol 
ID8338047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7699889 
End bp7701790 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content69% 
IMG OID644959777 
ProductCellulose 1,4-beta-cellobiosidase 
Protein accessionYP_003117370 
Protein GI256395806 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.918543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.93944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACAAA GCTCACCCAG CGCCACCGGC AGAGCCAGAC GCCTCACCAG TGCCTTCGTG 
GCAGCCGGTG TGACGCTTGG GGTGGCCGGC GGGCTCGCGG CGCTGACCAC GACCAGCTCG
AACGCGGCGA CCGCCGCCGG ATGCACCGCG GTGTACTCCA CGACGTGGGA CTCCGGGAGC
GGCTTCGGCG CGCAGGTGGT CATCACCGAC AACGGTCCGG CGTGGACGAA CTGGACGCTC
AGCTACTCCT ACGCCGGCAA CCAGACGCTG CAGAACGGCT GGAACGGCAC CTGGAACCAG
TCCGGCAAGA CGGTCACCGT CACCAACGCG TCCTATAACG GCGCGGTGGC CTCCGGCGGA
ACCGTGACCC CGGCGGGGAA CTTCGGCTAC TCCGGGACCA ACGCGGCGCC GACGTCGTTC
TCGGTCAACG GGATGACGTG CAGCGGGACC ACTCCGCCGC CGACGACGCC GACGACCACG
CCGTCCACGC CGTCCACGAC GCCGACGACC ACGCCGAGCA CGACCCCGTC CACGACCCCG
TCTACGACGC CCTCCACCAC GCCGTCCACC ACGCCCTCGA CGACCCCGTC GACCGGCGGC
GGCGGAGGCG GGCACGTGGC CAACCCGTTC GTGGGCGCCT CGCAGTACCT GAGCCCGGAC
TACGCCGGCG AGGTCAACGC CCAGGCGGCC GCCGACCAGT CGTCGAACCC GGCTCTGGCA
GCCTCCGAGT CGAAGATGGC CGGCTACGCG ACCGCGGTCT GGATGGACCG GATCGCGGCC
ATCACCGGCA CCGGCGACAG CGTGCACCAC GGTCTGCAGT GGCATCTGGA TCAGGCGCTG
AGCCAGCAGA AGGCGGGGAC TCCGATCACC TTCGAGGTCG TCATCTACGA CCTGCCGGGG
CGCGACTGCG CCGCACTGGC CTCCAACGGC GAGATCCCGG CGACCGCCGC CGGCCTGACC
GAGTACGAGT CGCAGTACAT CGACCCGATC TCGGCGATCC TGGCCGACCC GAAGTACTCC
GGCATCCGGA TCGTCGCGAT CGTCGAACCG GACTCGCTGC CGAACGCGGT GACCAACCAG
AGCAAGTCCG CGTGCGCGAC GGCGACGCCG TTCTACGAGT CCGGGGTCGA ATACGCCCTG
AACAAGCTGC ACGCGATCTC CAACGTCTAC AACTACGTGG ACATCGCGCA CTCGGCGTGG
CTGGGCTGGT CCTCCAACAT GGGTCCGGCG GCGCAGGAGT TCGCCAAGGT GGCCCGGGCC
ACGACGGCCG GGTTCGCCAG CGTCGACGGC TTCATCTCCA ACACGGCCAA CTACACCCCG
ACCACCGAGC CGTTCCTGCC GAACTCGACC CTGCAGGTCG GCGGCAACCC GCTGGACTCG
GCGAAGTTCT ACCAGTACAA CCCGTACTTC GACGAGTACG ACTACGACCA GGCGATGTAC
AGCCAACTGG TCGGCCAGGG CTTCTCGGCC AACATCGGGA TGCTCATCGA CACCTCGCGC
AACGGCTGGG GCGGCCCGAA CCGCCCGACC GCGCTGAACT CCTCGCCGAC GACCGTGGAC
ACCTACGTCG CGGCCAACAA GGTGGACCAG CGCTCCTTCC GCGGTGACTG GTGCAACCAG
AACGGCGCGG GGGTCGGCTC GCGGCCGACG GTACAGCCGT ACGGCGCGTC CAACCACATC
ATCGCCTACG TGTGGATCAA GCCTCCGGGG GAGTCCGACG GCGACTACCC GAGCGCCTCG
CACAGCCACG GCGACCCGCA CTGCGACCCG GCCGGGACCA ACACCGACGG CAACGGCGGG
ACCTACTCGA CCGGGTCGAT CCCCGGCTAC GACGTGCCGG CCGGACAGTG GTTCGCCGCT
GAATTCCAGC AGGAGGTGGA GAACGCCTAT CCCGCGATGT AG
 
Protein sequence
MGQSSPSATG RARRLTSAFV AAGVTLGVAG GLAALTTTSS NAATAAGCTA VYSTTWDSGS 
GFGAQVVITD NGPAWTNWTL SYSYAGNQTL QNGWNGTWNQ SGKTVTVTNA SYNGAVASGG
TVTPAGNFGY SGTNAAPTSF SVNGMTCSGT TPPPTTPTTT PSTPSTTPTT TPSTTPSTTP
STTPSTTPST TPSTTPSTGG GGGGHVANPF VGASQYLSPD YAGEVNAQAA ADQSSNPALA
ASESKMAGYA TAVWMDRIAA ITGTGDSVHH GLQWHLDQAL SQQKAGTPIT FEVVIYDLPG
RDCAALASNG EIPATAAGLT EYESQYIDPI SAILADPKYS GIRIVAIVEP DSLPNAVTNQ
SKSACATATP FYESGVEYAL NKLHAISNVY NYVDIAHSAW LGWSSNMGPA AQEFAKVARA
TTAGFASVDG FISNTANYTP TTEPFLPNST LQVGGNPLDS AKFYQYNPYF DEYDYDQAMY
SQLVGQGFSA NIGMLIDTSR NGWGGPNRPT ALNSSPTTVD TYVAANKVDQ RSFRGDWCNQ
NGAGVGSRPT VQPYGASNHI IAYVWIKPPG ESDGDYPSAS HSHGDPHCDP AGTNTDGNGG
TYSTGSIPGY DVPAGQWFAA EFQQEVENAY PAM