Gene Caci_6974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6974 
Symbol 
ID8338340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8066673 
End bp8069489 
Gene Length2817 bp 
Protein Length938 aa 
Translation table11 
GC content68% 
IMG OID644960054 
Productcellulose-binding family II 
Protein accessionYP_003117645 
Protein GI256396081 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCT CACGGCGAGA TCTGCTCAAG GTCGGCGGAA CGGTGGTGGT CGGCGGGTCT 
TTGCTGTCGG CCGGTTCCGG TTCGGCGGCC GCTGCGTCGC GTGGCAGGGG CGCCCTGGTG
GGCTCGACGG CTGTGGCTGT GTCCGCGGTG GGGACGGACT CAGTCGATCG TGACAGCGTG
TTCACCCGGG GCGGTTCCGG GCCGCTGTAC TGGTCCACGT ACGGCTACGA CTACCCGAAC
AACAACGCTC AGTCGCAGGC CAGCTGGCAG GCGAACGTCA CCTGGGTGGC GAAGAACCTC
AAGCCCTACG GCATCGACAT GGCCTGTACC GACGGCTGGG TCGACTACAC CCAGGCGACC
AACTCCAACG GCTACATCCT GAACTACCAG GACAGCTGGG CCATGGGCTG GGCCGGGATG
TCGTCGTACC TGTCCGGGCT CGGGCTGAAG ATGGGGGTCT ACTACAACCC CCTGTGGGTG
ACCAAGAGTG CGTACAACGA CCCGTCCAAG ACGGTGGTCG GGCGTCCGGA CATCCCGATC
TCGTCGATCG TCACCGCGGG GGACTTCTTC GGCGGGAACA AGGGCACCCA GGAGATCTAC
TGGGTGGACG TGACCAAGGA CGGCGCCAAG GAGTTCGTCC AGGGCTACGT CAACTACTTC
AAGCAGCTGG GCGCGGTCTA CCTGCGCGGC GACTTCTGGG CGTGGTACGA GACCGGCTAC
GACCAGAACG AGGGCACGGT CGGCGTGGCG CACGGCAGCG CCAACTACGC CACCGCCCTG
GGCTGGATCA GCGAGGCCGC CGGGGACGGG CTGGAAGTCA GTGTCGTGAT GCCCAACTTG
CTCAACCACG GCCAGAACGA GCGGCGCTAC GGCGACCTGA TCCGCATCGA CGACGACTGC
GGCAGCGGCG GCTGGAACTT CCTGGACGGC GGCCGGTCGA GCTGGCAGAA CTACTGGACC
CAGTGGCACA CCCCGTTCCT GGGCTTCACC GGCTTCTCCG ATATCTCCGG GCGCGGTCAG
ATGATCCTGG ACGGCGACGT CCTGGAGATG AACAGCTTCG GCAGCGATGA CGAGCGCCGC
ACGGCACTGA CACTGTTCGC CATGGCCGGC TCGCCGCTGA TCGTCGGCGA CCGGTCCGAC
AACATCGGCT CCTACCTCAG CTTCTGGCAG AACAACGACA TCCTGAACAT CAACAAGGCC
GGGTTCGTCG GCAAGCCCTA CTACCACAAC GCCAATCCGT TCTCGTCGGA TCCCACGAGC
CGGGACCCGG AGACGTGGAC GGGCCAGCTG CCCGACGGCA CCTGGCTGGT CGCGTTGTTC
AACACCACCT ACTCCAGCGT CACCAAGTCG ATCGACTTCG CCGGCGCTCT GGGGCTCGCG
GCCGGCGGCA CGGTCCACGA CGTGTGGAAC AACACCAACC TGGGCCAGAT GACGTCGTAT
TCGGCGTCCC TGCCGATGCA CGGGGTGTCC CTGATCAAGA TCACGCCGGC CGGGTCCGGG
GCGCCGGTCT ACCAGTCTCA GGTGGCTGCC TGGGGCGGCG GGGCGATGTT CGACAACGCC
GCGTCCGGCT TCAGCGGCAA CGGCTACGTG GACGGACTGG GTAGCGTCGG CGCCCGGGTC
GTCTTCGGCG TCACCGGTGC GGGCGGCACG ACTCCGGTCA CCATCCGATA CGCCAACTCC
GGCAGCGCCG CGAGCCTGAC CATATCGGCC AAGAACGTGG CGGGCACGGT CTCGGGCAGC
ACCTCGGTCA GCCTCCCGGG CACCGGCGGC GCCGGTACCT GGAGCACGGT CACCGTCAAC
CTCGCGCTGG CCGCGGGGAC GAACCTGATC ACCCTGGAGC GCACCTCCAC CGATTCCGGA
TCAGTGAATC TGGACTCGAT CCAAGTCGGC GCCTCCTCAG GAGGCTCCAC CCCGCCCGGC
GCCCCCGGCA CACCGGCGGC TTCGGCCATC ACCTCCAACG CGGTGACCCT GACGTGGAGC
GCGGCAGCCG CGGGCAGCAA CCCGATCGCC GGATACCAGG TCTACCAGGT CGGATCACCC
GACACCGTGG TCGCTTCCAC CGCCGCCGGA ACCCTCACCG CGACCATCAG CGGCCTGACT
GCGGCGACGA GCTACAGCTT CTACGTCAAG GCCAAGGACA GCGCGGGAAC CGTCGGCGCC
GCATCCGGAA CGACGGCAGT CACCACTGCC GGCTCCGGCG GCTCGACCCC GCCCGGCGCA
CCGGGCACAC CGGCAGCGTC CACCATCACC GCGACCGCGG TGACCCTGAC GTGGAGCGCG
GCAGCCGCTG GCAGCAACGC GATCGCCGGA TACCAGGTCT ACGAGGTCGG ATCGCCCGAC
ACCGTGGTCG CTTCCACCGC TGCCGGAACC CTCACCGCGA CCATCAGTGG CCTGATGAGC
GCCACGCAGT ACGGCTTCTA CGTCAAGGCC AAGGACAGCG CCGGAGCTCT CGGCGCCGCT
TCAGCCACGA AAACCGTGAC GACGGCTGCG GTTTCGGCTG GAGCCGCGGT CTCCTACGCC
GTCCAAAGCG ACTGGGGCTC GGGCTTCAGC GCCTTGGTGA CGATCACCAA CACCGGTACC
AGCGCGATCA ACAACTGGAC CCTCGGATTC ACCTTCGCGG GCAACCAGCA CGTCACCAAC
GGCTGGAACG CCACCTGGTC CCAGAGCGGC GCGAACGTCA CCGCCTCCAG CGAGTCCTTC
AACGGCGCGA TCGCCCCGGG CGCCTCGGTC CAGATCGGCT TCACCGGTAC CTACAGCGGT
GCCAACGCCA AGCCGACCGC CTTCACGATC AACGGGCAGC CCGCCACCAC GCAGTGA
 
Protein sequence
MAISRRDLLK VGGTVVVGGS LLSAGSGSAA AASRGRGALV GSTAVAVSAV GTDSVDRDSV 
FTRGGSGPLY WSTYGYDYPN NNAQSQASWQ ANVTWVAKNL KPYGIDMACT DGWVDYTQAT
NSNGYILNYQ DSWAMGWAGM SSYLSGLGLK MGVYYNPLWV TKSAYNDPSK TVVGRPDIPI
SSIVTAGDFF GGNKGTQEIY WVDVTKDGAK EFVQGYVNYF KQLGAVYLRG DFWAWYETGY
DQNEGTVGVA HGSANYATAL GWISEAAGDG LEVSVVMPNL LNHGQNERRY GDLIRIDDDC
GSGGWNFLDG GRSSWQNYWT QWHTPFLGFT GFSDISGRGQ MILDGDVLEM NSFGSDDERR
TALTLFAMAG SPLIVGDRSD NIGSYLSFWQ NNDILNINKA GFVGKPYYHN ANPFSSDPTS
RDPETWTGQL PDGTWLVALF NTTYSSVTKS IDFAGALGLA AGGTVHDVWN NTNLGQMTSY
SASLPMHGVS LIKITPAGSG APVYQSQVAA WGGGAMFDNA ASGFSGNGYV DGLGSVGARV
VFGVTGAGGT TPVTIRYANS GSAASLTISA KNVAGTVSGS TSVSLPGTGG AGTWSTVTVN
LALAAGTNLI TLERTSTDSG SVNLDSIQVG ASSGGSTPPG APGTPAASAI TSNAVTLTWS
AAAAGSNPIA GYQVYQVGSP DTVVASTAAG TLTATISGLT AATSYSFYVK AKDSAGTVGA
ASGTTAVTTA GSGGSTPPGA PGTPAASTIT ATAVTLTWSA AAAGSNAIAG YQVYEVGSPD
TVVASTAAGT LTATISGLMS ATQYGFYVKA KDSAGALGAA SATKTVTTAA VSAGAAVSYA
VQSDWGSGFS ALVTITNTGT SAINNWTLGF TFAGNQHVTN GWNATWSQSG ANVTASSESF
NGAIAPGASV QIGFTGTYSG ANAKPTAFTI NGQPATTQ