Gene Caci_6973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6973 
Symbol 
ID8338339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8063552 
End bp8066575 
Gene Length3024 bp 
Protein Length1007 aa 
Translation table11 
GC content67% 
IMG OID644960053 
Productcellulose-binding family II 
Protein accessionYP_003117644 
Protein GI256396080 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATCA GGACACGACG AAGAATCCCG GCGCGCGATC ACCGGCCGAG AGCGGTTCGA 
CCCAGCCGCC ACCGGGCCGC GTTGGCCGTC GCGGCGCTGG CTGCCAGCGT GATCGGGTGG
ATGCCAGGCG TCAGTTCGGC CGCACCGGCC GCAGCCTCGG GCACCTCGGG CGACCCGGGG
GTCGCTCGCA CCCAGCAGGT ACTCGCCTCG ATCACGACCG ACAAGTCGCG GTACGCACCC
GGCGACACCG TCTCGCTGAC CGTCAACGCA GCCAACAAGA CCGGCAGTGC CATCAGCGGC
GGCGCCGTCA CCTTGTACTT CATGACGATG CAGAACGCCG CCAGCGCTTC GCAGGCCCAG
ACTCTGAACC TCGGCTCTGG CGCGTCCTCG ACACTGGCCT TCCACTGGAC CGCACCGTCG
ACCGACTACA CCGGCTACAT GGTCAGCGCG GTCGCCACCG ACTCCTCCGG CAAGGCGCTG
GACTCGATCA ACGTCGCGGT CGACGTCTCC TCGGACTGGA GCCGGTTCCC GCGCTACGGC
TACATGACCA ACAACTCCTT CGGGAACCAA GGCCTGTCGA CGTCCCAGGC CGCCTCGATC
ATGAGCTCGA TGTCGAACTA CCACATCGAC GGCCTGCAGT TCTACGACTG GCAGTACAAC
CACGACCAGC CGCTGTGCGG CACGGTCAGC GCGCCGTGTT CCTCGTGGAC CGACGACGGC
AACCAGAAGA CGGTGTACGC CTCGGCGGTG AAGGATCTGG TCACCGCCGC GCACAACAGC
AACATCGTCG CGATGCCGTA CAACGCCATC TTCTCGGCGG ACAACGGCTC GTGCTGCGGT
GCGCCGGACT ACCACACGCA AGGGCTCGGA GTCAGTCCGT CGTGGGGCGT CTACCAGGAC
ACCAACCACA GCAAGCCGTT GGCGTTCTTC CAGTGGGACT ACATGGATCC CAGCAACCCC
GGCTGGCAGC AGTACCTGAT GGGCCAGCAG AACGCCGCGA TCCAAGCGTT CGACTTCGAC
GGCTTCCACG GTGACACCTT CGGCGACCCC GACACCGTCG ACTACAACTA CAACGGCCAG
CCGGCCGGCG TGGCCAGCGA CTCGTGCACC ACCGACACCG ACGGCGCCCA CAGCACGACG
CCCGTTCACA ACGTGGCCGG CTCGGCGACG TGGCTCAGCG GGACCTTCCC GTCGTTCCTC
AGCTATGCCA AGAGCGCCCT GGGCAGTGGC AAGTACCTGA TGTTCAACCC GGTCACCTAC
GACCACGCCC ACTGCGAGGC CAACACCAGC GCGGTGGACC TGCTCTATTC CGAGCTGTGG
CCGAACGACC GGGACCAGTA CTGGGACTAC GGCAGCCTGA AGACGGCCAT CGACCAAGGC
TTCAGCGAGA GCGCGTCGGC CAGCCCGACC GGCCGCGGCA AGTCGCTGAC GGTCGCGGCG
TACACCGACT TCGCCAACGG CGGCGGCGGC ACGTTCAACA CCCCGGACGT GCTGCTGCTG
GACTCCACGC TGTTCGCCAG CGGCGGCAGC CACGAAGAGC TCGGCGACAA CGGCCTGATG
CTGGACTATC AGGAGTACCG GGCCGGCGCG ACACCCATGA GTGCCTCGCT GTCCCAGTCG
GTGCAGAACT ACTACGACTT CATGACCGCC TACGAGAACC TGCTGCGGGA CGGCCAGACG
GCGACCAATC AGACCGTCGC CGTCTCCGGC CAGACGGTCA GCAGCCAGGC GACGCCGGGC
GACGTCTGGG CGTTCACCAA GCAGGACGCG GACCACGAGG TCATCCAGCT CATCAACATG
GTCGGGCAGT CCAGCAACCT CTGGCAGACC GGCGCGTGCG ACATGTGCTC GCACATCACC
ACGCCGCACC CGGCGCCGAC ACAGCTGACC AATGTGCCGG TGAAGTACTA CTTCAAGAAC
ACGCCCAAGG CCGTCATGTT CGCCTCGCCG GACTACAACA ACGGCACCAC CTACTCGGTG
CCGTTCACCA CCGGGACCGA CTCCGGCGGC TCGTACGTGT CGTTCACCGT GCCCAGCCTC
AACTACTGGG ACATGGTGTA CACCAGCCAG ACCGGACCGG GCGACGCGCC GGTTCTGCCC
GGCAGCGGCG GTACGCCGAC CGCCCCGGGC GCGCCCGGGA CCCCGGTCGC GTCCAACATC
ACCGCCAACT CCGCGACCCT GACCTGGACC GCTGCCACGG CCGGCAGCAA CCCGGTCGCC
GGGTACGACG TGTACCGCGT CGGCTCCCCG GACGCCGTCG TGGCCTCCTC GACCGGTCTG
TCGGCGAACG TCAGTGGTCT GTCGCCGTCG ACCAGCTACC AGTTCTACGT CAAGGCGAAG
GACTCCACAG GGCTGACCGG TTCGGCGTCC GGCACCACGT CGGTGACCAC TGCCTCCGGC
GGTTCCACAC CACCGGGTGC TCCAGGTACT CCGGCGGCGT CGAACGTCAC CACCAGCGCG
GCCACGCTGA CCTGGACCGC GGCAGCCGCG GGCAGCAACC CGATCTCCGG CTACCAGGTC
TTCCAGGTCG GCAACCCTGA CAAGGTCGTG GCGTCCACCG GTGCCGGCAC TCTGAGCGCC
ACCATCACCG GGCTGTCGCC ATCCACTGCC TACCAGTTCT ACGTCAAGGC GAAGGACTCC
ACAGGCCTGA CCGGTTCGGC GTCCGGCACA ACGGCGGTGA CCACCGCAGG CGCACCACCG
CCGAGTACGG CCAAGGTGAC GTACGCGGTG CAAAGCGACT GGGGATCGGG GATGTCCGTC
GCGGTGACGA TCACCAACAC CGGCAGCACC GCGATCAACG GCTGGACGCT GGGCTTCGCC
TTCCCCGGCA ACCAGCAGGT CGGCAGCGGC TGGAACGCCA ACTGGTCGCA GAACGGCCAT
AACGTCACCG CCACCAACCA GTCCTTCAAC GGCGCCATCG CGCCGGGAGC CTCGATATCG
ATCGGCTTCA GCGGCACCTA CAGCGGTGCC GATGCGAAGC CCTCCGCGTT CACCGTCAAC
GGCCTACCGG CCACCGTCGG ATGA
 
Protein sequence
MHIRTRRRIP ARDHRPRAVR PSRHRAALAV AALAASVIGW MPGVSSAAPA AASGTSGDPG 
VARTQQVLAS ITTDKSRYAP GDTVSLTVNA ANKTGSAISG GAVTLYFMTM QNAASASQAQ
TLNLGSGASS TLAFHWTAPS TDYTGYMVSA VATDSSGKAL DSINVAVDVS SDWSRFPRYG
YMTNNSFGNQ GLSTSQAASI MSSMSNYHID GLQFYDWQYN HDQPLCGTVS APCSSWTDDG
NQKTVYASAV KDLVTAAHNS NIVAMPYNAI FSADNGSCCG APDYHTQGLG VSPSWGVYQD
TNHSKPLAFF QWDYMDPSNP GWQQYLMGQQ NAAIQAFDFD GFHGDTFGDP DTVDYNYNGQ
PAGVASDSCT TDTDGAHSTT PVHNVAGSAT WLSGTFPSFL SYAKSALGSG KYLMFNPVTY
DHAHCEANTS AVDLLYSELW PNDRDQYWDY GSLKTAIDQG FSESASASPT GRGKSLTVAA
YTDFANGGGG TFNTPDVLLL DSTLFASGGS HEELGDNGLM LDYQEYRAGA TPMSASLSQS
VQNYYDFMTA YENLLRDGQT ATNQTVAVSG QTVSSQATPG DVWAFTKQDA DHEVIQLINM
VGQSSNLWQT GACDMCSHIT TPHPAPTQLT NVPVKYYFKN TPKAVMFASP DYNNGTTYSV
PFTTGTDSGG SYVSFTVPSL NYWDMVYTSQ TGPGDAPVLP GSGGTPTAPG APGTPVASNI
TANSATLTWT AATAGSNPVA GYDVYRVGSP DAVVASSTGL SANVSGLSPS TSYQFYVKAK
DSTGLTGSAS GTTSVTTASG GSTPPGAPGT PAASNVTTSA ATLTWTAAAA GSNPISGYQV
FQVGNPDKVV ASTGAGTLSA TITGLSPSTA YQFYVKAKDS TGLTGSASGT TAVTTAGAPP
PSTAKVTYAV QSDWGSGMSV AVTITNTGST AINGWTLGFA FPGNQQVGSG WNANWSQNGH
NVTATNQSFN GAIAPGASIS IGFSGTYSGA DAKPSAFTVN GLPATVG