Gene Caci_3718 details

Gene Information

Locus tag: Caci_3718
Symbol:
ID: 8335071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4183999
End bp: 4185576
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 68%
IMG OID: 644956858
Product: cellulose-binding family II
Protein accession: YP_003114461
Protein GI: 256392897
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
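
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the database.

```python
# Values copied from the gene record.
start_bp = 4183999
end_bp = 4185576
gene_length_bp = 1578
protein_length_aa = 525

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3

print(span_bp == gene_length_bp)        # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)          # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop equals the protein length
```

Under these assumptions all three checks print True: 4185576 - 4183999 + 1 = 1578, and 1578 / 3 - 1 = 525.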


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.154432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACATC CGTTCTCCCT CTTCAGATCT GCCCGCAGCC GGGCGGTTTC GGCCGTGGCC 
GCGGTCGCGG TGATCACTAT CGCGGGCGCC GGGGCTGTGG TGGCGCTTCC GCGAGCCGCG
CAGGCGGCCG GGGCGACGCA GTGTCAGGTT TTGTACTCGG TGGCCAATGA CTGGGGCAGT
GGCTTCAGCA CGAACGTCAG CATCACGAAC CTCGGTGCGC CGTGGACGAG CTGGACTCTC
GGCTACTCCT ACGCCGGGAA CCAGACGCTG TCCTCGGGCT GGAACGGGTC CTGGACCCAG
TCCGGCAAGG CCGTGACGGT CACGAGCATG TCGTGGAACG GCGCCGTGGC CACCAACGGC
ACCGTCACCC CGGCGGCGAA CTTCACCTAC AGCGGTGCGA ACGCCGCCCC GACGGCGTTC
ACCGTCAACG GAGTCCTGTG CGGCGGGCCC GGCTCGCCGC CTCCGACGTC CACCCCCAGC
ACCTCGCCCT CAACCTCGCC GAGCAGCACG CCGAGCACGC CGCCGAGCAG CCCGCCGCCG
GGGACGCCCG CCCCGCAGCT TCACGTCTCC GGAAACCACC TGGTCACCTC GGCCGGCGCG
ACCTACCGTC TCCTGGGCGT CAACCGCTCC AGCGGCGAGT TCGCCTGCGT CCAGGGCAAG
GGCATGTGGG ACGGGCCGGC GGACCAGGCC ACGATCGATG CGATGAAGAC CTGGAACATC
CACGTTGTGC GCATCCCGCT GAACGAGGAG TGCTGGCTGG GCAACAGCGA CGTCCCCGCG
GGCGGTACCG TCGGCGCCGC GTATCAGAAG GCGGTCAAGG ACTACACCGA TCTGTTGGTG
GCCAACGGCA TCAACGTGAT CCTGGACCTG CACTGGACCT ACGGCCAGTA CACCGGCCCG
AGCTCGGCGT GCGCCGACGC GCTGGCCGCG TGCCAGAAGC CGATGCCGGA CGCGCAGTAC
ACCCCGACGT TCTGGAAGCA GGTCGCCACC GCGTTCAAGG GTGACAACGC AGTGCTCTTC
GACCTGTTCA ACGAGCCCTA TCCGGACGCC GCGAACAACT TCTCCAACGC CACCGAGGCC
TGGACGTGCC TGCGCGACGG CGGAACCTGC ACCGGCATCA CCTACCCGGT CGCCGGCATG
CAATCGCTGG TCGACGCGGT CCGCGCCACC GGTGCCACCA ACGTCGTCAT GACCGGCGGC
CTGACCTGGA CCAACGACCT GAGCCAGTGG CTGGCCTACG AGCCGAAGGA TCCCACCGGC
AACCTGGTCG CCTCCTGGCA CTCCTACAAC TTCAACGGCT GCATCACCAC CTCCTGCTGG
AACTCCACTA TCGGCGCCGT GGCCGCGAAG GTGCCGGTCC AGGCCGGCGA GATCGGCCAG
AACAACTGCA ACCACGACTA CATCGACCAG GTGATGGCCT GGGCGGACGC CAACGGCGTC
GGCTACTCGG CGTGGACGTG GAACCCCTGG GGCGTCTGCA ACAGCAACGG CAACGACCTG
ATCACCGACT GGAGCGGCAC ACCCACCGCC ACCTACGGCC AGGGATACCA AGCGCATCTG
CTCACCCAAA AGCCCTGA
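
The sequence is printed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First row of the gene sequence, copied as printed in the record.
first_row = "ATGCGACATC CGTTCTCCCT CTTCAGATCT GCCCGCAGCC GGGCGGTTTC GGCCGTGGCC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the GC content field of the record.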
 
Protein sequence
MRHPFSLFRS ARSRAVSAVA AVAVITIAGA GAVVALPRAA QAAGATQCQV LYSVANDWGS 
GFSTNVSITN LGAPWTSWTL GYSYAGNQTL SSGWNGSWTQ SGKAVTVTSM SWNGAVATNG
TVTPAANFTY SGANAAPTAF TVNGVLCGGP GSPPPTSTPS TSPSTSPSST PSTPPSSPPP
GTPAPQLHVS GNHLVTSAGA TYRLLGVNRS SGEFACVQGK GMWDGPADQA TIDAMKTWNI
HVVRIPLNEE CWLGNSDVPA GGTVGAAYQK AVKDYTDLLV ANGINVILDL HWTYGQYTGP
SSACADALAA CQKPMPDAQY TPTFWKQVAT AFKGDNAVLF DLFNEPYPDA ANNFSNATEA
WTCLRDGGTC TGITYPVAGM QSLVDAVRAT GATNVVMTGG LTWTNDLSQW LAYEPKDPTG
NLVASWHSYN FNGCITTSCW NSTIGAVAAK VPVQAGEIGQ NNCNHDYIDQ VMAWADANGV
GYSAWTWNPW GVCNSNGNDL ITDWSGTPTA TYGQGYQAHL LTQKP
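
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the database's own method: it assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons), and `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA block (whitespace allowed) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases and last 18 bases of the gene sequence.
print(translate("ATGCGACATC CGTTCTCCCT"))  # MRHPFS
print(translate("CTCACCCAAA AGCCCTGA"))    # LTQKP
```

Both fragments agree with the start (MRHPFS...) and end (...LTQKP) of the protein sequence listed in the record.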