Gene Caci_2792 details

Gene Information

Locus tag: Caci_2792
Symbol:
ID: 8334141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3205822
End bp: 3207867
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 69%
IMG OID: 644955940
Product: cellulose-binding family II
Protein accession: YP_003113546
Protein GI: 256391982
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5520] O-Glycosyl hydrolase
TIGRFAM ID:
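
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3205822
END_BP = 3207867


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "2046 681"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.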


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAGT CCCAGCCTGG AGTGGGAGCG CTCCCGCTCC AGGGTCCACT GAGCGAGAAG 
CCCGGACCCG AGTCATCCAT CGCCGAGATC CAGGAGACTA CGTTGACCCC CCGCAACGAG
ACCACGGCCA CAGGGCCGGA GTTCCCCAGC CGGAGAACCG TACTGGCGGC GCTCGGTGCC
GTGCCGGTGC TCGCGGTCGC GGCACCGTCC ATGGCGGCTA CCGCGGCACC TGCGGCGGCC
TCCGCGGTGG TCGTCGACCC GTCCGCCTTG CGGCAGACGA TCCGGGGCTA CGGCGGCATG
AACCACCCGG AGTGGGCGGG CGACCTGACA GCCGCGCAGC GCGACACCGC GTTCGGGAAC
GGCACGGGCC AGCTGGGGTT CTCCATGCTG AGGATCCACG TTGACGAGGA CCAGAACAAC
TGGAGCCGCG AGCTGGCGAC GGCGCAGCGG GCGGTCGCGC TCGGGGCGAC CGTCTTCGCG
TCGCCGTGGA ATCCCCCCGC CAGCATGGTC GAGACCTTCA CACGCGGAAG CCAGACCAAC
GCCCACCGCC TGCGGCATGA CATGTACGGC GCCTACGCGC AGCACCTGAA TTCCTTCTAC
CAGTACATGA AGACCAACGG GGTGGACCTG TACGCCATCT CGGTGCAGAA CGAACCCGAC
TACGCCTCGA CGTGGACGTG GTGGACCGCG AGCGAGATCG TCACCTTCCT GCAGAACAAC
GCCGGCGCCA TCGGCACCAG GATCATCGCG CCGGAGTCCT TCCAGTACGT CAAGAGCATG
TCGGACCCGA TCCTCAACGA CGCCACGGCG CTGGCCAACC TGGACATCCT CGGGGCCCAC
CTTTACGGCA CCTCGTACGC GAACTTCCCC TACCCCCTCT TCCAGCAGAA GGGGCAGGGC
AAGGAGCTGT GGATGACCGA GGTCTACTAT CCCAACAGCA CCGACTCGGC CGACCTGTGG
CCCGCGGCAC TCGGCGTCGG GGAGCACATG CACCACGCGA TGGTGGACGC CGAGTTCCAG
GCGTACGTGT GGTGGTACAT CCGGCGCAGC TACGGGCCCA TGCGCGAGGA CGGGCAGATC
AGCAAGCGCG GCGCCCTGAT GGCGCAGTTC TCGAAGTTCG TCCGCCCCGG ATACGTGCGT
GTCAACGCGA CCGCGAACCC CCAGACGAAC CTCCTCACCT CGGTCTACAA GGGGCCCTCC
ACGCTGGTCA TCGTCGCCGT CAACTCGGCG ACCAGCACGC TGAGCCAGCA GTTCACGCTG
TCGAACACCA CGGCGTCCAG CGTGTCCGCA TGGGTGACGG ACGCGTCCAG GAACGTGGCC
TCCACGAGCG CGCCCAGCGT GTCGAACGGC AGCTTCACCG CCACGCTCCC CGCCCAGAGC
GTCACCACCT TCGTCATCAC GGTGGGCTCT TCCACCGGAT CGGACACCCA AGCGCCCACC
GCGCCCGGCA CACCCACGGC CACCGGGATC ACGGCCACCT CCGCCACGCT GAGCTGGCCG
GCCTCCACCG ACAACGTCGG CGTGGTCGGT TACGACGTGG TACGCGTCAG CGGCACCACC
GAGACCGCTG CCACGTCCTC CACCACCACC CAGGGCACTG TCACCGGCCT GACCGCGAGC
ACTGCCTACA CCTTCGCGGT CTACGCGCGC GACGCGGCCG GCAACCGCTC GACCCGCTCC
GCCACCGTCT CCGTCACCAC CAGCGCCTCG GGCGGTACTG GTACCGGAGC CTGCGGGGTC
ACCTACCAGG TCACGGGGAG CTGGACCGGC AGTTTCCAGG GCCAGATAGA CATCCACAAC
ACCGGCACCA CTGCCCTCAA CGGCTGGACC CTCACCTTCA CCTTCACCGC GGGCCAGACC
ATCACCCAGA TGTGGGGCGG CACCCCCGCG CAGAGCGGGA GCAAGGTGAC CGTGACCCCG
GCGGACTACA ACAGCTCCAT CCCGGCCGGC GGCTCCGTCA CCGTCGGTTT CCTCGGCACC
GCGGGCAGCA CCAACCCGGC CCCGACCGGC TTCACACTCA ACGGCGGCAC CTGCACGACC
GCCTGA
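
The sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned up and how a GC fraction, such as the GC content field of this record, could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = (
        "ATGCCTGAGT CCCAGCCTGG AGTGGGAGCG "
        "CTCCCGCTCC AGGGTCCACT GAGCGAGAAG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 2))
```

Running the same functions on the full pasted sequence would give a value to compare against the reported GC content.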
 
Protein sequence
MPESQPGVGA LPLQGPLSEK PGPESSIAEI QETTLTPRNE TTATGPEFPS RRTVLAALGA 
VPVLAVAAPS MAATAAPAAA SAVVVDPSAL RQTIRGYGGM NHPEWAGDLT AAQRDTAFGN
GTGQLGFSML RIHVDEDQNN WSRELATAQR AVALGATVFA SPWNPPASMV ETFTRGSQTN
AHRLRHDMYG AYAQHLNSFY QYMKTNGVDL YAISVQNEPD YASTWTWWTA SEIVTFLQNN
AGAIGTRIIA PESFQYVKSM SDPILNDATA LANLDILGAH LYGTSYANFP YPLFQQKGQG
KELWMTEVYY PNSTDSADLW PAALGVGEHM HHAMVDAEFQ AYVWWYIRRS YGPMREDGQI
SKRGALMAQF SKFVRPGYVR VNATANPQTN LLTSVYKGPS TLVIVAVNSA TSTLSQQFTL
SNTTASSVSA WVTDASRNVA STSAPSVSNG SFTATLPAQS VTTFVITVGS STGSDTQAPT
APGTPTATGI TATSATLSWP ASTDNVGVVG YDVVRVSGTT ETAATSSTTT QGTVTGLTAS
TAYTFAVYAR DAAGNRSTRS ATVSVTTSAS GGTGTGACGV TYQVTGSWTG SFQGQIDIHN
TGTTALNGWT LTFTFTAGQT ITQMWGGTPA QSGSKVTVTP ADYNSSIPAG GSVTVGFLGT
AGSTNPAPTG FTLNGGTCTT A
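
As a consistency check, the protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table with the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in the permitted start codons, which this sketch does not model). The `translate` function is an illustrative helper, not a library call.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (standard code; table 11
# uses the same assignments for elongation and stop codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGCCTGAGT CCCAGCCTGG AGTGGGAGCG"))  # prints "MPESQPGVGA"
```

The ten residues produced match the first block of the protein sequence above. Whitespace is stripped inside `translate`, so the block-formatted sequence can be passed in as pasted.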