Gene Caci_4651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4651 
Symbol 
ID8336005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5288058 
End bp5292704 
Gene Length4647 bp 
Protein Length1548 aa 
Translation table11 
GC content70% 
IMG OID644957751 
ProductBeta-glucosidase 
Protein accessionYP_003115353 
Protein GI256393789 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00279613 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.760952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGC ACGAACGCAC CACCCGGCCC AGAGGGCGAA GGGCCGTCCA CGCCACGCTC 
GCTGTGGCGC TCGGGCTGGG TATCGCGGTG GCCGCCGCCC CAGCCTCGTC AGCCGGAGCA
GCCGGAGCAG CGGGAGCAGC CGGCGCGCAG GCGGCGCAGG CGGCGCAGAA GAAGACGCCG
CTGCCGATCT ACCTCGACAC CCACTACAGC GCGCAGGCGC GCGCCGCCGA TCTGGTGTCG
CGGATGACGC TGCCGGAGAA GGTGGAGCAG CTGAGCACCA ACAGCGGCCC GGCGATTCCG
CGGCTGGGGG TGCAGCAGTA CACGTATTGG AGCGAGGGCC AGCACGGTGT CAACACCCTC
GGCGCCAATC AGGACAACGG CGGCAACGGC GGGGCGGTGC ACTCCACGAG CTTCCCGACG
AACTTCGCCA GCACCATGTC CTGGGATCCG AGCCTGATCT ATCAGGAGAC GACTGCGATA
TCCGATGAGG CTCGCGGCTT CTTGGACAAG TCGCTGTTCG GGGTGAACCA GAACAACCTG
GGCCCGTCGG CCGCCGACTA CGGCAGCCTG ACGTTCTGGG CTCCGACGGT GAACATGGAC
CGCGATCCCC GCTGGGGTCG GACCGACGAG GCGTTCGGCG AGGACCCGTA TCTGACCTCG
ACGATGGCCG GGGCGTTCGT CAACGGCTAT GAGGGCAACA CTCCGACCGG GCAGTCCAAG
ACCGGGACGT TGAAGGTCGC GGCGACGGCG AAGCACTATG CGCTCAACGA CGTCGAGCAG
GACCGCACCG GTATCTCCTC TAATGTCAGC GACACCGATC TGCACGACTA CTACACCAAG
CAGTTCGCGA GCTTGATCGA GAACGCGCAC GTCTCAGGGC TGATGACCTC CTACAACGCG
ATCAACGGCA CCCCGTCGGT GGCGGACACC TATACCGCCA ACCAGCTCGC GCAGCGCCAG
TTCGGCTTCA ACGGCTACGT GACCTCCGAC TGCGGCGCCA TCGGCACCGC GTATCAGAGC
TTCCCCAGCG GCCATGACTG GGCGCCGCCG GGCTGGACCA CCGACGGCAA GAGCTCCACC
GGGACGTGGA CGAACACCGC GACCGGCGCC ACCGTGCCGG CGCAGGCCGG CGGCCAGGCC
TATGCGCTGC GCGCGGGCAC TGACTTGAAC TGTGCTGGAG GCGAGAACAC CTACGCGCAG
ATCACCGCGG CGATCAGCGC CGGGGTCCTG AGCGAAGGCG TCATCGACAA CGCGCTGGTG
AAGATCTTCA CGGTGCGCGT CGAGACCGGC GAGTTCGACC CGGCCGGGTC CAACCCGTAC
ACCGGCATCA CCAAGGCGCA GATCCAGAGC CCGGCGCACC AGGCGCTGGC GACGAAGGTC
GCGGACAACT CGCTGGTGCT GCTGAAGAAC CAGCCGCCGG CGGCGTCCGG CACGTCGACC
ACCCCGCCCG CGGCGTCCAG CGCGGCGTCC AGCGCGGCAG CGGCGGCCAA GCCGCTGCTT
CCGCTGTCGG CAGCGGCCAC CGCCAAGATC GTGATCGTCG GTGACATGGC GAACGCCGTC
ACCCTCGGCA ACTACTCCAG CGACCCGGCG CTGAAGGTCA GCCCGGTGCA GGGCATCACC
GCCGCGGTGC GCAAGGCCAA CCCCGGGGCG AGCGTGACCT TCGACGCCTG CGGCACCTCC
ACCACGGCGA GCGCCGCCGC GTCCTGCTCG GCGCAGACTC TCGCCGACGT CGCCGGCGCC
GATGCGGTGA TCGTGTTCGT CGGCACCAAC CAGCAGATCG CCGACGAGGG CAAGGACCGC
ACGTCCATCG CGATGCCCGG CAACTACGAC TCGCTGATCT CGCAGGTCGC GGCGGTCGGC
AACCCGCGGA TGGTGCTGGC GGTCCAGTCC GGCGGCCCGG TGCGGATCGA CGACGTCCAG
AAGGACTTCG CCTCGATCGT GTTCAGCGGC TTCAACGGCG AGAGCCAGGG CACCGCGTTG
GCCGATGTGC TGTTCGGCGC GCAGAACCCC GACGGGCACC TGGACTTCAC CTGGTACGCC
GACGACTCGC AGCTTCCGGC GATGTCGAAC TACGGCCTGA CCCCGGCGCA GACCGGCGGT
CTGGGCCGGA CGTACATGTA CTTCACCGGC ACTCCGACCT ACCCGTTCGG CTACGGCCTG
AGCTACTCCA CGTTCTCCTT CTCCGGTGTT CACGCCGAGG GCCGCAGCGT TGACGCCAAC
GGCAGCCAGA GCGTGTCGGT GACGGTGAAG AACACCGGCA AGACCGCGGG CAGCACCGTG
GCGCAGCTGT ACGCGCAGCC GAAGTTCACG GTCGCCGGGC AAACGTTCCC GAACGAGCAG
CTGGTCGGGT TCGCCAAGAG CAAGGTGCTC AAGCCCGGCG AGAGCCAGCA TCTGACGATC
ACCGCGCACA TCCCCGACCT GGGCATCTGG GATCCGGCCA CCATGAAGTC GGTCGTGTAT
GACGGCACCT ACTCCTTCGG CGTCGGCGCC GACGCCTCCG ACATCCGCGG CAGCGCCGAC
GTCGCCGTGA CCGGCGCTCT GAGCTCGCGC GTCAGCACCG TCACGGTGCA GCCGGACCAG
GTCGACTTCC AGGTCGGTCA GAGCCTTGAC CTGACGGGCA AGAACCCGTG GATCGCCGAT
GACACCACCG GCGTCGGCTC GGTCCCGCAG GACCGGAACA TGGCCGTCAC CGCCGACGGC
ATCATCGAGG CCGCCGACAC CGACGGCAGC TTCGCCGACC TGTCCAAGGC GCACGTCAGC
TACCGCAGCA GCGACCCGCG CGTCGCCACC GTCAACGCCA AGGGCGTCCT GACCGCCGTC
GGAACGGGAA CCGCGGACAT CACGGTCACC GTCAACGGCG TGTCCGGGTC CACACCGGTC
GTCGTCGGCC ACGCGGTGTC GGTGAGCACT CCGGCGCTCG CCCAGCCCGG TCAGACCTCG
ACAGTCACCA CCACGTTCAC CAACACCGCC GCTGCCACCG CCTCCTCCTC CGGTGCGGTG
AGCAACGTCG CGATGAACCT GGACCTGCCC TCGGGCTGGA CGGCGACCGC GACCAGCCCG
GCGAGCTTCG CGCACGTCGC AGCCGGGGCG AAGGTCAGCA CCACCTGGTC GGTGACAGTG
CCGGCCAACA CCGGCGGCAC GTACACGATC AACGCTGACG CGACCGTCGG CGGGCAGCAC
GACAGCACCG GCTTCAGCCA GCTGGCGGTG CCGTTCACCT CGCTGCCGGC GGCGTTCAAC
AACGACGCGA TCACCAACGA CAGCAACCGC GGCGGCGCCG ACCTGGACGG CGCCGGCGCG
AGCTTCTCGG CGCAGGCGCT GGCCTCGGTC GGGGTCACCC CCGGCGCGCC GCTGGTCCAC
GACGGTCTGA CCTTCACCTG GCCGGACCGG CAGGTCGGGC AGTCCGACAA CGTGGTCGCC
GCCGGACAGA CCATCGACAT CTCCGGCTCG GGCTCCACCC TGGGTCTGCT GGGCACCAGC
ACCTGGGGAG CCAGCAGCGG CAGCGGCACC ATCGCCTACA CCGACGGCAG CACCCAGCCG
TACACGATCG CGTTCGGCGA CTGGGCCAAC GGCACACCCC CGACCGGAGG CGACGTCGCG
ATCCGCGCGC CCTACGGCAA CCAGCCGGGG AACCAGACCG GGTGGGCGGC GACCATCGAC
TACTTCCCCA TCACCCTGGA CGCGACCAAG ACCGTGCAGT CGATCACGTT GCCGCCCGGC
AGCGCGCAGC CGCACGGCGG CACCCCGGCG ATGCACATCT TCGCCATGTC GATCAAGTCC
GACCAGCTGT CCGTCACCGC GCCGACGGCG CTGGCGGCGG GATCGTCGGG CACGGTGACC
ACGACGCTGA CGAACTCCTC GCCGGCGGCG CTGAGCACCG TGGCACTGGC GTTGACGCTG
CCGGCCGGGT GGACGGCGAC CAACAGCACC CCGGACACGT TCGCCACAGT GGGCGCCGGT
GAGACGGTGT CGACGACCTG GTCGGTGTCG GTGCCGGCGA CTCAGCAGCC GGGGTCGCAG
GTGATCGGCG TGACCGAGAG CGTGGGCGGC GCGCAGGCCG GGATCTCCGG CACGCAGACC
ACCGTCGCGT ACCCGTCGCT GACCGCAGGG TTCAACAACG TGTCCATCAC CGATGACGCG
AACCACGGAC CCGGCAACAT CGACGGCGGC GGCAACAGCT TCTCCGCGCA GGCGCTCGCC
GCCGCCGGGC TGACCCCGGG TGATGCGTTC ACCTTCGACG GGCTCGCCTT CACCTGGCCG
GCCGCCGCCG CGGGCACCAG CGACAACGTC GAAGCCGACG GCCGCTCGTT CGCCATTGCC
GGCAGCGGCA GCGGCGACGC CAAGCTCGGG TTCCTCGGCG CCGCCGCCAA CGGCCAGTCC
TCCGGCACCG CGACCATCAC CTACACCGAC GGCACCACGC AGCAGTTCAC CCTCGGGTTC
GGCGACTGGG CCTCGACGTC GCCGTTCCCC GGATCGCAGG TCGCGGTGAC CTCCGCTTAC
GGCAACACCT CCAGCGGAAC GTCGCCGTGG AAGGCCTCGA TCTTCTACGA CAGCGTGGCA
CTGCAGGCTG GCAAGACCGT TGCGACGGTG ACGCTGCCGG TCGGCAACTC CTCGCCGCTG
CACGTGTTCG CGGCGGCGAT CGGCTGA
 
Protein sequence
MEQHERTTRP RGRRAVHATL AVALGLGIAV AAAPASSAGA AGAAGAAGAQ AAQAAQKKTP 
LPIYLDTHYS AQARAADLVS RMTLPEKVEQ LSTNSGPAIP RLGVQQYTYW SEGQHGVNTL
GANQDNGGNG GAVHSTSFPT NFASTMSWDP SLIYQETTAI SDEARGFLDK SLFGVNQNNL
GPSAADYGSL TFWAPTVNMD RDPRWGRTDE AFGEDPYLTS TMAGAFVNGY EGNTPTGQSK
TGTLKVAATA KHYALNDVEQ DRTGISSNVS DTDLHDYYTK QFASLIENAH VSGLMTSYNA
INGTPSVADT YTANQLAQRQ FGFNGYVTSD CGAIGTAYQS FPSGHDWAPP GWTTDGKSST
GTWTNTATGA TVPAQAGGQA YALRAGTDLN CAGGENTYAQ ITAAISAGVL SEGVIDNALV
KIFTVRVETG EFDPAGSNPY TGITKAQIQS PAHQALATKV ADNSLVLLKN QPPAASGTST
TPPAASSAAS SAAAAAKPLL PLSAAATAKI VIVGDMANAV TLGNYSSDPA LKVSPVQGIT
AAVRKANPGA SVTFDACGTS TTASAAASCS AQTLADVAGA DAVIVFVGTN QQIADEGKDR
TSIAMPGNYD SLISQVAAVG NPRMVLAVQS GGPVRIDDVQ KDFASIVFSG FNGESQGTAL
ADVLFGAQNP DGHLDFTWYA DDSQLPAMSN YGLTPAQTGG LGRTYMYFTG TPTYPFGYGL
SYSTFSFSGV HAEGRSVDAN GSQSVSVTVK NTGKTAGSTV AQLYAQPKFT VAGQTFPNEQ
LVGFAKSKVL KPGESQHLTI TAHIPDLGIW DPATMKSVVY DGTYSFGVGA DASDIRGSAD
VAVTGALSSR VSTVTVQPDQ VDFQVGQSLD LTGKNPWIAD DTTGVGSVPQ DRNMAVTADG
IIEAADTDGS FADLSKAHVS YRSSDPRVAT VNAKGVLTAV GTGTADITVT VNGVSGSTPV
VVGHAVSVST PALAQPGQTS TVTTTFTNTA AATASSSGAV SNVAMNLDLP SGWTATATSP
ASFAHVAAGA KVSTTWSVTV PANTGGTYTI NADATVGGQH DSTGFSQLAV PFTSLPAAFN
NDAITNDSNR GGADLDGAGA SFSAQALASV GVTPGAPLVH DGLTFTWPDR QVGQSDNVVA
AGQTIDISGS GSTLGLLGTS TWGASSGSGT IAYTDGSTQP YTIAFGDWAN GTPPTGGDVA
IRAPYGNQPG NQTGWAATID YFPITLDATK TVQSITLPPG SAQPHGGTPA MHIFAMSIKS
DQLSVTAPTA LAAGSSGTVT TTLTNSSPAA LSTVALALTL PAGWTATNST PDTFATVGAG
ETVSTTWSVS VPATQQPGSQ VIGVTESVGG AQAGISGTQT TVAYPSLTAG FNNVSITDDA
NHGPGNIDGG GNSFSAQALA AAGLTPGDAF TFDGLAFTWP AAAAGTSDNV EADGRSFAIA
GSGSGDAKLG FLGAAANGQS SGTATITYTD GTTQQFTLGF GDWASTSPFP GSQVAVTSAY
GNTSSGTSPW KASIFYDSVA LQAGKTVATV TLPVGNSSPL HVFAAAIG