Gene Information |
Locus tag | Caci_4651 |
Symbol | |
ID | 8336005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5288058 |
End bp | 5292704 |
Gene Length | 4647 bp |
Protein Length | 1548 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957751 |
Product | Beta-glucosidase |
Protein accession | YP_003115353 |
Protein GI | 256393789 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00279613 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.760952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGC ACGAACGCAC CACCCGGCCC AGAGGGCGAA GGGCCGTCCA CGCCACGCTC GCTGTGGCGC TCGGGCTGGG TATCGCGGTG GCCGCCGCCC CAGCCTCGTC AGCCGGAGCA GCCGGAGCAG CGGGAGCAGC CGGCGCGCAG GCGGCGCAGG CGGCGCAGAA GAAGACGCCG CTGCCGATCT ACCTCGACAC CCACTACAGC GCGCAGGCGC GCGCCGCCGA TCTGGTGTCG CGGATGACGC TGCCGGAGAA GGTGGAGCAG CTGAGCACCA ACAGCGGCCC GGCGATTCCG CGGCTGGGGG TGCAGCAGTA CACGTATTGG AGCGAGGGCC AGCACGGTGT CAACACCCTC GGCGCCAATC AGGACAACGG CGGCAACGGC GGGGCGGTGC ACTCCACGAG CTTCCCGACG AACTTCGCCA GCACCATGTC CTGGGATCCG AGCCTGATCT ATCAGGAGAC GACTGCGATA TCCGATGAGG CTCGCGGCTT CTTGGACAAG TCGCTGTTCG GGGTGAACCA GAACAACCTG GGCCCGTCGG CCGCCGACTA CGGCAGCCTG ACGTTCTGGG CTCCGACGGT GAACATGGAC CGCGATCCCC GCTGGGGTCG GACCGACGAG GCGTTCGGCG AGGACCCGTA TCTGACCTCG ACGATGGCCG GGGCGTTCGT CAACGGCTAT GAGGGCAACA CTCCGACCGG GCAGTCCAAG ACCGGGACGT TGAAGGTCGC GGCGACGGCG AAGCACTATG CGCTCAACGA CGTCGAGCAG GACCGCACCG GTATCTCCTC TAATGTCAGC GACACCGATC TGCACGACTA CTACACCAAG CAGTTCGCGA GCTTGATCGA GAACGCGCAC GTCTCAGGGC TGATGACCTC CTACAACGCG ATCAACGGCA CCCCGTCGGT GGCGGACACC TATACCGCCA ACCAGCTCGC GCAGCGCCAG TTCGGCTTCA ACGGCTACGT GACCTCCGAC TGCGGCGCCA TCGGCACCGC GTATCAGAGC TTCCCCAGCG GCCATGACTG GGCGCCGCCG GGCTGGACCA CCGACGGCAA GAGCTCCACC GGGACGTGGA CGAACACCGC GACCGGCGCC ACCGTGCCGG CGCAGGCCGG CGGCCAGGCC TATGCGCTGC GCGCGGGCAC TGACTTGAAC TGTGCTGGAG GCGAGAACAC CTACGCGCAG ATCACCGCGG CGATCAGCGC CGGGGTCCTG AGCGAAGGCG TCATCGACAA CGCGCTGGTG AAGATCTTCA CGGTGCGCGT CGAGACCGGC GAGTTCGACC CGGCCGGGTC CAACCCGTAC ACCGGCATCA CCAAGGCGCA GATCCAGAGC CCGGCGCACC AGGCGCTGGC GACGAAGGTC GCGGACAACT CGCTGGTGCT GCTGAAGAAC CAGCCGCCGG CGGCGTCCGG CACGTCGACC ACCCCGCCCG CGGCGTCCAG CGCGGCGTCC AGCGCGGCAG CGGCGGCCAA GCCGCTGCTT CCGCTGTCGG CAGCGGCCAC CGCCAAGATC GTGATCGTCG GTGACATGGC GAACGCCGTC ACCCTCGGCA ACTACTCCAG CGACCCGGCG CTGAAGGTCA GCCCGGTGCA GGGCATCACC GCCGCGGTGC GCAAGGCCAA CCCCGGGGCG AGCGTGACCT TCGACGCCTG CGGCACCTCC ACCACGGCGA GCGCCGCCGC GTCCTGCTCG GCGCAGACTC TCGCCGACGT CGCCGGCGCC GATGCGGTGA TCGTGTTCGT CGGCACCAAC CAGCAGATCG CCGACGAGGG CAAGGACCGC 
ACGTCCATCG CGATGCCCGG CAACTACGAC TCGCTGATCT CGCAGGTCGC GGCGGTCGGC AACCCGCGGA TGGTGCTGGC GGTCCAGTCC GGCGGCCCGG TGCGGATCGA CGACGTCCAG AAGGACTTCG CCTCGATCGT GTTCAGCGGC TTCAACGGCG AGAGCCAGGG CACCGCGTTG GCCGATGTGC TGTTCGGCGC GCAGAACCCC GACGGGCACC TGGACTTCAC CTGGTACGCC GACGACTCGC AGCTTCCGGC GATGTCGAAC TACGGCCTGA CCCCGGCGCA GACCGGCGGT CTGGGCCGGA CGTACATGTA CTTCACCGGC ACTCCGACCT ACCCGTTCGG CTACGGCCTG AGCTACTCCA CGTTCTCCTT CTCCGGTGTT CACGCCGAGG GCCGCAGCGT TGACGCCAAC GGCAGCCAGA GCGTGTCGGT GACGGTGAAG AACACCGGCA AGACCGCGGG CAGCACCGTG GCGCAGCTGT ACGCGCAGCC GAAGTTCACG GTCGCCGGGC AAACGTTCCC GAACGAGCAG CTGGTCGGGT TCGCCAAGAG CAAGGTGCTC AAGCCCGGCG AGAGCCAGCA TCTGACGATC ACCGCGCACA TCCCCGACCT GGGCATCTGG GATCCGGCCA CCATGAAGTC GGTCGTGTAT GACGGCACCT ACTCCTTCGG CGTCGGCGCC GACGCCTCCG ACATCCGCGG CAGCGCCGAC GTCGCCGTGA CCGGCGCTCT GAGCTCGCGC GTCAGCACCG TCACGGTGCA GCCGGACCAG GTCGACTTCC AGGTCGGTCA GAGCCTTGAC CTGACGGGCA AGAACCCGTG GATCGCCGAT GACACCACCG GCGTCGGCTC GGTCCCGCAG GACCGGAACA TGGCCGTCAC CGCCGACGGC ATCATCGAGG CCGCCGACAC CGACGGCAGC TTCGCCGACC TGTCCAAGGC GCACGTCAGC TACCGCAGCA GCGACCCGCG CGTCGCCACC GTCAACGCCA AGGGCGTCCT GACCGCCGTC GGAACGGGAA CCGCGGACAT CACGGTCACC GTCAACGGCG TGTCCGGGTC CACACCGGTC GTCGTCGGCC ACGCGGTGTC GGTGAGCACT CCGGCGCTCG CCCAGCCCGG TCAGACCTCG ACAGTCACCA CCACGTTCAC CAACACCGCC GCTGCCACCG CCTCCTCCTC CGGTGCGGTG AGCAACGTCG CGATGAACCT GGACCTGCCC TCGGGCTGGA CGGCGACCGC GACCAGCCCG GCGAGCTTCG CGCACGTCGC AGCCGGGGCG AAGGTCAGCA CCACCTGGTC GGTGACAGTG CCGGCCAACA CCGGCGGCAC GTACACGATC AACGCTGACG CGACCGTCGG CGGGCAGCAC GACAGCACCG GCTTCAGCCA GCTGGCGGTG CCGTTCACCT CGCTGCCGGC GGCGTTCAAC AACGACGCGA TCACCAACGA CAGCAACCGC GGCGGCGCCG ACCTGGACGG CGCCGGCGCG AGCTTCTCGG CGCAGGCGCT GGCCTCGGTC GGGGTCACCC CCGGCGCGCC GCTGGTCCAC GACGGTCTGA CCTTCACCTG GCCGGACCGG CAGGTCGGGC AGTCCGACAA CGTGGTCGCC GCCGGACAGA CCATCGACAT CTCCGGCTCG GGCTCCACCC TGGGTCTGCT GGGCACCAGC ACCTGGGGAG CCAGCAGCGG CAGCGGCACC ATCGCCTACA CCGACGGCAG CACCCAGCCG TACACGATCG CGTTCGGCGA CTGGGCCAAC GGCACACCCC CGACCGGAGG CGACGTCGCG ATCCGCGCGC 
CCTACGGCAA CCAGCCGGGG AACCAGACCG GGTGGGCGGC GACCATCGAC TACTTCCCCA TCACCCTGGA CGCGACCAAG ACCGTGCAGT CGATCACGTT GCCGCCCGGC AGCGCGCAGC CGCACGGCGG CACCCCGGCG ATGCACATCT TCGCCATGTC GATCAAGTCC GACCAGCTGT CCGTCACCGC GCCGACGGCG CTGGCGGCGG GATCGTCGGG CACGGTGACC ACGACGCTGA CGAACTCCTC GCCGGCGGCG CTGAGCACCG TGGCACTGGC GTTGACGCTG CCGGCCGGGT GGACGGCGAC CAACAGCACC CCGGACACGT TCGCCACAGT GGGCGCCGGT GAGACGGTGT CGACGACCTG GTCGGTGTCG GTGCCGGCGA CTCAGCAGCC GGGGTCGCAG GTGATCGGCG TGACCGAGAG CGTGGGCGGC GCGCAGGCCG GGATCTCCGG CACGCAGACC ACCGTCGCGT ACCCGTCGCT GACCGCAGGG TTCAACAACG TGTCCATCAC CGATGACGCG AACCACGGAC CCGGCAACAT CGACGGCGGC GGCAACAGCT TCTCCGCGCA GGCGCTCGCC GCCGCCGGGC TGACCCCGGG TGATGCGTTC ACCTTCGACG GGCTCGCCTT CACCTGGCCG GCCGCCGCCG CGGGCACCAG CGACAACGTC GAAGCCGACG GCCGCTCGTT CGCCATTGCC GGCAGCGGCA GCGGCGACGC CAAGCTCGGG TTCCTCGGCG CCGCCGCCAA CGGCCAGTCC TCCGGCACCG CGACCATCAC CTACACCGAC GGCACCACGC AGCAGTTCAC CCTCGGGTTC GGCGACTGGG CCTCGACGTC GCCGTTCCCC GGATCGCAGG TCGCGGTGAC CTCCGCTTAC GGCAACACCT CCAGCGGAAC GTCGCCGTGG AAGGCCTCGA TCTTCTACGA CAGCGTGGCA CTGCAGGCTG GCAAGACCGT TGCGACGGTG ACGCTGCCGG TCGGCAACTC CTCGCCGCTG CACGTGTTCG CGGCGGCGAT CGGCTGA
|
Protein sequence | MEQHERTTRP RGRRAVHATL AVALGLGIAV AAAPASSAGA AGAAGAAGAQ AAQAAQKKTP LPIYLDTHYS AQARAADLVS RMTLPEKVEQ LSTNSGPAIP RLGVQQYTYW SEGQHGVNTL GANQDNGGNG GAVHSTSFPT NFASTMSWDP SLIYQETTAI SDEARGFLDK SLFGVNQNNL GPSAADYGSL TFWAPTVNMD RDPRWGRTDE AFGEDPYLTS TMAGAFVNGY EGNTPTGQSK TGTLKVAATA KHYALNDVEQ DRTGISSNVS DTDLHDYYTK QFASLIENAH VSGLMTSYNA INGTPSVADT YTANQLAQRQ FGFNGYVTSD CGAIGTAYQS FPSGHDWAPP GWTTDGKSST GTWTNTATGA TVPAQAGGQA YALRAGTDLN CAGGENTYAQ ITAAISAGVL SEGVIDNALV KIFTVRVETG EFDPAGSNPY TGITKAQIQS PAHQALATKV ADNSLVLLKN QPPAASGTST TPPAASSAAS SAAAAAKPLL PLSAAATAKI VIVGDMANAV TLGNYSSDPA LKVSPVQGIT AAVRKANPGA SVTFDACGTS TTASAAASCS AQTLADVAGA DAVIVFVGTN QQIADEGKDR TSIAMPGNYD SLISQVAAVG NPRMVLAVQS GGPVRIDDVQ KDFASIVFSG FNGESQGTAL ADVLFGAQNP DGHLDFTWYA DDSQLPAMSN YGLTPAQTGG LGRTYMYFTG TPTYPFGYGL SYSTFSFSGV HAEGRSVDAN GSQSVSVTVK NTGKTAGSTV AQLYAQPKFT VAGQTFPNEQ LVGFAKSKVL KPGESQHLTI TAHIPDLGIW DPATMKSVVY DGTYSFGVGA DASDIRGSAD VAVTGALSSR VSTVTVQPDQ VDFQVGQSLD LTGKNPWIAD DTTGVGSVPQ DRNMAVTADG IIEAADTDGS FADLSKAHVS YRSSDPRVAT VNAKGVLTAV GTGTADITVT VNGVSGSTPV VVGHAVSVST PALAQPGQTS TVTTTFTNTA AATASSSGAV SNVAMNLDLP SGWTATATSP ASFAHVAAGA KVSTTWSVTV PANTGGTYTI NADATVGGQH DSTGFSQLAV PFTSLPAAFN NDAITNDSNR GGADLDGAGA SFSAQALASV GVTPGAPLVH DGLTFTWPDR QVGQSDNVVA AGQTIDISGS GSTLGLLGTS TWGASSGSGT IAYTDGSTQP YTIAFGDWAN GTPPTGGDVA IRAPYGNQPG NQTGWAATID YFPITLDATK TVQSITLPPG SAQPHGGTPA MHIFAMSIKS DQLSVTAPTA LAAGSSGTVT TTLTNSSPAA LSTVALALTL PAGWTATNST PDTFATVGAG ETVSTTWSVS VPATQQPGSQ VIGVTESVGG AQAGISGTQT TVAYPSLTAG FNNVSITDDA NHGPGNIDGG GNSFSAQALA AAGLTPGDAF TFDGLAFTWP AAAAGTSDNV EADGRSFAIA GSGSGDAKLG FLGAAANGQS SGTATITYTD GTTQQFTLGF GDWASTSPFP GSQVAVTSAY GNTSSGTSPW KASIFYDSVA LQAGKTVATV TLPVGNSSPL HVFAAAIG
|
| |
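The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table for Caci_4651.
START_BP = 5288058
END_BP = 5292704


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp.

    Assumes the CDS is a whole number of codons and, when has_stop is
    True, that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the table (4647 bp and 1548 aa). The strand field does not affect this check, since it changes only the reading direction of the interval, not its length.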