Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3845 |
Symbol | |
ID | 8335198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4356247 |
End bp | 4359210 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644956981 |
Product | Beta-glucosidase |
Protein accession | YP_003114584 |
Protein GI | 256393020 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.822537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0429767 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCT CGAGTCCTGG CACATCGCGA CGTCACGGAA GACTCAGGGT GCTGTTCACC CTCCTGACCG TCCTGGCGAT CCTCACGGCA CTGGTCGGCG GTACCCAGCT GGCCGTGCCG AGCAAGGCTT CGGCTGCGGG CACGACGCTG TTGTCGCAAG GCAAGTCGGC GACCGCGTCC TCCACCGAGA ACGCCGGGAC TCCGGCGAGC GCCGCCGTCG ACGGCAACAC CACCACCCGC TGGTCCAGCG CCTTCAGCGA CCCGCAGTGG ATCCAGGTCG ACCTCGGTGC GAGTGACACG ATCAGCCAGG TCGTCCTGCA ATGGGAGACC GCCTACGGCA AGTCCTTCCA GATCCAGACC TCTCCTGATG CCGCCGTGTG GACGAGTATC TACTCGACCA CCACGGGCAC CGGTGGAACC CAGACCCTGA ACGTCACAGG CACCGGCCGG TACATACGCT TGTACGGCAC CGCCCGTGGA ACCGCGTGGG GTTACTCGCT CTGGGAGTTC CAGGTCTACG GAACGACCGG TGACACCGGC GGAACCTGCG GACCGGCTAA CGCCGCGCAG GGCAGGCCGG CCACCGCCTC GTCGACCGAG AACGCGATGA CGCCGGCGAG CGCCGCCTTC GACGGAGATA CCAGTACCCG CTGGTCGAGC GCCTTCAGCG ACCCGCAGTG GGTCCAAGTG GACCTGGGCT CGTCCCAGCC GATCTGCCAC GTCGACCTGA CGTGGGAGGC CGCGTTCGCC TCGGCGTTCC AGATCCAGAC GTCGCAGGAC GCGGCGACCT GGACGACGGT CTACTCGACC ACCAGCGGCC CCGGCGGCAC CCAGTCGCTC AACGTGACGG GCACCGGCCG GTACGTCCGC ATGTACGGTA CTGCCAGGGC CACCCCGTAC GGCTATTCGC TCTGGGAGTT CCAGGTCCTC ACCGGCACCG GCACCCCGCC TCCCCCCAAC TGCCCGTGGG TCGGCTCCGC GGCTCCGGTC GCGACGCGCG TCAGCCAGGT CATCGCGGCC ATGAACCAGT CCCAGAAGAT CTCGCTGCTG CACGGCGTCG GCGGCGCGTA CGTCGGGGTC GTCGCACCGA TCCCGGCCCT GTGCGTCCCC GGCCTCAATC TGCAGGACGG TCCCCAAGGC GTCGGCGACG GGCTCGGCGG CGTCACCCAG CTGCCCGCGC CCGTAGCGGC GGCGGCGACA TGGGACACCG CGCTGGAGAA CCAGTACGGG GCGACCGAGG GCACCGAGTT CGCCGGCAAG GGCGTGAGCG TCGCTCTCGG CCCGACGGTC AACATCGTCC GCGACCCGCG GTTCGGCCGG GCCTTCGAGA CGTTCAGCGA GGACCCCTAC CTGGCCGGCC AGATAGCCGC CGCCAACATC CAGGGCATCC AGAGTCAGGG CGTGATGGCC CAGGTCAAGC ACGCGGCCGT CTACAACCAG GAGACGAACC GCAACAGCCC GGCCGACAAC GCCATCGTCG ACAACCAGAC GCTGCAGGAG ATCTACCTGC CCGCGTTCAA CGCGGCGATC ACCAAGGGCA ACGCGTCCTC GGTGATGTGC TCGTACAGCA CGATCAACGG AACCTACGCG TGTGAGAACC CGTATATCCT CAACACCGCG CTGTACCAGC AGGACGCGTT CACCGGGTTC GTCACCTCGG ACTGGGGCGC CACGCACTCG ACCGTCGCGT CGGCGAACTC CGGCCTGACG ATGGAGATGC CGGGCAGCGG CTACTTCGGC ACGGCGCTGT CCTCGGCCGT CACGGCGGGC ACGGTGACCA CGGCCACGCT GAACACCATG GTCGGCAGGG TCCTGACGAA GATGTTCGAG TTCGGGCTCT TCGACAAGGC GCCCAGCGGC TCGACCGGCG CGACCGTCAC CACGCCGGCC CACGTGGCCA CCGCGCGGAC GGTCGCCGAG GAAGGCACCG TCCTGCTCAA GAACGCCAAC AGCCTCCTGC CGCTGTCCAC CTCGACGACG CACTCGATCG CCGTGCTGGG CAACGACGGC GGCTCGGGCG TGCAGACCAG CGGCGGCGGC AGCGCCGGCG TCAGCAGCTC GGGCACCGTC TCCCCGCTGA CCGGCATCAC GAACCGGGCG GGCTCCGGCG TGACCGTCTC CTACGAGGCC GGCACCGACG CCGCCGGCGT CACCCGCGCG GTCAACCTGG CCAAGGCCTC CGACGTCGCG ATCGTTTTCG CGAACTACGG GGAGTCCGAG GGCAGCGACA TCTCGAACAT CGACCTCCCG GGCAACCAGA ACACGCTGAT CTCGTCGGTG GCCGCGGCCA ACCCGAAGAC CATCGTCGTC CTCAACACCG GCTCGGCGGT CACCATGCCG TGGCTGGGCT CGGTCGCCGG TGTTTTCGAG AACTGGTACG CCGGCCAGGA GGCGGGCAAC GCCATCGCTG CCCTGTTGTT CGGCGACGCG AACCCGTCCG GTAAGCTGCC GGTCACCTTC CCGGCGAGCC TGGCCGACGT ACCGGCGCAC ACCACGGCGC AGTGGCCCGG CACGAACAAC CAGGTCCAGT ACTCCGAAGG CGTCGACGTC GGCTACCGCT GGTACGACTC GCAGAACAAG ACACCTCTCT TCCCCTTCGG ATACGGGCTC TCCTACACCA GCTTCGGGTT CTCGAACCTG AGCGTCGGCG CCCTCTCTGG AAACACCAGC ACCGTCACCG CCACGGTCAC CAACACGGGG ACTACGGCGG GAGCCGAAGT GGCGCAGCTC TACGTGGGCG ATCCCTCGTC CACCGGCGAG CCGCCCAAAC AGCTGAAGGG CTTCGTCCGG GTGAGCCTCG CCCCCGGCCA GAGCCAGACC GTGCAGTTCA CCGTGAGCAG CCATGACCTG GCCCACTGGG CGGACTCGGC CGGCGGCTGG ACGACGACGT CGGGGGCCTA CCAGATCCTC GTCGGCGACT CGTCCAGGAA CCTCCCGCTG ACCGGCACCA TCACGGTTCC CTGA
|
Protein sequence | MARSSPGTSR RHGRLRVLFT LLTVLAILTA LVGGTQLAVP SKASAAGTTL LSQGKSATAS STENAGTPAS AAVDGNTTTR WSSAFSDPQW IQVDLGASDT ISQVVLQWET AYGKSFQIQT SPDAAVWTSI YSTTTGTGGT QTLNVTGTGR YIRLYGTARG TAWGYSLWEF QVYGTTGDTG GTCGPANAAQ GRPATASSTE NAMTPASAAF DGDTSTRWSS AFSDPQWVQV DLGSSQPICH VDLTWEAAFA SAFQIQTSQD AATWTTVYST TSGPGGTQSL NVTGTGRYVR MYGTARATPY GYSLWEFQVL TGTGTPPPPN CPWVGSAAPV ATRVSQVIAA MNQSQKISLL HGVGGAYVGV VAPIPALCVP GLNLQDGPQG VGDGLGGVTQ LPAPVAAAAT WDTALENQYG ATEGTEFAGK GVSVALGPTV NIVRDPRFGR AFETFSEDPY LAGQIAAANI QGIQSQGVMA QVKHAAVYNQ ETNRNSPADN AIVDNQTLQE IYLPAFNAAI TKGNASSVMC SYSTINGTYA CENPYILNTA LYQQDAFTGF VTSDWGATHS TVASANSGLT MEMPGSGYFG TALSSAVTAG TVTTATLNTM VGRVLTKMFE FGLFDKAPSG STGATVTTPA HVATARTVAE EGTVLLKNAN SLLPLSTSTT HSIAVLGNDG GSGVQTSGGG SAGVSSSGTV SPLTGITNRA GSGVTVSYEA GTDAAGVTRA VNLAKASDVA IVFANYGESE GSDISNIDLP GNQNTLISSV AAANPKTIVV LNTGSAVTMP WLGSVAGVFE NWYAGQEAGN AIAALLFGDA NPSGKLPVTF PASLADVPAH TTAQWPGTNN QVQYSEGVDV GYRWYDSQNK TPLFPFGYGL SYTSFGFSNL SVGALSGNTS TVTATVTNTG TTAGAEVAQL YVGDPSSTGE PPKQLKGFVR VSLAPGQSQT VQFTVSSHDL AHWADSAGGW TTTSGAYQIL VGDSSRNLPL TGTITVP
|
| |