Gene Caci_3845 details

Gene Information

Locus tag: Caci_3845
Symbol:
ID: 8335198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4356247
End bp: 4359210
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 69%
IMG OID: 644956981
Product: Beta-glucosidase
Protein accession: YP_003114584
Protein GI: 256393020
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
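
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4356247
END_BP = 4359210


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2964 987
```

Both results agree with the Gene Length and Protein Length fields reported in the record.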


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.822537
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0429767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGCT CGAGTCCTGG CACATCGCGA CGTCACGGAA GACTCAGGGT GCTGTTCACC 
CTCCTGACCG TCCTGGCGAT CCTCACGGCA CTGGTCGGCG GTACCCAGCT GGCCGTGCCG
AGCAAGGCTT CGGCTGCGGG CACGACGCTG TTGTCGCAAG GCAAGTCGGC GACCGCGTCC
TCCACCGAGA ACGCCGGGAC TCCGGCGAGC GCCGCCGTCG ACGGCAACAC CACCACCCGC
TGGTCCAGCG CCTTCAGCGA CCCGCAGTGG ATCCAGGTCG ACCTCGGTGC GAGTGACACG
ATCAGCCAGG TCGTCCTGCA ATGGGAGACC GCCTACGGCA AGTCCTTCCA GATCCAGACC
TCTCCTGATG CCGCCGTGTG GACGAGTATC TACTCGACCA CCACGGGCAC CGGTGGAACC
CAGACCCTGA ACGTCACAGG CACCGGCCGG TACATACGCT TGTACGGCAC CGCCCGTGGA
ACCGCGTGGG GTTACTCGCT CTGGGAGTTC CAGGTCTACG GAACGACCGG TGACACCGGC
GGAACCTGCG GACCGGCTAA CGCCGCGCAG GGCAGGCCGG CCACCGCCTC GTCGACCGAG
AACGCGATGA CGCCGGCGAG CGCCGCCTTC GACGGAGATA CCAGTACCCG CTGGTCGAGC
GCCTTCAGCG ACCCGCAGTG GGTCCAAGTG GACCTGGGCT CGTCCCAGCC GATCTGCCAC
GTCGACCTGA CGTGGGAGGC CGCGTTCGCC TCGGCGTTCC AGATCCAGAC GTCGCAGGAC
GCGGCGACCT GGACGACGGT CTACTCGACC ACCAGCGGCC CCGGCGGCAC CCAGTCGCTC
AACGTGACGG GCACCGGCCG GTACGTCCGC ATGTACGGTA CTGCCAGGGC CACCCCGTAC
GGCTATTCGC TCTGGGAGTT CCAGGTCCTC ACCGGCACCG GCACCCCGCC TCCCCCCAAC
TGCCCGTGGG TCGGCTCCGC GGCTCCGGTC GCGACGCGCG TCAGCCAGGT CATCGCGGCC
ATGAACCAGT CCCAGAAGAT CTCGCTGCTG CACGGCGTCG GCGGCGCGTA CGTCGGGGTC
GTCGCACCGA TCCCGGCCCT GTGCGTCCCC GGCCTCAATC TGCAGGACGG TCCCCAAGGC
GTCGGCGACG GGCTCGGCGG CGTCACCCAG CTGCCCGCGC CCGTAGCGGC GGCGGCGACA
TGGGACACCG CGCTGGAGAA CCAGTACGGG GCGACCGAGG GCACCGAGTT CGCCGGCAAG
GGCGTGAGCG TCGCTCTCGG CCCGACGGTC AACATCGTCC GCGACCCGCG GTTCGGCCGG
GCCTTCGAGA CGTTCAGCGA GGACCCCTAC CTGGCCGGCC AGATAGCCGC CGCCAACATC
CAGGGCATCC AGAGTCAGGG CGTGATGGCC CAGGTCAAGC ACGCGGCCGT CTACAACCAG
GAGACGAACC GCAACAGCCC GGCCGACAAC GCCATCGTCG ACAACCAGAC GCTGCAGGAG
ATCTACCTGC CCGCGTTCAA CGCGGCGATC ACCAAGGGCA ACGCGTCCTC GGTGATGTGC
TCGTACAGCA CGATCAACGG AACCTACGCG TGTGAGAACC CGTATATCCT CAACACCGCG
CTGTACCAGC AGGACGCGTT CACCGGGTTC GTCACCTCGG ACTGGGGCGC CACGCACTCG
ACCGTCGCGT CGGCGAACTC CGGCCTGACG ATGGAGATGC CGGGCAGCGG CTACTTCGGC
ACGGCGCTGT CCTCGGCCGT CACGGCGGGC ACGGTGACCA CGGCCACGCT GAACACCATG
GTCGGCAGGG TCCTGACGAA GATGTTCGAG TTCGGGCTCT TCGACAAGGC GCCCAGCGGC
TCGACCGGCG CGACCGTCAC CACGCCGGCC CACGTGGCCA CCGCGCGGAC GGTCGCCGAG
GAAGGCACCG TCCTGCTCAA GAACGCCAAC AGCCTCCTGC CGCTGTCCAC CTCGACGACG
CACTCGATCG CCGTGCTGGG CAACGACGGC GGCTCGGGCG TGCAGACCAG CGGCGGCGGC
AGCGCCGGCG TCAGCAGCTC GGGCACCGTC TCCCCGCTGA CCGGCATCAC GAACCGGGCG
GGCTCCGGCG TGACCGTCTC CTACGAGGCC GGCACCGACG CCGCCGGCGT CACCCGCGCG
GTCAACCTGG CCAAGGCCTC CGACGTCGCG ATCGTTTTCG CGAACTACGG GGAGTCCGAG
GGCAGCGACA TCTCGAACAT CGACCTCCCG GGCAACCAGA ACACGCTGAT CTCGTCGGTG
GCCGCGGCCA ACCCGAAGAC CATCGTCGTC CTCAACACCG GCTCGGCGGT CACCATGCCG
TGGCTGGGCT CGGTCGCCGG TGTTTTCGAG AACTGGTACG CCGGCCAGGA GGCGGGCAAC
GCCATCGCTG CCCTGTTGTT CGGCGACGCG AACCCGTCCG GTAAGCTGCC GGTCACCTTC
CCGGCGAGCC TGGCCGACGT ACCGGCGCAC ACCACGGCGC AGTGGCCCGG CACGAACAAC
CAGGTCCAGT ACTCCGAAGG CGTCGACGTC GGCTACCGCT GGTACGACTC GCAGAACAAG
ACACCTCTCT TCCCCTTCGG ATACGGGCTC TCCTACACCA GCTTCGGGTT CTCGAACCTG
AGCGTCGGCG CCCTCTCTGG AAACACCAGC ACCGTCACCG CCACGGTCAC CAACACGGGG
ACTACGGCGG GAGCCGAAGT GGCGCAGCTC TACGTGGGCG ATCCCTCGTC CACCGGCGAG
CCGCCCAAAC AGCTGAAGGG CTTCGTCCGG GTGAGCCTCG CCCCCGGCCA GAGCCAGACC
GTGCAGTTCA CCGTGAGCAG CCATGACCTG GCCCACTGGG CGGACTCGGC CGGCGGCTGG
ACGACGACGT CGGGGGCCTA CCAGATCCTC GTCGGCGACT CGTCCAGGAA CCTCCCGCTG
ACCGGCACCA TCACGGTTCC CTGA
 
Protein sequence
MARSSPGTSR RHGRLRVLFT LLTVLAILTA LVGGTQLAVP SKASAAGTTL LSQGKSATAS 
STENAGTPAS AAVDGNTTTR WSSAFSDPQW IQVDLGASDT ISQVVLQWET AYGKSFQIQT
SPDAAVWTSI YSTTTGTGGT QTLNVTGTGR YIRLYGTARG TAWGYSLWEF QVYGTTGDTG
GTCGPANAAQ GRPATASSTE NAMTPASAAF DGDTSTRWSS AFSDPQWVQV DLGSSQPICH
VDLTWEAAFA SAFQIQTSQD AATWTTVYST TSGPGGTQSL NVTGTGRYVR MYGTARATPY
GYSLWEFQVL TGTGTPPPPN CPWVGSAAPV ATRVSQVIAA MNQSQKISLL HGVGGAYVGV
VAPIPALCVP GLNLQDGPQG VGDGLGGVTQ LPAPVAAAAT WDTALENQYG ATEGTEFAGK
GVSVALGPTV NIVRDPRFGR AFETFSEDPY LAGQIAAANI QGIQSQGVMA QVKHAAVYNQ
ETNRNSPADN AIVDNQTLQE IYLPAFNAAI TKGNASSVMC SYSTINGTYA CENPYILNTA
LYQQDAFTGF VTSDWGATHS TVASANSGLT MEMPGSGYFG TALSSAVTAG TVTTATLNTM
VGRVLTKMFE FGLFDKAPSG STGATVTTPA HVATARTVAE EGTVLLKNAN SLLPLSTSTT
HSIAVLGNDG GSGVQTSGGG SAGVSSSGTV SPLTGITNRA GSGVTVSYEA GTDAAGVTRA
VNLAKASDVA IVFANYGESE GSDISNIDLP GNQNTLISSV AAANPKTIVV LNTGSAVTMP
WLGSVAGVFE NWYAGQEAGN AIAALLFGDA NPSGKLPVTF PASLADVPAH TTAQWPGTNN
QVQYSEGVDV GYRWYDSQNK TPLFPFGYGL SYTSFGFSNL SVGALSGNTS TVTATVTNTG
TTAGAEVAQL YVGDPSSTGE PPKQLKGFVR VSLAPGQSQT VQFTVSSHDL AHWADSAGGW
TTTSGAYQIL VGDSSRNLPL TGTITVP
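
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the listed gene sequence is the coding strand read 5' to 3', and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short; ambiguous bases such as N are not handled and would raise a KeyError.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCCCGCT CGAGTCCTGG CACATCGCGA CGTCACGGAA GACTCAGGGT GCTGTTCACC"
last_line = "ACCGGCACCA TCACGGTTCC CTGA"

print(translate(first_line))  # MARSSPGTSRRHGRLRVLFT
print(translate(last_line))   # TGTITVP*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final seven residues followed by the stop codon. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.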