Gene Caci_4946 details


Gene Information

Locus tag: Caci_4946
Symbol:
ID: 8336300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5646243
End bp: 5648012
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 68%
IMG OID: 644958045
Product: Cellulase
Protein accession: YP_003115647
Protein GI: 256394083
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
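
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption, since the record does not state it. A minimal sketch of the arithmetic follows; the function name cds_lengths is hypothetical and chosen only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and adds no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


print(cds_lengths(5646243, 5648012))  # (1770, 589)
```

With the values listed for Caci_4946 this gives 1770 bp and 589 aa, matching the Gene Length and Protein Length fields.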


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0257714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCTCG TTGTCTTGTC CAGAACCCGG TGGCGGTCCC GTACGGTCGT CGCCGGAGTC 
GTCGGCGTGC TCTTCGCCGG TGTCTCCGGG GCGGTGCTCC CGGCGAACGC CGCGACAGCC
GGTCCGGCCG CCGTCGCCGC GCCGAGTGTC GCCGCGCCGA CGAACGCCGC GACGAAGTGC
GTCGCCGCCA TGCAGCCCGG CTTCAACATC GGCAACTCCC TCGACGCGAT CCCCGACGAG
ACCTCGTGGG GCAACCCGCC GATCACCCAG GCGCTGCTCC AGAAGATCAA GTCTCTGGGA
TACAAGAGCG TGCGCCTTCC GGTGACCTGG AGCGGACACG AGGGCGCCGC CCCCGATTAC
CTGATCGACC CGGCCTGGAT GGCCCGCGTC AAGCAGGTGG TCGACTGGGC CCGGGCCGAC
GGCCTGTCCG TGGTGGTCAA TGTTCACCAC GACTCGTGGC AGTGGATCAC GAACATGCCC
ACCGACCCGA CGGTGCAGCC CCACTACGAC GCGATCTGGA CCCAGATCGC GAACGCGCTC
AAGGACGAGC CCCGCTCGGT GGTCTTCGAA GCCGACAACG AGCAGGAGTT CACCGGCGTC
ACCGACGACC AGGGCGAAGC GCTGCTCAAC ACGCTCCAGA CGGACTTCTT CCACATCGTG
CGCGGCTCAG GAGGCGCGAA CGCCACACGC TTCCTGATGC TGTCGACGCT GGGCGACTCC
GCCCAGAAGG CGTCAGAGGA CGCCCTCTCC TCCGAGATCG CCTCGCTGCA CGACCCGAAC
CTGATCGCCT CGTTCCACTA CTACGGCTAC TGGCCGTTCG GGGTGAACAT CGCCGGCGTC
GACACCTTCG ACGCCACCTC GCAGCAGGAC GTCCTGAACG CCTTCACCCT GATGCACGAC
GAGTTCGTGG CCAAGGGCAT ACCGGTCTAC GCCGGTGAGG TCGGTCTCTA CAACGACTTC
AGAGGGTTCG GCGGCCTGGA GACCGGCGAG ATGCTGAAGT ACTACGAGCT GCTGGGCTAC
GAGGCGCGCA CCACCGGCAT CACCCTGAGC TACTGGGACG ACGGCGGCCG CATCCTGGAC
CGCACCAGCC TGCAGCTGAT CGAGCCAACC ACGTTCGCCG CGGCGGCGTC GAGCTGGAAG
ACCCGCTCGG GTACCGCGTC CAACGACACG CTGTACGTGC CCAAGACGAG CCCGATCGCG
GACGAGAGTC TGACGCTCAG CCCGAACGGT CTGCACTTCA CCGGGCTCTA CGACGGGAAC
CGGCGGCTGC AAGAAGGCTG TGACTACACC GTCAGCGGCA CCAAGCTCAC CCTCAAGGCC
GCCCTGCTGA CCAAGCTGGT CGGCGCCCAG AACTACGGGG TGAACGCCAC GCTGTCAGCG
CACTTCTCGG CGGGACTGCC GTGGCAGATC AACGTCGTGA CCAACGCCCA GCCTGTGCTG
TCCGCGGCGA CCGGCACGGC GACCGATCCG CTCGCCGTCC CCACGCAGTT CAACGGCGAC
AGGGTCTTCA TGATGCAGTC CGTCTACGCG GACGGCACCA ACGCCGGTAC CGCCGCCTGG
ACCGCGTACC AGGCCTACGG TCCGCCGACG GTGTCGGGCT CGGCGTTCTC CGGTGACTAC
GCGAACAACG CAATCGTCTT GACCCCGGCC TACTTCGCCG CGCTCACCGA CGGCGCGCGC
GTGACGCTCA CCTTCCACTT CTGGAGCGGC GCCACCGCGA CGTACTACGT GACCAAGTCC
GGCAGCACCG TCACCGGAAC GCTGTCCTGA
 
Protein sequence
MRLVVLSRTR WRSRTVVAGV VGVLFAGVSG AVLPANAATA GPAAVAAPSV AAPTNAATKC 
VAAMQPGFNI GNSLDAIPDE TSWGNPPITQ ALLQKIKSLG YKSVRLPVTW SGHEGAAPDY
LIDPAWMARV KQVVDWARAD GLSVVVNVHH DSWQWITNMP TDPTVQPHYD AIWTQIANAL
KDEPRSVVFE ADNEQEFTGV TDDQGEALLN TLQTDFFHIV RGSGGANATR FLMLSTLGDS
AQKASEDALS SEIASLHDPN LIASFHYYGY WPFGVNIAGV DTFDATSQQD VLNAFTLMHD
EFVAKGIPVY AGEVGLYNDF RGFGGLETGE MLKYYELLGY EARTTGITLS YWDDGGRILD
RTSLQLIEPT TFAAAASSWK TRSGTASNDT LYVPKTSPIA DESLTLSPNG LHFTGLYDGN
RRLQEGCDYT VSGTKLTLKA ALLTKLVGAQ NYGVNATLSA HFSAGLPWQI NVVTNAQPVL
SAATGTATDP LAVPTQFNGD RVFMMQSVYA DGTNAGTAAW TAYQAYGPPT VSGSAFSGDY
ANNAIVLTPA YFAALTDGAR VTLTFHFWSG ATATYYVTKS GSTVTGTLS
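
The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The sketch below shows one way to do that and to run basic sanity checks on a coding sequence (length divisible by 3, a plausible start codon, a stop codon at the end, GC fraction). All function names are hypothetical, and the start-codon set is an assumed common subset of the initiators allowed by translation table 11, not the full list.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a common subset of bacterial initiator codons, not the full table 11 list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Check length, start codon, and stop codon of a candidate CDS."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence in this record, as printed.
print(clean_sequence("GTGCGGCTCG TTGTCTTGTC"))  # GTGCGGCTCGTTGTCTTGTC
```

Passing the full gene sequence through clean_sequence and then gc_fraction gives a value that can be compared against the GC content field. The gene here begins with GTG and ends with TGA, which is why GTG is included in the assumed start-codon set.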