Gene Caci_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4099 
Symbol 
ID8335453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4628762 
End bp4631980 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content70% 
IMG OID644957202 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003114804 
Protein GI256393240 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0624461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGAA CGGCAAGACC CCGGCTCGGC CGGGTCCGAG TCCTGATATC GCTGGCCGCG 
CTGTCCGCGT CCGCGGTCGC GACCTACGCG GCCGTGCCCG GGGCGCAGGC CGCACCGGCC
GCGTCCGCGG CGTCGGCGGC GTGTCCGTGG GTGGGATCGA CCGCGCCGAT TCCGCAACGG
GTGAGCCAAT TGCTGGCGGC CATGACGCTG AACCAGAAGG TCCAGCTGAT GACTGGATCC
AGCGGCTCGA GTTACGTCGG GTTCACCCCG GCGATCGGCT CGCTGTGCAT CCCGGCGATG
AACCTGGAGG ACGGTCCCGC CGGTGTCGCC GACGGCATGA CCAACGTCAC CCAGCTGCCC
GCGCCGGTGG ACGTCGCAGC CACCTTCGAC ACCTCCGCCG AGCAGAGCTT CGGTCAGCTG
ATCGGCGCTG AGGAGGCGGC CAAGGGCACC ACCGTCGACC TCGGTCCGAC CATCAACATC
GTGCGCGACC CGCGCTGGGG GCGCGCCTTC GAATCGGTCG GCGAGGATCC CTATCTCAAT
GGTCAGATGG GCGCCGCCGA CATCCGCGGC GTGCAGTCCA CCGGCACGAT GGCGCAGGTC
AAGCACCTGG TCGCGTACAA CCAGGAGACG AACCGCAACT CCCCGTCGGA CAACGTGATC
GCCAGCAACC AGACGCTGGA GGAGATCTAC GACCCGGCCT TCCAGACCTC GGTGCAGAAG
GGCGCCGCGT CGTCGGTGAT GTGCTCCTAC AGCACCATCA ACGGCACCTA CGCCTGCCAG
AACCCGACGG TCCTGAACAC CGTGTTGCGC AACCAGTTCG GCTTCGGCGG CTTCGTGACC
TCCGACTGGG GCGCCACGCA CGCCGGCGCC GCCTCGGTGA ACGCCGGGCT CGACCAGGAC
ATGCCGGGTG ACAACACCTA CTACGGCAGC GCCCTGATAT CCGCCGTGAA CTCCGGCCAG
GTGTCGCAGG CCACGATCAA CACCGCGGTG TCGCGCATCC TGACCGAGGA GTTCGCCTTC
GGGATGTTCG ACAAGCCGCC GACAGGCTCG CCCGGCGCGA CCGCGACCAG CTCGGCGAAC
CAGACCGCCG GCGAACACCT CGCCGAGCAG GGCACCGTGC TGCTGAAGAA CTCCGGCAAC
GTGCTGCCGT TCGGCTCGGG CGACACCTCC ATCGCCGTCA TCGGCGCGGA CGCCTCGACC
AACGTGCAGA GCGCCGGCGG CGGCAGCGCG TCGGTGAACT CCAGCGGCAC GGTCACGCCG
TTGCAGGGCA TCACCAGCGC CGCCCCGGCC GGGACCACCG TGTCCTACGA CTCCGGATCC
TCGACCAGCT CGGCGGCGGC GCTCGCGGGG AGGTCGAGTG TCGCAGTGGT CTTCGTCAGT
ACCAACGAGT CCGAGGGCAG CGACCTGTCG GGCATCGACC TGTCCAGTGC GAACAACTCG
CTGATCTCCG CGGTGGCGAA CGCGAACCCC AACACCGTCG TGGTCCTGAA CACCGGCTCG
GCGGTCACCA TGCCGTGGCT GTCCTCGGTC AAGGGCGTGC TCGAGGCCTG GTACCCGGGC
CAGAGCGACG GCACGGCGAT CGCGAGGATC CTGTACGGCA CCACCAACCC CTCCGGCCAC
CTGCCGGTGA CGTTCCCGAC CTCGCTGTCC CAAGTCCCGG CGAGCACGAG CGCGCAGTGG
CCGGGGACCA ACGGCCAGGT GCAGTACTCC GAGGGCGTGG ACGTCGGCTA CCGCTGGTAC
GACAGCAAAG GCCTGACACC GCTGTTCCCG TTCGGATACG GCCTGTCCTA CACCAGCTTC
TCCTACTCGA ATCTGCAGAT CAGCTCCCTG CCACAGGGCG GCGCGGCGAC CGTGACCGCG
ACGGTGACCA ACACCGGGTC CCGGGCCGGA GCCGACGTCG CGCAGCTGTA CGTGAGCGAC
CCGGCCGCCT CCGGCCAGCC GCCGCGCCAG CTGGAGGGCT TCGCGCGCGT GAACCTGCAG
CCGGGTCAGA GCCAGACCGT CTCCTTCCCG CTGACCGAGC AGAACCTGCA CTACTGGAGC
ACGAGCACGA ACAACTGGGC CACCAGCACC GGCAACTACG GCGTCGCCGT CGGAGACGCC
GACTCTGCCA GTGCCCTGAC ACTGTCCGGC ACGCTCGCCG TCGCGGCGAA CCAGCTCGGC
CAGCCGGTCA GCGTCACCAA CCCGGGTCCG CAGGAGGGCG TGGCCGGCGC CGCGGTCTCG
GTGCAGGTCA CGGCCGGTGA CACCACGGCC GGTCAGACCG CGGCGTTCAC CGCCGCCGGG
CTGCCGGCCG GGCTGGCGAT CTCGTCCTCC GGGAAGATCA CCGGCACGCC GATCACCGCC
GGTACCAGCA CCGTCGATGT CACCGCCAAG GACGGCAACG GAGCCACGGC CACCACGTCC
TTCGTGTGGA CGGTCGCGGC CTCCTCCGGC GGCGTTCCGA CGACGCCTCT GGTCGGTTAT
CAGGGCTTGT GCTTGGACGT GGCCGCGGCG AACAACGCCG ACGGCACCGC CGTGCAGGTC
TACACCTGCA ACGGGACCAA CTCCCAGCAG TGGACCGAGG AAGCCGACGG CACGGTGCAC
TCGCTCGGCA AGTGCCTGGA CATCGCCGCC GGCGGTACCG CGAACGGCAC CGCCGTGGAT
CTGTACACCT GTAACGGAAG CGGCGCGCAG CAGTGGCAGC CGCAGACCAA CGGCACGTTG
CGCAACCCGG CGTCCGGCCG GTGCCTCGAC GACACCGGCT CGGGCCTGTC CGGGACGAAG
ACCGAGATCT ACGACTGCTC CGGCGCGGCG AACCAAGTGT GGAAGTCCCC GGCGGGCACG
TCCACCGGTG GCGGCGGCGG CAGCACCGGT CCGATCACCG GCTACCAGGG CATGTGCGTG
GACGTGCGCA GCGCCAACAG CGCCGACGGC ACTCCGGTGC AGGTCTACAC CTGCAACGGC
ACCACCGCGC AGCAGTGGAC GGTCGAGTCC AACGGCAGCC TGCAGGCGCT GGGCAAGTGC
CTGGACGTGA ACGCCGCCGG TACCGCGAAC GGCAGCCTCG TCCAGCTCTA CACCTGCAAC
GGGACCGTCG CGCAGGTCTG GCAGGCGCAG AGCAACGGCG AGCTGGTCAA CCCGCACTCC
GGCCGGTGTC TGGACGACAC CGCGTCGGGC GGCTCCGGGA CCCAGCTGCA GATCTGGGAC
TGCACCGCCA GTGCGAACCA GAAGTGGCAG CTGCCTTGA
 
Protein sequence
MSRTARPRLG RVRVLISLAA LSASAVATYA AVPGAQAAPA ASAASAACPW VGSTAPIPQR 
VSQLLAAMTL NQKVQLMTGS SGSSYVGFTP AIGSLCIPAM NLEDGPAGVA DGMTNVTQLP
APVDVAATFD TSAEQSFGQL IGAEEAAKGT TVDLGPTINI VRDPRWGRAF ESVGEDPYLN
GQMGAADIRG VQSTGTMAQV KHLVAYNQET NRNSPSDNVI ASNQTLEEIY DPAFQTSVQK
GAASSVMCSY STINGTYACQ NPTVLNTVLR NQFGFGGFVT SDWGATHAGA ASVNAGLDQD
MPGDNTYYGS ALISAVNSGQ VSQATINTAV SRILTEEFAF GMFDKPPTGS PGATATSSAN
QTAGEHLAEQ GTVLLKNSGN VLPFGSGDTS IAVIGADAST NVQSAGGGSA SVNSSGTVTP
LQGITSAAPA GTTVSYDSGS STSSAAALAG RSSVAVVFVS TNESEGSDLS GIDLSSANNS
LISAVANANP NTVVVLNTGS AVTMPWLSSV KGVLEAWYPG QSDGTAIARI LYGTTNPSGH
LPVTFPTSLS QVPASTSAQW PGTNGQVQYS EGVDVGYRWY DSKGLTPLFP FGYGLSYTSF
SYSNLQISSL PQGGAATVTA TVTNTGSRAG ADVAQLYVSD PAASGQPPRQ LEGFARVNLQ
PGQSQTVSFP LTEQNLHYWS TSTNNWATST GNYGVAVGDA DSASALTLSG TLAVAANQLG
QPVSVTNPGP QEGVAGAAVS VQVTAGDTTA GQTAAFTAAG LPAGLAISSS GKITGTPITA
GTSTVDVTAK DGNGATATTS FVWTVAASSG GVPTTPLVGY QGLCLDVAAA NNADGTAVQV
YTCNGTNSQQ WTEEADGTVH SLGKCLDIAA GGTANGTAVD LYTCNGSGAQ QWQPQTNGTL
RNPASGRCLD DTGSGLSGTK TEIYDCSGAA NQVWKSPAGT STGGGGGSTG PITGYQGMCV
DVRSANSADG TPVQVYTCNG TTAQQWTVES NGSLQALGKC LDVNAAGTAN GSLVQLYTCN
GTVAQVWQAQ SNGELVNPHS GRCLDDTASG GSGTQLQIWD CTASANQKWQ LP