Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4099 |
Symbol | |
ID | 8335453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4628762 |
End bp | 4631980 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957202 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_003114804 |
Protein GI | 256393240 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.148886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0624461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGAA CGGCAAGACC CCGGCTCGGC CGGGTCCGAG TCCTGATATC GCTGGCCGCG CTGTCCGCGT CCGCGGTCGC GACCTACGCG GCCGTGCCCG GGGCGCAGGC CGCACCGGCC GCGTCCGCGG CGTCGGCGGC GTGTCCGTGG GTGGGATCGA CCGCGCCGAT TCCGCAACGG GTGAGCCAAT TGCTGGCGGC CATGACGCTG AACCAGAAGG TCCAGCTGAT GACTGGATCC AGCGGCTCGA GTTACGTCGG GTTCACCCCG GCGATCGGCT CGCTGTGCAT CCCGGCGATG AACCTGGAGG ACGGTCCCGC CGGTGTCGCC GACGGCATGA CCAACGTCAC CCAGCTGCCC GCGCCGGTGG ACGTCGCAGC CACCTTCGAC ACCTCCGCCG AGCAGAGCTT CGGTCAGCTG ATCGGCGCTG AGGAGGCGGC CAAGGGCACC ACCGTCGACC TCGGTCCGAC CATCAACATC GTGCGCGACC CGCGCTGGGG GCGCGCCTTC GAATCGGTCG GCGAGGATCC CTATCTCAAT GGTCAGATGG GCGCCGCCGA CATCCGCGGC GTGCAGTCCA CCGGCACGAT GGCGCAGGTC AAGCACCTGG TCGCGTACAA CCAGGAGACG AACCGCAACT CCCCGTCGGA CAACGTGATC GCCAGCAACC AGACGCTGGA GGAGATCTAC GACCCGGCCT TCCAGACCTC GGTGCAGAAG GGCGCCGCGT CGTCGGTGAT GTGCTCCTAC AGCACCATCA ACGGCACCTA CGCCTGCCAG AACCCGACGG TCCTGAACAC CGTGTTGCGC AACCAGTTCG GCTTCGGCGG CTTCGTGACC TCCGACTGGG GCGCCACGCA CGCCGGCGCC GCCTCGGTGA ACGCCGGGCT CGACCAGGAC ATGCCGGGTG ACAACACCTA CTACGGCAGC GCCCTGATAT CCGCCGTGAA CTCCGGCCAG GTGTCGCAGG CCACGATCAA CACCGCGGTG TCGCGCATCC TGACCGAGGA GTTCGCCTTC GGGATGTTCG ACAAGCCGCC GACAGGCTCG CCCGGCGCGA CCGCGACCAG CTCGGCGAAC CAGACCGCCG GCGAACACCT CGCCGAGCAG GGCACCGTGC TGCTGAAGAA CTCCGGCAAC GTGCTGCCGT TCGGCTCGGG CGACACCTCC ATCGCCGTCA TCGGCGCGGA CGCCTCGACC AACGTGCAGA GCGCCGGCGG CGGCAGCGCG TCGGTGAACT CCAGCGGCAC GGTCACGCCG TTGCAGGGCA TCACCAGCGC CGCCCCGGCC GGGACCACCG TGTCCTACGA CTCCGGATCC TCGACCAGCT CGGCGGCGGC GCTCGCGGGG AGGTCGAGTG TCGCAGTGGT CTTCGTCAGT ACCAACGAGT CCGAGGGCAG CGACCTGTCG GGCATCGACC TGTCCAGTGC GAACAACTCG CTGATCTCCG CGGTGGCGAA CGCGAACCCC AACACCGTCG TGGTCCTGAA CACCGGCTCG GCGGTCACCA TGCCGTGGCT GTCCTCGGTC AAGGGCGTGC TCGAGGCCTG GTACCCGGGC CAGAGCGACG GCACGGCGAT CGCGAGGATC CTGTACGGCA CCACCAACCC CTCCGGCCAC CTGCCGGTGA CGTTCCCGAC CTCGCTGTCC CAAGTCCCGG CGAGCACGAG CGCGCAGTGG CCGGGGACCA ACGGCCAGGT GCAGTACTCC GAGGGCGTGG ACGTCGGCTA CCGCTGGTAC GACAGCAAAG GCCTGACACC GCTGTTCCCG TTCGGATACG GCCTGTCCTA CACCAGCTTC TCCTACTCGA ATCTGCAGAT CAGCTCCCTG CCACAGGGCG GCGCGGCGAC CGTGACCGCG ACGGTGACCA ACACCGGGTC CCGGGCCGGA GCCGACGTCG CGCAGCTGTA CGTGAGCGAC CCGGCCGCCT CCGGCCAGCC GCCGCGCCAG CTGGAGGGCT TCGCGCGCGT GAACCTGCAG CCGGGTCAGA GCCAGACCGT CTCCTTCCCG CTGACCGAGC AGAACCTGCA CTACTGGAGC ACGAGCACGA ACAACTGGGC CACCAGCACC GGCAACTACG GCGTCGCCGT CGGAGACGCC GACTCTGCCA GTGCCCTGAC ACTGTCCGGC ACGCTCGCCG TCGCGGCGAA CCAGCTCGGC CAGCCGGTCA GCGTCACCAA CCCGGGTCCG CAGGAGGGCG TGGCCGGCGC CGCGGTCTCG GTGCAGGTCA CGGCCGGTGA CACCACGGCC GGTCAGACCG CGGCGTTCAC CGCCGCCGGG CTGCCGGCCG GGCTGGCGAT CTCGTCCTCC GGGAAGATCA CCGGCACGCC GATCACCGCC GGTACCAGCA CCGTCGATGT CACCGCCAAG GACGGCAACG GAGCCACGGC CACCACGTCC TTCGTGTGGA CGGTCGCGGC CTCCTCCGGC GGCGTTCCGA CGACGCCTCT GGTCGGTTAT CAGGGCTTGT GCTTGGACGT GGCCGCGGCG AACAACGCCG ACGGCACCGC CGTGCAGGTC TACACCTGCA ACGGGACCAA CTCCCAGCAG TGGACCGAGG AAGCCGACGG CACGGTGCAC TCGCTCGGCA AGTGCCTGGA CATCGCCGCC GGCGGTACCG CGAACGGCAC CGCCGTGGAT CTGTACACCT GTAACGGAAG CGGCGCGCAG CAGTGGCAGC CGCAGACCAA CGGCACGTTG CGCAACCCGG CGTCCGGCCG GTGCCTCGAC GACACCGGCT CGGGCCTGTC CGGGACGAAG ACCGAGATCT ACGACTGCTC CGGCGCGGCG AACCAAGTGT GGAAGTCCCC GGCGGGCACG TCCACCGGTG GCGGCGGCGG CAGCACCGGT CCGATCACCG GCTACCAGGG CATGTGCGTG GACGTGCGCA GCGCCAACAG CGCCGACGGC ACTCCGGTGC AGGTCTACAC CTGCAACGGC ACCACCGCGC AGCAGTGGAC GGTCGAGTCC AACGGCAGCC TGCAGGCGCT GGGCAAGTGC CTGGACGTGA ACGCCGCCGG TACCGCGAAC GGCAGCCTCG TCCAGCTCTA CACCTGCAAC GGGACCGTCG CGCAGGTCTG GCAGGCGCAG AGCAACGGCG AGCTGGTCAA CCCGCACTCC GGCCGGTGTC TGGACGACAC CGCGTCGGGC GGCTCCGGGA CCCAGCTGCA GATCTGGGAC TGCACCGCCA GTGCGAACCA GAAGTGGCAG CTGCCTTGA
|
Protein sequence | MSRTARPRLG RVRVLISLAA LSASAVATYA AVPGAQAAPA ASAASAACPW VGSTAPIPQR VSQLLAAMTL NQKVQLMTGS SGSSYVGFTP AIGSLCIPAM NLEDGPAGVA DGMTNVTQLP APVDVAATFD TSAEQSFGQL IGAEEAAKGT TVDLGPTINI VRDPRWGRAF ESVGEDPYLN GQMGAADIRG VQSTGTMAQV KHLVAYNQET NRNSPSDNVI ASNQTLEEIY DPAFQTSVQK GAASSVMCSY STINGTYACQ NPTVLNTVLR NQFGFGGFVT SDWGATHAGA ASVNAGLDQD MPGDNTYYGS ALISAVNSGQ VSQATINTAV SRILTEEFAF GMFDKPPTGS PGATATSSAN QTAGEHLAEQ GTVLLKNSGN VLPFGSGDTS IAVIGADAST NVQSAGGGSA SVNSSGTVTP LQGITSAAPA GTTVSYDSGS STSSAAALAG RSSVAVVFVS TNESEGSDLS GIDLSSANNS LISAVANANP NTVVVLNTGS AVTMPWLSSV KGVLEAWYPG QSDGTAIARI LYGTTNPSGH LPVTFPTSLS QVPASTSAQW PGTNGQVQYS EGVDVGYRWY DSKGLTPLFP FGYGLSYTSF SYSNLQISSL PQGGAATVTA TVTNTGSRAG ADVAQLYVSD PAASGQPPRQ LEGFARVNLQ PGQSQTVSFP LTEQNLHYWS TSTNNWATST GNYGVAVGDA DSASALTLSG TLAVAANQLG QPVSVTNPGP QEGVAGAAVS VQVTAGDTTA GQTAAFTAAG LPAGLAISSS GKITGTPITA GTSTVDVTAK DGNGATATTS FVWTVAASSG GVPTTPLVGY QGLCLDVAAA NNADGTAVQV YTCNGTNSQQ WTEEADGTVH SLGKCLDIAA GGTANGTAVD LYTCNGSGAQ QWQPQTNGTL RNPASGRCLD DTGSGLSGTK TEIYDCSGAA NQVWKSPAGT STGGGGGSTG PITGYQGMCV DVRSANSADG TPVQVYTCNG TTAQQWTVES NGSLQALGKC LDVNAAGTAN GSLVQLYTCN GTVAQVWQAQ SNGELVNPHS GRCLDDTASG GSGTQLQIWD CTASANQKWQ LP
|
| |