Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4906 |
Symbol | |
ID | 8336260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5591111 |
End bp | 5592922 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644958005 |
Product | Ricin B lectin |
Protein accession | YP_003115607 |
Protein GI | 256394043 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.20179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGAG CATTACGGCG CGTCGGCACT GCCGGCGCGC TGGCCGGCAT ACTCGGACTG GCTCTGCTGG TGATGCCGGC ACCGCCGGCC TCGGCCGCCA ACGAGACGGT CCACAAGTGG CTGACGACAT CAGATCTGAG CCAACACCTG ACGCAACAGA CTGATCTCGG CTTCTCCGCG TCGTCCGGTT CGGGGACCAT CAGCGTCGAC AACACGCAGA AGTTCCAGAG CATCGTGGGC TTCGGCGCCG CGATGACGGA CAGCTCGGCA TGGCTGCTCT CCGACAAGCT GAGCAGCACG GCTCACACGA ACCTGATGAA CGCGTTGTTC AGCCCGAGCC AGGGCATCGG GATGAGCTGG GTGCGGGTTC CGATGGGCTC CTCGGACTTC TCCGCCACGG CACAGCCGTA CTCCTACGAC GACAACCTGT CGGCGTCCAC GGGCACCACG GTCGGCGTCG GTTCCGGACG CTGCCTGGAC GACACCGGCA ACACGGCGAA CGGCACGCAG ATCTACATCT GGGACTGCAC CAGCGGCAAC GCCAACCAGC AGTTCGCCTA CACCAGCGCC TCGGAACTGC AGGTCGCCGG CAAGTGCCTG GACGCCAACG GCAAGGGCAC CGCCAACGGC ACCAAGGTGA TCCTGTGGAC GTGCAACGGC CAGGCGAACC AGCAGTGGAA ACTGAACACC AACGGCTCGA TCACCGGCGT GCAGTCCGGA CTGTGCCTGG ACGTCTCCGG CGCGGCCACG GCCAACGGCT CGCTGATGCA GCTGTGGGCC TGCAACGGCG CGACGAACCA GCGATGGACC CGGCCCGACC CGGCGCTCGC GAACTTCTCG ATCGCGCACG ACCTGCAGTA CATCGTCCCG GACCTGAAGG AAGCGCTCGC GCTCAACCCG GGCCTGAAGC TCATGGCGAA CCCGTGGAGC CCGCCGGGGT GGATGAAGAC GAACGGCCAG ATGAACAACG TCAACAACGC CGGATCGCTG CTTCCCGCCA GCTACGGACC GCTGGCCCAG TACTTCGTGA AGTTCCTCCA GGGCTACGCC GCGCAAGGCA TCCCGATCGC CGCGATCACC CCGCAGAACG AGCCGTCCTA CGCCACCGCC TACCCGGGGA TGCAGTTCAG CGAGCAGAAC GAAGCGGACT TCATCGCGAA CAACCTCGGG CCCGCCCTGG CCCAGGCGAA CCTCTCCCCG GCGCTGCTCG GCACCGACTT CAACACCAAC GTGCTCAGCG ACTACGCCGA GCCGCTGATG CAGAACGCGA ACGCCGCCAA GTACCTGGCG GGGACGTCCT GGCACTGCTA CGCCGGCGGC CTGAACGCCA TCAGCACCAT GCAGGCGGCG TTCCCGACCA AGGACAACTA CGAGACCGAA TGCTCTGACG GCATCGACCC GCAGAACGCG ATCGAGACCT TCATCCAGAG CACCCGCAAC TCGGCGCGGA CCGCCACGAT GTGGAACATC GTCCAGGACC AGAACAACGG CCCGGTGATC CCCGGCGGCT GCAACGCCTG CACCCCGCTG GTCACCGTCA ACCAGAGCAC CGGGAACGTG ACCTACGACG CCGGGTACTA CTCGGTCGGC CACTTCAGCA AGTTCGTGCT CCCCGGCGCG AAGCGCATCG CCTCGACCAC CACCGCGAAC CTCGACAACG TGGCGTTCCA GAATCCGGAC GGCTCGCTCG TGCTGATCGT CGACAACACC TCCAGCTCGA CGCAGTCCTT CAGCACCAGC TGGGGCGGCC AGAAGTTCAG CGACTCGCTG CCCGGCCACG GCATCGCGAC GTACGAGTGG AAGCCGGCGT GA
|
Protein sequence | MPRALRRVGT AGALAGILGL ALLVMPAPPA SAANETVHKW LTTSDLSQHL TQQTDLGFSA SSGSGTISVD NTQKFQSIVG FGAAMTDSSA WLLSDKLSST AHTNLMNALF SPSQGIGMSW VRVPMGSSDF SATAQPYSYD DNLSASTGTT VGVGSGRCLD DTGNTANGTQ IYIWDCTSGN ANQQFAYTSA SELQVAGKCL DANGKGTANG TKVILWTCNG QANQQWKLNT NGSITGVQSG LCLDVSGAAT ANGSLMQLWA CNGATNQRWT RPDPALANFS IAHDLQYIVP DLKEALALNP GLKLMANPWS PPGWMKTNGQ MNNVNNAGSL LPASYGPLAQ YFVKFLQGYA AQGIPIAAIT PQNEPSYATA YPGMQFSEQN EADFIANNLG PALAQANLSP ALLGTDFNTN VLSDYAEPLM QNANAAKYLA GTSWHCYAGG LNAISTMQAA FPTKDNYETE CSDGIDPQNA IETFIQSTRN SARTATMWNI VQDQNNGPVI PGGCNACTPL VTVNQSTGNV TYDAGYYSVG HFSKFVLPGA KRIASTTTAN LDNVAFQNPD GSLVLIVDNT SSSTQSFSTS WGGQKFSDSL PGHGIATYEW KPA
|
| |