Gene Caci_4906 details

Gene Information

Locus tag: Caci_4906
Symbol:
ID: 8336260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5591111
End bp: 5592922
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 66%
IMG OID: 644958005
Product: Ricin B lectin
Protein accession: YP_003115607
Protein GI: 256394043
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5520] O-Glycosyl hydrolase
TIGRFAM ID:
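
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5591111
END_BP = 5592922
GENE_LENGTH_BP = 1812
PROTEIN_LENGTH_AA = 603


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a complete CDS with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5592922 - 5591111 + 1 gives 1812 bp, and 1812 / 3 - 1 gives 603 aa, matching the reported values.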


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.20179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGAG CATTACGGCG CGTCGGCACT GCCGGCGCGC TGGCCGGCAT ACTCGGACTG 
GCTCTGCTGG TGATGCCGGC ACCGCCGGCC TCGGCCGCCA ACGAGACGGT CCACAAGTGG
CTGACGACAT CAGATCTGAG CCAACACCTG ACGCAACAGA CTGATCTCGG CTTCTCCGCG
TCGTCCGGTT CGGGGACCAT CAGCGTCGAC AACACGCAGA AGTTCCAGAG CATCGTGGGC
TTCGGCGCCG CGATGACGGA CAGCTCGGCA TGGCTGCTCT CCGACAAGCT GAGCAGCACG
GCTCACACGA ACCTGATGAA CGCGTTGTTC AGCCCGAGCC AGGGCATCGG GATGAGCTGG
GTGCGGGTTC CGATGGGCTC CTCGGACTTC TCCGCCACGG CACAGCCGTA CTCCTACGAC
GACAACCTGT CGGCGTCCAC GGGCACCACG GTCGGCGTCG GTTCCGGACG CTGCCTGGAC
GACACCGGCA ACACGGCGAA CGGCACGCAG ATCTACATCT GGGACTGCAC CAGCGGCAAC
GCCAACCAGC AGTTCGCCTA CACCAGCGCC TCGGAACTGC AGGTCGCCGG CAAGTGCCTG
GACGCCAACG GCAAGGGCAC CGCCAACGGC ACCAAGGTGA TCCTGTGGAC GTGCAACGGC
CAGGCGAACC AGCAGTGGAA ACTGAACACC AACGGCTCGA TCACCGGCGT GCAGTCCGGA
CTGTGCCTGG ACGTCTCCGG CGCGGCCACG GCCAACGGCT CGCTGATGCA GCTGTGGGCC
TGCAACGGCG CGACGAACCA GCGATGGACC CGGCCCGACC CGGCGCTCGC GAACTTCTCG
ATCGCGCACG ACCTGCAGTA CATCGTCCCG GACCTGAAGG AAGCGCTCGC GCTCAACCCG
GGCCTGAAGC TCATGGCGAA CCCGTGGAGC CCGCCGGGGT GGATGAAGAC GAACGGCCAG
ATGAACAACG TCAACAACGC CGGATCGCTG CTTCCCGCCA GCTACGGACC GCTGGCCCAG
TACTTCGTGA AGTTCCTCCA GGGCTACGCC GCGCAAGGCA TCCCGATCGC CGCGATCACC
CCGCAGAACG AGCCGTCCTA CGCCACCGCC TACCCGGGGA TGCAGTTCAG CGAGCAGAAC
GAAGCGGACT TCATCGCGAA CAACCTCGGG CCCGCCCTGG CCCAGGCGAA CCTCTCCCCG
GCGCTGCTCG GCACCGACTT CAACACCAAC GTGCTCAGCG ACTACGCCGA GCCGCTGATG
CAGAACGCGA ACGCCGCCAA GTACCTGGCG GGGACGTCCT GGCACTGCTA CGCCGGCGGC
CTGAACGCCA TCAGCACCAT GCAGGCGGCG TTCCCGACCA AGGACAACTA CGAGACCGAA
TGCTCTGACG GCATCGACCC GCAGAACGCG ATCGAGACCT TCATCCAGAG CACCCGCAAC
TCGGCGCGGA CCGCCACGAT GTGGAACATC GTCCAGGACC AGAACAACGG CCCGGTGATC
CCCGGCGGCT GCAACGCCTG CACCCCGCTG GTCACCGTCA ACCAGAGCAC CGGGAACGTG
ACCTACGACG CCGGGTACTA CTCGGTCGGC CACTTCAGCA AGTTCGTGCT CCCCGGCGCG
AAGCGCATCG CCTCGACCAC CACCGCGAAC CTCGACAACG TGGCGTTCCA GAATCCGGAC
GGCTCGCTCG TGCTGATCGT CGACAACACC TCCAGCTCGA CGCAGTCCTT CAGCACCAGC
TGGGGCGGCC AGAAGTTCAG CGACTCGCTG CCCGGCCACG GCATCGCGAC GTACGAGTGG
AAGCCGGCGT GA
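
The gene sequence is printed in space-separated groups of ten bases, so it needs the whitespace stripped before any computation. A minimal sketch, assuming the block has been copied into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short `example` string holds only the first and last fragments of the sequence for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Illustrative fragment only; paste the full block to check the whole gene.
example = "ATGCCACGAG CATTACGGCG\nAAGCCGGCGT GA"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full block, the cleaned length can be compared with the reported gene length of 1812 bp and the GC fraction with the reported 66%.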
 
Protein sequence
MPRALRRVGT AGALAGILGL ALLVMPAPPA SAANETVHKW LTTSDLSQHL TQQTDLGFSA 
SSGSGTISVD NTQKFQSIVG FGAAMTDSSA WLLSDKLSST AHTNLMNALF SPSQGIGMSW
VRVPMGSSDF SATAQPYSYD DNLSASTGTT VGVGSGRCLD DTGNTANGTQ IYIWDCTSGN
ANQQFAYTSA SELQVAGKCL DANGKGTANG TKVILWTCNG QANQQWKLNT NGSITGVQSG
LCLDVSGAAT ANGSLMQLWA CNGATNQRWT RPDPALANFS IAHDLQYIVP DLKEALALNP
GLKLMANPWS PPGWMKTNGQ MNNVNNAGSL LPASYGPLAQ YFVKFLQGYA AQGIPIAAIT
PQNEPSYATA YPGMQFSEQN EADFIANNLG PALAQANLSP ALLGTDFNTN VLSDYAEPLM
QNANAAKYLA GTSWHCYAGG LNAISTMQAA FPTKDNYETE CSDGIDPQNA IETFIQSTRN
SARTATMWNI VQDQNNGPVI PGGCNACTPL VTVNQSTGNV TYDAGYYSVG HFSKFVLPGA
KRIASTTTAN LDNVAFQNPD GSLVLIVDNT SSSTQSFSTS WGGQKFSDSL PGHGIATYEW
KPA