Gene Caci_4911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4911 
Symbol 
ID8336265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5599639 
End bp5601057 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content68% 
IMG OID644958010 
ProductRicin B lectin 
Protein accessionYP_003115612 
Protein GI256394048 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0412] Dienelactone hydrolase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.036155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAA GCTTGAATCC ATTCCCGAAA TCCCGTTCCG AAGACGTTCG GCGGCAGCGC 
GCCCGACCCG GACGGCGTCC GCGATCCGGC GGGGTCGCCC TGGCCGTCCT CGCACTGATC
ACCGCGTTGT TCGGCGCGGC GACCCTGTCC CCGGCGTCGG CGTCGGCGTC GGCGTCCGCA
TCGGCGCCGG CGGCGTCCCG GGTCCAGGCC GCCGCCTCGG GGAACACCTA CCAGCGCGGT
CCGGATCCGA CCCTGTCCAG CGTGGCGGCC TCCACCGGGC CGTTCGCGAC CGCGCAGGTC
TCCGTGCCCG CGGGCTACGG CTTCAAGGGC GGGATGATCT ACTACCCGAC CGACACCAGC
CTGGGGACCT GGGGCGCGGT CGCCATCGTG CCCGGCTACA CCGCGCTGTT CGCGAACGAG
GAAGCCTGGA TGGGGCCCTG GCTGGCCTCC TTCGGGTTCG TGGTGATCGG CGTGGAGACC
AACAGCACCA CCGACTACGA CACGCAGCGC GGGACAGAGC TGCTGGCGGC GCTGAACTAT
CTCACCACGC AGAGCCCGGT GCGCGACCGG GTGGATCCGA CCCGGCTGGG CGTGATCGGG
CACTCGATGG GCGGCGGCGG AGTCGTCTAC GCCACCGAGC ACCAGCCCTC GCTCAAGGGC
GCCGTGGCGC TGGCGCCGTT CTCCCCGTCG CAGAGCATGG CCACGGACAC CGTGCCCACC
ATGGTCATGG GCGGCCAGAA CGACACCGTG GTCACACCGT CCTACCTCGC CGGCCTGTAT
GCGACGCTGC CCGCCTCGAC GCAGAGCGAC TTCATCCAGA TCGCCGGAGC CGATCACATC
TACTACACCC ACCCCAACCC GGTGGAGATG AGGATCCTGA TTCCCTGGCT CAAGACGTTC
CTGGACGAGG ACACCCGCTA CACCCAGTTC CTGTGCCCGA CCCTCGCCGA CCCGAGCGGG
GTGTCGATGT ACCAGAGCAA GTGCCCGTAT GTGCCCGGTG GCGGCTCTAC TCCTCCTCCG
CCGGCCGGTG GTGCGCTGCA CGCTGTCGGT GCAGGTAAGT GTGTGGATGT GCCGAACTCG
ACCACCACCA GTGGGACGCA GGTGCAGATC TACTCCTGCA ATGGCCAGGC CAACCAGGCC
TTCACCCACA ACTCCGCCGG TGAGCTAGCC GTCACCGACG CCGGAGTCAC CGACTGCCTG
GACGCCAACG GCAAGGGAAC CACCAACGGC ACCAAGGTCA TCATCTATCC CTGCAACGGC
CAGCCCAACC AGCAATGGAC GATCAACTCC AACGGCACCA TCACCGGAGT GCAGTCAGGA
CTCTGCCTCG ACGTCACCGG CGCATCCACC GCCAACGGCG CCCTAGTGGA GCTGTGGACC
TGCAACGGCG GCAGCAACCA GAAATGGACT CTGAGCTGA
 
Protein sequence
MRRSLNPFPK SRSEDVRRQR ARPGRRPRSG GVALAVLALI TALFGAATLS PASASASASA 
SAPAASRVQA AASGNTYQRG PDPTLSSVAA STGPFATAQV SVPAGYGFKG GMIYYPTDTS
LGTWGAVAIV PGYTALFANE EAWMGPWLAS FGFVVIGVET NSTTDYDTQR GTELLAALNY
LTTQSPVRDR VDPTRLGVIG HSMGGGGVVY ATEHQPSLKG AVALAPFSPS QSMATDTVPT
MVMGGQNDTV VTPSYLAGLY ATLPASTQSD FIQIAGADHI YYTHPNPVEM RILIPWLKTF
LDEDTRYTQF LCPTLADPSG VSMYQSKCPY VPGGGSTPPP PAGGALHAVG AGKCVDVPNS
TTTSGTQVQI YSCNGQANQA FTHNSAGELA VTDAGVTDCL DANGKGTTNG TKVIIYPCNG
QPNQQWTINS NGTITGVQSG LCLDVTGAST ANGALVELWT CNGGSNQKWT LS