Gene Caci_4880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4880 
Symbol 
ID8336234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5553632 
End bp5555308 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content68% 
IMG OID644957979 
ProductRicin B lectin 
Protein accessionYP_003115581 
Protein GI256394017 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0253313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGGG CTGCGATCCT TGCCGTGGTC TGCGCGCTGG CCTCGCCTGT CGGGCTGGTC 
GGCTCGCGCG TGCAAGCCGC CGTGGCGGAC TCCCCGGCGC GTGCGGCGCC TGCGGCGGTG
GCGGCGGCCT CGACGGTGAG CGCGCCGCCG ATTGGCTGGG CGTCGTGGAA CACTTTCGCC
GCGCAGATCA ACTACAACGT CATCAAGGGG CAGGCCGACG CGTTGGCGTC CTCCGGCATG
GAAGCCGCCG GCTATCAGTA CGTGAACATC GACGAGGGCT GGTGGCAGGG TACCCGCGAC
GCCTCCGGAA ACATCACGGT GGACTCGGCA GACTGGCCCG GCGGGATGAA GGCCATCGCC
GACTACATCC ACAGCAAGGG GTTGAAGGCC GGGATCTACA CCGACGCCGG GAAGAACGGC
TGCGGCTACT ACTACCCGAC CGGCCGTCCT GCGGCTCCGG GAAGCGGCAG CGAAGGTCAT
TACGATCAGG ACTTCCTGCA GTTCTCCCAG TGGGGCTTCG ACTACGTCAA GGTCGACTGG
TGCGGCGGCA ACGCCGAGGG CTTGAACGCG CAGAACACCT ACCAGGCGAT CAGCGACGCG
ATCGGCCGCG CCACGGCGCA GACCGGCCGT CCGATGGTGC TGTCGATCTG TGATTGGGGC
AATCAGAGCC CGTGGAACTG GGCGCCTGGC ATGTCCGCGC TGTGGCGCAC CAGCGGCGAC
ATCATCTACT ACGGCCAAGC CCCCTCGATG ACCAACGTGC TGGCCAACTT CGACGCCGCG
CAGCATCCGG CCGCCCAAAG CCCCGGCCAC TACAACGATC CGGACATGCT GATCGCCGGC
ATGCCCGGAT TCACCGCGGC GCAGAACCGC ACCCACCTGA GCCTGTGGGC CATCTCCGGC
GCGCCCCTGT TGGCCGGCAA CAACTTGTCG ACTATGAGCA GCGACACCCG TGCCGTCCTG
ACCAACCCCG AAGCGATAGC CATCGACCAG GACTCCCGCG GCCAGCAGGG CGTGAAGGTG
GCCGAAGCCC AAAGCGGGCT GCAGGTGTAC AGCAAGGTCC TGTCCGGCAG CGGCCGCCGC
GCGGTGGTCC TACTCAACCG CACCGGCTCG ACGGCGACCA TCACCGCCCC ATGGTCGGCC
TTGGGACTGA CCGGCGCCGC CTCGGTCCGC GACGTGTGGG CCGCCGTCGA CCGCGGCAGC
TTCACCGGAA GTTACGCCGC CACCGTCCCC GCCGGCCAGG CCGTCCTGCT CACCGTGACC
GGCACCGACG GTACCGGCGG CGGCACCGGC AGCGCCAAGC AGATCATCGG CACCCCATCC
GGACGCTGCG TCGACATCAA CAACTCCTCC ACCACCAACG GCACCCAGGC CCAACTGTGG
GACTGCAACG GCCAAAGCAA CCAGCAGTGG ACCCCCACCG CCACCAAACA GCTGATGATC
TACGGCACCA AGTGCCTGGA CGCCTCCAAC CAGGGCACCA CCAACGGCAC CCCAGCCGTC
ATCTGGGACT GCAACGGCCA AACCAACCAG CAATGGACCA TCAACGCCAA CGGCACCATC
ACCGGCGTCC AATCAGGCCT CTGCCTCGAC GCCTCCGGCG CGGCCACCGC CAACGGCACC
AAGCTGCTGC TGTGGACATG CAGCGGCGCC GCCAACCAGA AGTGGACTGT GAAGTAG
 
Protein sequence
MLRAAILAVV CALASPVGLV GSRVQAAVAD SPARAAPAAV AAASTVSAPP IGWASWNTFA 
AQINYNVIKG QADALASSGM EAAGYQYVNI DEGWWQGTRD ASGNITVDSA DWPGGMKAIA
DYIHSKGLKA GIYTDAGKNG CGYYYPTGRP AAPGSGSEGH YDQDFLQFSQ WGFDYVKVDW
CGGNAEGLNA QNTYQAISDA IGRATAQTGR PMVLSICDWG NQSPWNWAPG MSALWRTSGD
IIYYGQAPSM TNVLANFDAA QHPAAQSPGH YNDPDMLIAG MPGFTAAQNR THLSLWAISG
APLLAGNNLS TMSSDTRAVL TNPEAIAIDQ DSRGQQGVKV AEAQSGLQVY SKVLSGSGRR
AVVLLNRTGS TATITAPWSA LGLTGAASVR DVWAAVDRGS FTGSYAATVP AGQAVLLTVT
GTDGTGGGTG SAKQIIGTPS GRCVDINNSS TTNGTQAQLW DCNGQSNQQW TPTATKQLMI
YGTKCLDASN QGTTNGTPAV IWDCNGQTNQ QWTINANGTI TGVQSGLCLD ASGAATANGT
KLLLWTCSGA ANQKWTVK