Gene Caci_5166 details


Gene Information

Locus tag: Caci_5166
Symbol:
ID: 8336520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5934689
End bp: 5936617
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 70%
IMG OID: 644958264
Product: Ricin B lectin
Protein accession: YP_003115866
Protein GI: 256394302
COG category:
COG ID:
TIGRFAM ID:
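
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


print(cds_lengths(5934689, 5936617))  # (1929, 642)
```

Under these assumptions the result agrees with the Gene Length and Protein Length values listed in the record.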


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.400775
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0878066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGGCACG CGTCCCAGTC CGGCGGTCTC CGCCGCCGAA CGTCGATCGT CGCGGCCGCG 
GTGGTGCTGT CCGCCGCGGC CGCGATCCCC ACGGCCTTGG CCGCGCAGGC TCCGTCGGCG
GCTGCCGCGG TCGCCAGCCC GACCGCGAGC GCGGTCATCA GCGTCTCGAG CGCCGGTACC
GGTACGGCGC TGACGAATGC CGACGTCGGT CTGTCCTACG AAGCGTCCTT CCTCGCCTTG
CCGGGCTTCG GGCAAGGTAA CCAGTTCCAG TACCTCAAGA CCCTGGGAAC CTCGGTCGTG
CGTATCGGCG GCAACCAGGT CGACCGGAGC TTCTGGACGT CCACCAATGA GGCGCAGCCC
TCGTGGGCGG ACGCCACGAT CACCCCCGCC GATCTGACCG CGCTGGCGAA CCTGGCCACG
AAGAGCGGCT GGAAGGTGAT CCTGGGTGTG ACCATGAAGG AGTACGACCC GGCCCGCGCC
GCCGACGAGG CCAAGCACGC CGCGGCCGCG CTCGGCTCGT CGTTGCAGGA CATCGAGATC
GGCAACGAGC CGGACCTGTA TCCGCAGTAC AGCGGCAACT CCGCGCAGTA CGTCACCGAC
TTCCACGCCT ACGTGCGGGC GATCACCGCC GCGGCGCCCG GGGTGAAGAT CGAGGGCAGC
GACGCCGCGA CGTCGCCGAC CGGCGCGCTG CAGACGGCGT TCGTCAACGA CCAGGCGTCG
ATGGCGAGCC CGCAAATCAG CGAGCTGACG AGCCACCATT ACCCGCTGTC GAACTGTGGC
AGTCCGAACC CGGCGCCGAG CATCGCCGAC CTGCTGTCCT CCTCGACGCA CGCGAAGGAG
ACCTCGGCGG CCGACAGCGC GGTGACGGTC GCGAAGCGGC TGGCGCTGCC GGCGGTCATC
GACGAGGGCA ACTCGGTGGT GTGCAGCGGG ATTCAGGGCG TGTCGGACGT CTACGCCTCG
GCGCTGTGGG CGGTGGACGA GGAGCTGAAC TTCGCGCAGG AGGGTGCGGC CGGGTACTAC
ATGCACGGCA CTGTGACGCA GTGCTACGGC ACGACGAGCT ACCCGTACTA CACGCCGCTG
TGCGCGGCGA CCGCGGCGGA TGCCGCTGCC GGCAACGTGT CCGCGCAGCC GGAGTACTAC
GGGCTGGCAG CGGTGCACGC CGCGGGCACG GGCAACTTCC TGCAGGTCAA CAACCCGTCT
TCGGCGACGG TGCGCGCCTA CGCGATCCAG CATGCTGACA AGTCGGTCAC GGTGGTGCTG
GACAACGTGG CGGATCCGGC GTCCAACGGC GCGACGAGCG TGCAGCTGAA CCTGCCGCAG
ACGTTCGGAT CGGCGTCGCG GTTCGACCTG TCGGCGTCCA GCCTGACCAC CAGGAGCGGG
ATCACGCTCG GCGGCAAGAC GGTGCAGTCC GACGGGACGC TGCCCGCGCC GGCGACGACG
TCGTCCTCGG TCGGCTCGAA CACCTTCTCC GCCTCCGTCC CGGCCGGCGA CACGGCGCTG
ATCACCTTCT CCGCACCGTC CGGCGCGAGC CCGACCACGC TGGTCGGCAG CCCGTCCGGG
AAGTGCCTGT CGGTCACCGG CGGTTCCACC GCGCCCGGGA CGACGTCGGA CATCTACACG
TGCAACGCCA GTGCCGGCGA GAGCTGGCTG GTCAACGCCA ACGGGACGGT CACCGGCGCG
TTCAGCGGTC TGTGTCTGGA GGTGCAGGGA AGCGCGACGG CGGACAAGTC GTATGTGGGT
GTGAACACGT GCACCGGCGC GGCGAACCAG CAGTGGATGG TGCACTCGGC GCCGGGTGCG
GCCGGGACGA TCGTCGGGGC GCAGTCCGAG AAGTGCCTGA GCGTGTTCGG GGCGTCGACG
GCCAACTATG CGCTCGCTGA GATCTACACG TGCAACGGGA GCTCGAGTGA GAACTGGAGC
GAGCACTAG
 
Protein sequence
MRHASQSGGL RRRTSIVAAA VVLSAAAAIP TALAAQAPSA AAAVASPTAS AVISVSSAGT 
GTALTNADVG LSYEASFLAL PGFGQGNQFQ YLKTLGTSVV RIGGNQVDRS FWTSTNEAQP
SWADATITPA DLTALANLAT KSGWKVILGV TMKEYDPARA ADEAKHAAAA LGSSLQDIEI
GNEPDLYPQY SGNSAQYVTD FHAYVRAITA AAPGVKIEGS DAATSPTGAL QTAFVNDQAS
MASPQISELT SHHYPLSNCG SPNPAPSIAD LLSSSTHAKE TSAADSAVTV AKRLALPAVI
DEGNSVVCSG IQGVSDVYAS ALWAVDEELN FAQEGAAGYY MHGTVTQCYG TTSYPYYTPL
CAATAADAAA GNVSAQPEYY GLAAVHAAGT GNFLQVNNPS SATVRAYAIQ HADKSVTVVL
DNVADPASNG ATSVQLNLPQ TFGSASRFDL SASSLTTRSG ITLGGKTVQS DGTLPAPATT
SSSVGSNTFS ASVPAGDTAL ITFSAPSGAS PTTLVGSPSG KCLSVTGGST APGTTSDIYT
CNASAGESWL VNANGTVTGA FSGLCLEVQG SATADKSYVG VNTCTGAANQ QWMVHSAPGA
AGTIVGAQSE KCLSVFGAST ANYALAEIYT CNGSSSENWS EH
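
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this, assuming the standard NCBI amino acid assignments for table 11. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers written for this illustration, not part of any library.

```python
from itertools import product

# NCBI genetic code layout: codons enumerated with bases in T, C, A, G order,
# third position varying fastest. Table 11 shares these assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine, whatever it encodes internally.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGCGGCACG CGTCCCAGTC CGGCGGTCTC"))  # MRHASQSGGL
```

The output matches the first ten residues of the protein sequence listed above. Internally, TTG would be read as leucine; only its position as the first codon makes it M here.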