Gene Caci_6252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6252 
Symbol 
ID8337615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7190368 
End bp7192581 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content67% 
IMG OID644959353 
ProductRicin B lectin 
Protein accessionYP_003116947 
Protein GI256395383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.189274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCG CACCACTGGT ACGTCCCGGT CCACGCCGCG TCTTCGCCGT CTTCTTGGCG 
GCGCTGCTCG CCCTGTTCTG TGCCAACGGC AGCCAGCCGA AGGCGTACGC GCAGAGCAAC
GGCGCGGCGC TCACGCCGCT CATGGGCTGG AGCAGCTGGT CCTTCCTGCG CAGCGCGCCG
ACCGAGGCGA AGATGAAGGC CCAAGCCCAA TCCATGTCCA GCTCCGGGCT GGTCGCCGCG
GGCTACAAGT ACGTCAACCT CGACGACTTC TACTACCTGA ACCCCGGGAC CACCGTCGAC
TCCTACGGCC GCTGGGTCAT CGACACCGGC AAGTTCCCCG ACGGCATGGC GAATCTGGGC
TCGTACATCC ACTCCCTCGG TGAGAAGTTC GGGATGTACC TGACCCCGGG CATCCCGGTC
GCGGCGTACA AGCAGAACAC GCCGATCCAG GGGACCTCGT TCCACGCCCA GGACATCGTG
TCCAACACCA GCAGCTACGC GACGAACTAC AACTTCGGCA ACGGCGCGAT GTACAACATC
GACTACGCCA AGAACCCCGC CGCCGCGCAG GCGTTCCTGA ACTCCTGGGC CAACGAGCTC
GCCGGCTACG GCATCGACTA CCTGAAGGTC GACGGCATCA GCCCGAACGA CGGCGACGCC
CAGGGCGTGG CCGACACCCA GCACTGGTCC CAGGCGCTCA ACCAGACCGG CCGCACCATC
CACCTGGAGC TGTCCAACTC GCTGACGCCC GCGGACGCCG CCTCCTGGCA GCAGTACTCC
AACGGCTGGC GCATCGACGG GGACGTCGAG TGCTACTGCG GCTCGAACTC CTCCTTCCCG
CTGACCGACT GGAACAACGT CAGCCAGCGT TTCACCGACG TGCAGCCGTG GATCGGCGTC
GGCGGCACCG GCGGCTGGAA CGACCTGGAC TCGGTGGAGA TCGGCAACGG CTCCAACGAC
GGGCTCACCC TCGACGAGCG GAAGACGCAG CTGACGCTGT GGGCGATCGA GAACTCCAAC
CTCACCCTCG GCGTCGACAT GACGCACCTG GACTCCACCG ACGTCGGGCT GCTGACCAAC
AGCGAAGTGC TCGCCGTGGA CCAGGCCGGA CACCCGGCGC GCGCGGTCGA CCGCACGACG
CCGCTGCAGA CCTGGTACGC CGCGAACGGC GACGGCAGCT ACACCGTGGC TCTGTTCAAC
CTGTCGGGCT CGGCTGCCAC GGTCACGGCG AACTGGAAGG ACATCGGGTT CACCGGCTCC
TCGGCGACGG TGCACGACGA CTGGTCGCAC AGCAACCTGG GCACGTTCGC GACGAGCTAC
GGCGTTTCAC TGCCTGCGCA CGGCACGACG CTGCTCAAGG TGGTCCCGAC CGGCGCGGCG
AGCTACAACG CGATTTCCTA CAACATAGTC AACGCGAACA GCAGCATGAA CCTGGCCACC
TCGGGCTCCT CGATCGTGCA GCAGAGCCCT GACAACGCCC TCGACCAGGA GTGGCAGCTG
GTCCCGGTCG GCGACGGCAG CTACAAGGTC CTCAACCGCA CCAGCCACCA GCAGCTGGCC
GTTCCCTCAA CCACTCAGGG TGCACAGCTC ACGCAGAAGG CCAACGACAA CGCCGCCGAC
TCGCAATGGC GCTTCGTCCC GACCGGCAGC GGCTCGTACA CGCTGAAGTC CTCCTCCGAC
GGACAGGTCG CCGACGTGTC CGGCGCCTCG ACGAGCGCCG GGGCGTCGGT GATCCAGTGG
CCCGCCAACA ACGGCGCGAA CCAGAAGTGG ACGCTGGTCC CGGTCCCGGA CGCGAACCAG
GGCTACCGGG TCGAGAACCT GCTGACCGGC GGCCGGCTGG ACGTGAACGG CGACTCCACC
GCCGACAGCG CGACACTGGT GCAGTGGTCT GACAACGGCC AGGCCGACCA GCGCTGGACC
TTCGCCAAGC AGAGCGGCGG CGCGTACACG ATCGTCAACG CCAACAGCGG CAAGCTGGTC
AACATCCCGG GCCCGACCAC CGCGACGGCG ACGCAGCTGA TCCAGTTCTC CGACGACGGG
AACAGCAACT CCCGCTGGAC GCTGGTCGAC GAAGGACCGG GCGTCGTCGG GCTGAGGAGC
GTCTACGACG GACAGATGAT CGACGTCTCC AACGGCAGCA CCAACACCGG TACGGCGGTG
ATTCAGTTCA CTGCGAACGG CGGGCAGAAC CAGGACTGGA CACTGGTGCC GTGA
 
Protein sequence
MHTAPLVRPG PRRVFAVFLA ALLALFCANG SQPKAYAQSN GAALTPLMGW SSWSFLRSAP 
TEAKMKAQAQ SMSSSGLVAA GYKYVNLDDF YYLNPGTTVD SYGRWVIDTG KFPDGMANLG
SYIHSLGEKF GMYLTPGIPV AAYKQNTPIQ GTSFHAQDIV SNTSSYATNY NFGNGAMYNI
DYAKNPAAAQ AFLNSWANEL AGYGIDYLKV DGISPNDGDA QGVADTQHWS QALNQTGRTI
HLELSNSLTP ADAASWQQYS NGWRIDGDVE CYCGSNSSFP LTDWNNVSQR FTDVQPWIGV
GGTGGWNDLD SVEIGNGSND GLTLDERKTQ LTLWAIENSN LTLGVDMTHL DSTDVGLLTN
SEVLAVDQAG HPARAVDRTT PLQTWYAANG DGSYTVALFN LSGSAATVTA NWKDIGFTGS
SATVHDDWSH SNLGTFATSY GVSLPAHGTT LLKVVPTGAA SYNAISYNIV NANSSMNLAT
SGSSIVQQSP DNALDQEWQL VPVGDGSYKV LNRTSHQQLA VPSTTQGAQL TQKANDNAAD
SQWRFVPTGS GSYTLKSSSD GQVADVSGAS TSAGASVIQW PANNGANQKW TLVPVPDANQ
GYRVENLLTG GRLDVNGDST ADSATLVQWS DNGQADQRWT FAKQSGGAYT IVNANSGKLV
NIPGPTTATA TQLIQFSDDG NSNSRWTLVD EGPGVVGLRS VYDGQMIDVS NGSTNTGTAV
IQFTANGGQN QDWTLVP