Gene Caci_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0444 
Symbol 
ID8331771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp498312 
End bp500072 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content68% 
IMG OID644953610 
ProductRicin B lectin 
Protein accessionYP_003111237 
Protein GI256389673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.828213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAT CCCCGCGGCG CCTCATCACG GCGCTCGCAG CCCTCTGTCT GGGCATCGGC 
GCGGCCCTCG GCATGATCGC CGCGCCGGCT CAGGCGGCGG CGTCGGCCAC CAGCGAGCAG
ACCTTCCTGA CCTTCTACGG CTGGTGGGAC AACACGCCGC CCGGCGCCGA CATCGCCTAT
CCGCAGATCC ACCAGACCGC GGGCGGCACC GGGACGTACG CCGATCCGAT CACCTTCGCC
ACCGACTCCA ACGAGCAGCC GCCGGGAACG ATCGTCTACG TCCCGCGCGT CGGCAAGTAC
TTCATCATGG AGGACGGCTG CGACGAGTGC AGCTCGGACT GGACCGGCCA CGGTCCCAAC
GGCGGTCCCA ACCTGCGGCA CCTGGACCTG TGGCTCGGCG GGAAGGGCGG CAACGCCTTC
GACGCCATCG AGTGCGAGGA CGCGCTGACC AACTACAACA GCGACGGCAC GCCGACCATG
GAGCCGGTGA TCGTCAATCC GCCGTCGAAC GAGACGGTGT CCTCCTCGCC GATCTTCAAC
ACCAGCACCG GCGCGTGTTA CGGCGGTGCG AAGCCGACGA TCTCCGTCGG GCAGTACAAG
AACGTCTCGA CCGGCAACTG CATGACCGAC CCGAACAACA GCTCCTCGGC CGGCGCGCTG
CTGGTGACGG CCGCCTGTGA CAGCACCGCG GCCAGCCAGC GCTTCACCTT CGACGGCACG
TTCCTGCAGA TCAACAACCT CTGCGCGGAC TACTCGACCT CGCAGATCTC GATGCAGAAA
TGTACCGCCG GACCCAGCCA ACAGTGGTCG TACAACACCG ACCTGACGTT CACCGACATC
CAGACCGGCA AGAAGTACAT CAACGACTCC TCGGGCAAGG TCAAGTCGGG CAGCAGCTCC
AGCAGCACGA AGACCTGGAC CTACGTCCCG GCCGGCTCCG GCACGACGAA CGACTTCTCC
GTGGCGGCCA GCCCGGCGAG CGCCTCCGTC ACCGCCGGCG GCACCGCGAC CGCGACGGTC
TCCACCGCCG TGACCGCCGG TGCCGCTGAG TCCGTCGCGC TGAGCGCCAG CGGCGGCCCG
GCCGGCTCCA CGGTCAGCCT GAGCCCGACC AACGTCACCT CCGGCGGCAG CTCGACCCTG
AGCGTCGCGA CCACCTCCAC GACCGCCCCC GGGACGTACA CCATCACGGT CACCGGTAAG
GCTGCGACCG GTACCCACAC CGCCACCTAC ACGCTGACGG TGAACCCCGT CTCCGGAGGG
GGCGGCTGCA CCGCGGCCCA GCTGCTGACC AACCCCGGCT TCGAGAGCGG CGCCAGCACC
GGCTGGACCG GCAGCTCCAC CCTGGGCTTC AACCCGATCA CCAACAGCAC CAGCGGCGAG
CCGACGCACG CGGGCTCCTG GGAGTCCTGG TTCAACGGCA ACGGCTCGGC CGACACGGAC
ACCGTCGCGC AGTCGGTGAC CATCCCGTCG GGCTGCACCG CGACCCTGTC CTACTGGCTG
CACATCGACA CGACCGAGAG CACGTCGACG GCCAAGCCGG ACACCTTCAG CGTGCAGCTG
CTCAACTCCT CGGGCACCGT GCTCACCACG CTGGCCACCT ACAGCAATCT GGACAAGGCC
AGCGGCTACA CCCAGCACAG CAGCGACGTG TCGGCCTACG CGGGTCAGAC CGTCAAGCTC
CGCTTCACCG GCACCGAGAC CGACAAGAAC GGCGGCACCA CCAGCTTCGT CCTCGACGAC
ACGGCGTTGA ACGCGAAGTA G
 
Protein sequence
MKISPRRLIT ALAALCLGIG AALGMIAAPA QAAASATSEQ TFLTFYGWWD NTPPGADIAY 
PQIHQTAGGT GTYADPITFA TDSNEQPPGT IVYVPRVGKY FIMEDGCDEC SSDWTGHGPN
GGPNLRHLDL WLGGKGGNAF DAIECEDALT NYNSDGTPTM EPVIVNPPSN ETVSSSPIFN
TSTGACYGGA KPTISVGQYK NVSTGNCMTD PNNSSSAGAL LVTAACDSTA ASQRFTFDGT
FLQINNLCAD YSTSQISMQK CTAGPSQQWS YNTDLTFTDI QTGKKYINDS SGKVKSGSSS
SSTKTWTYVP AGSGTTNDFS VAASPASASV TAGGTATATV STAVTAGAAE SVALSASGGP
AGSTVSLSPT NVTSGGSSTL SVATTSTTAP GTYTITVTGK AATGTHTATY TLTVNPVSGG
GGCTAAQLLT NPGFESGAST GWTGSSTLGF NPITNSTSGE PTHAGSWESW FNGNGSADTD
TVAQSVTIPS GCTATLSYWL HIDTTESTST AKPDTFSVQL LNSSGTVLTT LATYSNLDKA
SGYTQHSSDV SAYAGQTVKL RFTGTETDKN GGTTSFVLDD TALNAK