Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0444 |
Symbol | |
ID | 8331771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 498312 |
End bp | 500072 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953610 |
Product | Ricin B lectin |
Protein accession | YP_003111237 |
Protein GI | 256389673 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.828213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAT CCCCGCGGCG CCTCATCACG GCGCTCGCAG CCCTCTGTCT GGGCATCGGC GCGGCCCTCG GCATGATCGC CGCGCCGGCT CAGGCGGCGG CGTCGGCCAC CAGCGAGCAG ACCTTCCTGA CCTTCTACGG CTGGTGGGAC AACACGCCGC CCGGCGCCGA CATCGCCTAT CCGCAGATCC ACCAGACCGC GGGCGGCACC GGGACGTACG CCGATCCGAT CACCTTCGCC ACCGACTCCA ACGAGCAGCC GCCGGGAACG ATCGTCTACG TCCCGCGCGT CGGCAAGTAC TTCATCATGG AGGACGGCTG CGACGAGTGC AGCTCGGACT GGACCGGCCA CGGTCCCAAC GGCGGTCCCA ACCTGCGGCA CCTGGACCTG TGGCTCGGCG GGAAGGGCGG CAACGCCTTC GACGCCATCG AGTGCGAGGA CGCGCTGACC AACTACAACA GCGACGGCAC GCCGACCATG GAGCCGGTGA TCGTCAATCC GCCGTCGAAC GAGACGGTGT CCTCCTCGCC GATCTTCAAC ACCAGCACCG GCGCGTGTTA CGGCGGTGCG AAGCCGACGA TCTCCGTCGG GCAGTACAAG AACGTCTCGA CCGGCAACTG CATGACCGAC CCGAACAACA GCTCCTCGGC CGGCGCGCTG CTGGTGACGG CCGCCTGTGA CAGCACCGCG GCCAGCCAGC GCTTCACCTT CGACGGCACG TTCCTGCAGA TCAACAACCT CTGCGCGGAC TACTCGACCT CGCAGATCTC GATGCAGAAA TGTACCGCCG GACCCAGCCA ACAGTGGTCG TACAACACCG ACCTGACGTT CACCGACATC CAGACCGGCA AGAAGTACAT CAACGACTCC TCGGGCAAGG TCAAGTCGGG CAGCAGCTCC AGCAGCACGA AGACCTGGAC CTACGTCCCG GCCGGCTCCG GCACGACGAA CGACTTCTCC GTGGCGGCCA GCCCGGCGAG CGCCTCCGTC ACCGCCGGCG GCACCGCGAC CGCGACGGTC TCCACCGCCG TGACCGCCGG TGCCGCTGAG TCCGTCGCGC TGAGCGCCAG CGGCGGCCCG GCCGGCTCCA CGGTCAGCCT GAGCCCGACC AACGTCACCT CCGGCGGCAG CTCGACCCTG AGCGTCGCGA CCACCTCCAC GACCGCCCCC GGGACGTACA CCATCACGGT CACCGGTAAG GCTGCGACCG GTACCCACAC CGCCACCTAC ACGCTGACGG TGAACCCCGT CTCCGGAGGG GGCGGCTGCA CCGCGGCCCA GCTGCTGACC AACCCCGGCT TCGAGAGCGG CGCCAGCACC GGCTGGACCG GCAGCTCCAC CCTGGGCTTC AACCCGATCA CCAACAGCAC CAGCGGCGAG CCGACGCACG CGGGCTCCTG GGAGTCCTGG TTCAACGGCA ACGGCTCGGC CGACACGGAC ACCGTCGCGC AGTCGGTGAC CATCCCGTCG GGCTGCACCG CGACCCTGTC CTACTGGCTG CACATCGACA CGACCGAGAG CACGTCGACG GCCAAGCCGG ACACCTTCAG CGTGCAGCTG CTCAACTCCT CGGGCACCGT GCTCACCACG CTGGCCACCT ACAGCAATCT GGACAAGGCC AGCGGCTACA CCCAGCACAG CAGCGACGTG TCGGCCTACG CGGGTCAGAC CGTCAAGCTC CGCTTCACCG GCACCGAGAC CGACAAGAAC GGCGGCACCA CCAGCTTCGT CCTCGACGAC ACGGCGTTGA ACGCGAAGTA G
|
Protein sequence | MKISPRRLIT ALAALCLGIG AALGMIAAPA QAAASATSEQ TFLTFYGWWD NTPPGADIAY PQIHQTAGGT GTYADPITFA TDSNEQPPGT IVYVPRVGKY FIMEDGCDEC SSDWTGHGPN GGPNLRHLDL WLGGKGGNAF DAIECEDALT NYNSDGTPTM EPVIVNPPSN ETVSSSPIFN TSTGACYGGA KPTISVGQYK NVSTGNCMTD PNNSSSAGAL LVTAACDSTA ASQRFTFDGT FLQINNLCAD YSTSQISMQK CTAGPSQQWS YNTDLTFTDI QTGKKYINDS SGKVKSGSSS SSTKTWTYVP AGSGTTNDFS VAASPASASV TAGGTATATV STAVTAGAAE SVALSASGGP AGSTVSLSPT NVTSGGSSTL SVATTSTTAP GTYTITVTGK AATGTHTATY TLTVNPVSGG GGCTAAQLLT NPGFESGAST GWTGSSTLGF NPITNSTSGE PTHAGSWESW FNGNGSADTD TVAQSVTIPS GCTATLSYWL HIDTTESTST AKPDTFSVQL LNSSGTVLTT LATYSNLDKA SGYTQHSSDV SAYAGQTVKL RFTGTETDKN GGTTSFVLDD TALNAK
|
| |