Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4930 |
Symbol | |
ID | 8336284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5623519 |
End bp | 5625924 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644958029 |
Product | Ricin B lectin |
Protein accession | YP_003115631 |
Protein GI | 256394067 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.199916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.917817 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCCAG GAATCCGCCG ACGCGGCGGC CGCTCCCGCC CCGCGCTCCT GGCGGCCGCC GTCACGCTCG GCGCCGCCGC CTTCGGCATG CCGGCGATCT CGCACGCCGC GACCGCCGCG ACGCTGTACG TCTCCCCCAC GGGCAGTGGA ACCGCCTGCT CGGCCGGCGC GCCGTGCTCC ATCACCCAGG CGCAGTCCTC GGTCCGCTCG ATGAACACGT CCATGAGCGG CGACATCGTC GTGCAGCTGG CCGGCGGCAC CTACCGCCTG TCCGCGCCGT TGGTCTTCAC CACTGCCGAC TCCGGAAGCA ACGGACACAA CGTCATCTGG CAGGCCGCGT CCGGCGCGGC GCCGGTACTG TCCGGCGGGC AGCAGGTGAG CGGCTGGACC CTGCACGACT CGGCGAACAA CATCTGGTCC GCGCCGGTGC CCTCCGGCGC CGACTCCCGG CAGCTGTGGG TGGACAACAC CCTCGCGCCC CGGGCCGCGG TCCCGATCTC GCGCAGCGAC GTGCAGATCA CCGCCGGCGG CATGACCATC GTGAACTCGA ACCTGAACTA CCTGGCCTCG CTGCCGGAGC AGAACCGGAT CGAGCTGGAG AGCCAGAACT CCTTCACCGA CCGGTACGCG CCGGTGTCGA GCATCAGCGG CACCACGATC ACGATGCAGC AGCCCGCCTG GAACAACAAC AACTGGGGAT ACGACACCCT CGCCAAGCCC TTCGCCGGCG GCCAGGTGTA CTTGGAGAAC TCGTACTCGT TCCTCAAGAA CGCCGGGCAG TGGTTCATAG ATCCGCAGGC CGGCCAGCTG TACTACAAAG CCGCCTCCGG CCAGTCGCCC AGCAGCCACG ACGTCGAGCT CCCCCGCCTG ACCTCGCTGA TCCAGATGAG CGGCAGCTAC AGCGCTCCGG TCTCGCACAT CACGTTCCAG GGCCTGGCCT TCGAGCACAC CACCTGGCTC ACGCCGGGCA GCTCGATCGG CTACGCCGAC CAGCAGACCG GCACGTTCCT GGCCAAGCAG TACTCCCAGC CGTCCAACTT CCTGACCTCC TGCCAGTCCG GGTGCCAGCT GTTCGAGGCC ACCCGGAACA GCTGGAACCA GGCACCGGCG GCGGTCCAGG TCTCGGCCGC GGCGAACATC AGCTTCACCG GGGACACCTT CACCCACCTG GGCCAGGTCG CGCTCGGCAT CGGCAACGAC GCGGACGCGG TCGCCTCGGG CGTCGGGCTC GGAGCCTCCG GCGTCACCGT CGACCACAAC ACGTTCACGG ACGACTCCGG CGCCGGGATC GCGGTCGGCG GCGTGCAGCC GGACGCGCAC CACCCGTCCA ACCCGGCGAT GACCGTGCAG AACGTCACGA TCACCAACAA CCTGGTGTCC AACGTCGCCG AGGACTACAA GGACATGCCC GGCATCCTGT CGACGTACGC CACGCACACC GACATCGAGC ACAACGAGGT GTCGAACCTG GCCTACGACG GCATCGACGT GGGCTGGGGC TGGGGCATGA ACGACCCCGG CGGCAGCCAG GACTACGTCA ACCGCGGCAC GTACAACTAC CAGCCGATCT ACACCACCCC GACCACGCTG AAGAACACCG TCGTCAGCTA CAACAAGGTG CACGGCACCA AGAAGGTCTT CCACGACGGC GGCAGCATCT ACAACCTGTC GGCCAACCCC GGCGGCGTGA TCGACGACAA CTACGTCTAC GACAACCAGA ACACCGTCGG CCTGTACCTG GACGAGGGCT CCCGGTACCT GACGCTGACG AACAACGTCG TACAGGACGC CGGCGTCTGG GCGTTCACCA ACGCCAGCTC CACCAACAAC ACCAACGACA GCACCTTCTC CTACAACTGG TACAACTCAG GCGCGACGAA CGTCGCCACC GGCTCCCCGC ACAACAACGT GCTGACCGGC AACACGCAGG TGAGCGGAGG CTGGCCGACG GCAGCCCAGC AGGTCATCAA CAACTCCGGC GTATCCGGCA GCACCACTCC TCCGCCACCG AGCGGAGGCG CGCTGCACGC GGTCGGTGCA GGCAAGTGCG TGGACGTGCC CAACTCGACC ACCACCAGTG GCACGCAGGT GCAGATCTAC TCCTGCAACG GCCAGGCCAA CCAGGCCTTC ACCCACAACT CCACCAGCGA GCTCACCGTC ACCGACGCCG GAGTGACCGA CTGCCTGGAT GCCAACAACA AGGGAACCAC CAACGGCACC AAGGTGATCA TCTATCCCTG CAACGGCCAG CCCAACCAGC AATGGACGAT CAACTCCAAC GGCACCATCA CCGGCGTCCA ATCAGGACTC TGCCTCGACG TCACCGGCGC ATCCACCGCC AACGGCGCCC TGGTGGAGCT GTGGACCTGC AACGGCGGCA GCAACCAGCA ATGGACCCTG AGCTGA
|
Protein sequence | MIPGIRRRGG RSRPALLAAA VTLGAAAFGM PAISHAATAA TLYVSPTGSG TACSAGAPCS ITQAQSSVRS MNTSMSGDIV VQLAGGTYRL SAPLVFTTAD SGSNGHNVIW QAASGAAPVL SGGQQVSGWT LHDSANNIWS APVPSGADSR QLWVDNTLAP RAAVPISRSD VQITAGGMTI VNSNLNYLAS LPEQNRIELE SQNSFTDRYA PVSSISGTTI TMQQPAWNNN NWGYDTLAKP FAGGQVYLEN SYSFLKNAGQ WFIDPQAGQL YYKAASGQSP SSHDVELPRL TSLIQMSGSY SAPVSHITFQ GLAFEHTTWL TPGSSIGYAD QQTGTFLAKQ YSQPSNFLTS CQSGCQLFEA TRNSWNQAPA AVQVSAAANI SFTGDTFTHL GQVALGIGND ADAVASGVGL GASGVTVDHN TFTDDSGAGI AVGGVQPDAH HPSNPAMTVQ NVTITNNLVS NVAEDYKDMP GILSTYATHT DIEHNEVSNL AYDGIDVGWG WGMNDPGGSQ DYVNRGTYNY QPIYTTPTTL KNTVVSYNKV HGTKKVFHDG GSIYNLSANP GGVIDDNYVY DNQNTVGLYL DEGSRYLTLT NNVVQDAGVW AFTNASSTNN TNDSTFSYNW YNSGATNVAT GSPHNNVLTG NTQVSGGWPT AAQQVINNSG VSGSTTPPPP SGGALHAVGA GKCVDVPNST TTSGTQVQIY SCNGQANQAF THNSTSELTV TDAGVTDCLD ANNKGTTNGT KVIIYPCNGQ PNQQWTINSN GTITGVQSGL CLDVTGASTA NGALVELWTC NGGSNQQWTL S
|
| |