Gene Information |
Locus tag | Caci_3165 |
Symbol | |
ID | 8334518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3481703 |
End bp | 3487798 |
Gene Length | 6096 bp |
Protein Length | 2031 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956311 |
Product | Ricin B lectin |
Protein accession | YP_003113914 |
Protein GI | 256392350 |
COG category | |
COG ID | |
TIGRFAM ID | |
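The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the CDS is unspliced with a single terminal stop codon. The constant and function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3481703
END_BP = 3487798
GENE_LENGTH_BP = 6096
PROTEIN_LENGTH_AA = 2031


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 3487798 - 3481703 + 1 = 6096 bp, and 6096 / 3 - 1 = 2031 aa.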
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00144153 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGGGAGC ACCGAAACAG TCGTATTCGA CGACTGAGAA ATGCAATTGT CCTGGTTGCG GCGCCGGCGC TCGCGATCAC CGCGGCTCCG ATCGCGGCGG CTGGCGCCGC CGCTGCTGCG AGGTCGGACC AGCCCGCCGC GGCGGCTCCC GCCGCCGTCC AGGGCCCGAG CGCCGCTGCT GCCGCCAAAC CGGCGCCGGC CGCTCCCGAC CCGCAGGCCG CCTTGCGCAA GGCGATGGCG GTCGCCCACG CCAAAGCGGA GCAGACCAAG AAACAGGTCG TCGTCGAGGC CGCGAACACC GCGCACGACA CGATGTACGC CAACCCGGAC GGCTCCTTCA ACCTCACCCA GGACACAGAC GCCGTCCGTA CCGATGTCAC CGGTTCGTGG GGGCCGATCG ACACGACCCT GGTCCGGGCC GCGGACGGCT CGGTGCACGC CAAGGCCGCG GTCCTGGGGA TCACGTTCTC CGGCGGCGGC ACCGGCGCGC TGGCCCGGCT GGACGACGGC GGCCACAGGG CCACGCTGAC CTGGCCCGCC GCACTCCCCA AACCCGTTCT GTCCGGCAGT TCCGCCCTGT ACCCGAACGT CTATCCGGAC ACAGACCTGC GTGTCACGGC AGACGCTGAA GGCATCCGCG AAGTCCTCAT CGTCAAGACC GCGGCGGCAG CCGCCAATCC AGCTCTGTCC TCGATCGCCC TGGGCACGAC GACCAGCCCG GACCTCAAAC TCGGCACGAA CGTCAACGGC GCGATCGTCG CCACCGACGC CGCCGGCACG GCCGTGTTCG GCACCGGCAC GGCGTTGATG TGGGATTCGA GCACCAGCAC CACGACTGCC CTCGCCCGAT CTGCGGCACC GGCTCAACAA AGCGCTGACG GCCCCACCGC GAGCAGCGCG GACGGTCCCG GCAGCGCAGC GCAGACAGCC ACGATGCCCG AGTCGCTGAC GGGGTCGACC ATCAAGGTGC AGCCCGTCAA GGATCTTCTG ACAGGCAAGA ACACCAAGTT CCCGGTCTAC ATCGACCCCA CCGCGTACGC GCCATTGCAG AACTGGGCTG AGATCCACTC GGGTAATCCG GGTACCAGCT ACTTCGACGC GGAGGGCGGA TCTCAGACAG AGGTGATGCG CGCCGGATGG TACGGCGCCT CCGGCATCGT CCGCTCGATG TTCGAGTACA ACGTCGACAC TTCGCTCATC CCGGCCGGCG TCGATCCCTC GACGATGAGC GCGCAGTTCA CTGTGACCAG CCAATACTGC TCTGGCAATC CGCACTTGAT GGACGTGGTC GCCATCGACC GGTTCTTCCC CGGCGTCACA TGGCAAACAG CTCTGCATGA TGCCAACGGC AACCCTTTGG GCGCCAACGC CTACGCCGGC TACACCTCGC CGTGGAACGG GCAGGACGAC ACCGTGATCA GCGGCGGCAA CGGGGTGTAC CCGGTCGACT CACACGGTGC GACGCAGTGT TCCGGTCTGG GCGGTATCGG CTACAAGCTC CAGTTCAACG ACAGCAACCA GGTCCAGAAC GTGTACGAGC ACGGCGGCGC CACCGTCGAC GTCGGCCTGA AGTTCCATGA TGAGAGGGAC CAGACCGGTC TGTGGGCCTT CTTCCGCCAC GACTGGGAGG ACGGCTCCCC CGAGTACGAC CCTCAGAACC CCGGCAGCGC CCAGTTGAGC ATCACCTTCT GGGAGTCGCC ACGGGTGGTC GGTACCTCGA TCACTCCGAC CCCGACGGCC GGCGGTCCGT GCGACACGAC CGGCAACGCC CACGGCTACG TTCGGAAGTC CATCGGAAAC 
ACGATCACGG TCAACACCGA GATCGAGGAC ATCGATCCCA GCATTCCTCT GCAACCGCTG TTCTTCCTCA ACGATATGAC CAATTTCGCG AACGGAACCG GGAGCATCAT AGGACCCGCG GTGAACAGCG CGGACAAGGA CCATCCCGGC TCCGACGGTT GGTACCACGC CACCCAGAGC GCGACATACA GCCAGCAAAC CGCCGCGCTC CACCAAGACC CCAGCCTGAT GCTGAACGAC GGCGATACCT ACACCGCCGG CCTCGGCGCC ATCGAAGACC TCAGCCACCC ACGCTCCTCC AACTCCGAGC CGGGTCTGAA CAACTCGAAC GACGAGTGCT TCTTCACCTC GGCCCTCACG TCGCCCGACA AGCCCGAAAT CTCCTCGACG GCCTTTCCGA TCATCGGGCA GCCTGCGGTC TCCGTCGCCG GCGCCGGCGG CAGCTTCACC ATCTCGGGCA AGACCAGCGG AGTCCCCATC GACCACTACG ACTGGGCCCT GAACACCTCC GCCTCGATGG TGGGCAACGC GAACAACGGC GGTGGCACGG TGACCGGGGT CGGATACACC TCCGGAACCA GCACCCTCAC TCTGCCCGTC GGCCGCACCA CCTTCGGTGA GAACACCCTG TGGATCCGAG CAGTAGACGT CGCAGGCAAC TACTCATCCG TGTCGCAGTA CGACTTCTAC CTACCAGGCA ACCCCAAGGC CACGACCACA CTCGGCGACG TCACTGGACG CGGCGTACCC GACCTGGTCC TGGTGACACC CGACGCCAAC GGGAACGCCC ACCTGGTCGT CCAACCCGGG AACTCCGACC CCGCGATTCC GCCCACGTCA CTGCCGGCCG GCCTTCAACA GGCGGCCATC GAGGCCGCTC CGGCGTCGGC CGCACCCGAC GGCCATGACT GGAGCAACAC TCTGATCAGC CACCGCGGAG CCGCGCGCGG CGTCCCGGTG GACGATCTGT ATGCGTACTC TCTCACGACA CACGCGCTGT ACTACTACTT GAACTCCGCC GTGTTCGGCG CCGCTGTGCC GAGCGATCAG TTCTCCCGCT CCCACCAGGT CGTCGTCACC CGGCCAACCT GCGATCCCGC AGTCGACAGC AGGAACAAGT GCGCGAGCTA TGACAACGCC GACTGGTCGA AGGTCAAACA AATCCTGGCC TTCGGCTCGG TCACCGGCCA GAAGGCGGGC ACCTTCGCCG GCCGCACCAA CCTCATCACC GTTGAGGACG ACGGCAACGG CGGGACCAAC TTGTGGCTGT TCCAACCGGC TGGTGGCGAC CAGGTCACCA CTCCCAAGCT GATCGGCTCC TCCAACTTGA CCGGCGGCTA TTCCGCACTG CCCGGCTGGA ACTGGGCCAA CGTCGACCTG ATAGCGCCGG GAACCATCCC CGGTGACGCG TCCGGCGGGA CCCTGCCCGA CCTGTGGGCC CGTGACCGCA CCACCGGGAC GTTGTGGCAG TTCGACAACA AGACCTCGAA CGGCGTCGAA GACCCGACCA GTCTGGGCAA CCTGAACGCC GCCCACGCCG TCGGCCGGGT AGGGACCGCC ACCACCCAGG GTTCGTACCC GACCAGCAGC TACCTGACGC TGATCGGGGC GGGCAGCCCG ACCGTGAGCA CCGACGGCCA CGGCACCCTG ACTGAAGGCG GCCCGGGTGC CTACCCCGCT CTCTGGGGGA CCGGTCAGAA CGGCAAGCTC ACCCTGCTGC CGGGTTCTGC CGGCGGTCCG ATCACCACCT CCGACGGACC GGCGACGGCC ACTTTGGCCG ACGGCGCCTC CCGCAACGCG TGGACCCCCG 
TCAAGCAGGT CACGACCCTC GACGGCCACG ACCCGACGAC CTCGACCGGC CCGATCCAGA TCGGCTGGAC CCAGGGCACG AACAACGGCG GCGGCCCGCT GTGCCTGGAC CTGCCCGGCG GCAACGCCGC GAACGGCTCA CCGCTCCAGC AATACCGGTG TCTGAACAAC GCCAACCAGA CTTGGACCTT CACTGACGAC GGATCCATCC GCTGGACGGC GAACCAGCAC AAGTGCCTGA CCATCGGCTC GACCTGGGCC AACAACGGCG GCACCGCCAA CACGTCCCCG TTGCAGATCA GCGACTGCGT GACCGTGACG AACCCCGCCG ATCCCCAGCT CGGCGCCATC AGCAGCCTTC AACGGTTCGA AGTGCGGCAG AGCCCGGGCA TCACGGGCTG GTCTCAGCTG TACAACCCGG CATCCGGACG TTGTCTCGAC AACGGGGCGA CCACCAACTC CGGCACCGAC CCGTGGCTGT GGGATTGCGG CAACGGTCTT CAGCAGGCTT GGCTGGTTCC CGAAGCAGCC GGGTCCACCC AGCGGGCCGA AGCCGAGCTG CTCTACAACA TCAGTCGGAC CGGAACCGGC GGGCTCGGAC CCCAGGCGGA TTGCTGCGGT GTGAGCTGGT CCGGCGGTGC CCAGGAGATG TTCAACAACA CCCTTCCGAG CGCAGTTTTG ACGCTGCCGT GGTACGTCCC GTACAAGGGC ACCTACCGCG TCGTCCCGAC CATGACCTCA GCCACCGACT ACGGAAAGGT GACACTCACC GTCGATGCTG GGGCACCGAA TCAGCAGACC TTGGCTCGGA CGTACGATGC GTACAACCCC GCCGTCGCCG TCAATCCCGT CGACTTCGGC ACAATCGACC TGGCCGCCGG CGGTTTGCAC ACCTTCACCT TCACGCTGAA CGGCACGAAC GCAGCCAGCA CCGGCAACCG GTACAACATC GGCGTCGACA CCCTGCAGCT GGTTCCCACC ACGTCCACCG CCCCGGTCGC GTCCGAGACC GTGACCCCTG TGTCCAGCGT GGGGCAACCG ATCACCATCG ACGACTCCGC GACGAACCCC GGAGCAGCCT CGATCACCGC CTACGCCATC GATTTCGGCG ACGGCACCAG CGCCACATCG CCGACCCCGA ATGTCGCGAC CCACGCGTAC TCGACGCCAG GCACATACCC AGTCAAGTTG ACGGTTACCG ACGACAACGG CGCCAGCGCG TCAACCACCA AACAGGTCAT CGTCCTCAGC AGTGCGCCGG TCGCCAACGG CGACTTCGAG AGGGGGGATC TGTCTGGCTG GAGCGCGTCC TACAACTCTG CGGTCACTAC CGATAGCCCT CACAGCGGTA CCTACGCCGG GCAGATCAAC GCGCCTGCCG GGGGCAACGG GTCCGTCGAG CAAGTTGTCA GTGGGCTGAA GCCGAACACC TCCTACACGC TCACCGGCTG GGTCCGCACC GACGGCGGTG CGACCATCTT GGGCACCAAG GAGTACGACG CCGCCGATGA CGACACCGGT GCCACGACCG CAGCCACCGG CTGGACCCAA CTCAGCAACC AATTCACGAC TGGAGCAACC AACACCAGCG TCGACGTCTA CTGCTACCGG CCCACCGCAG GCGCCTCCGC CTGCGACGAC TTCACCCTGC TGGCCACCCC CGCGGCAGGC GCCGTGGGCA ACCCCGACTT CGAGACCGGG AACCTGGCCG GTTGGAACGA GTCGTACAAC GCCGGGGTCA CCACCACCAA CCCGCACGGC GGCACCTACG CCGGACAGAT CAACGCACCC ACCGGCGGCA ACGGATCCAT 
CGAACAAGTC GTCACCGGCC TGACACCCAA CACCTCCTAC ACCCTCACCG GCTGGATCCG CACCGACGGC GGCGCCACCA TCCTGGGTAC CAAGGACTAC GACGCCGACC CCGGCGACGA CACCGGCGCC ACCACCACCA ACACCGGCTG GACCCAACTC ACCAGCCAAT TCACCACCGG CACGAACAGC ACCAGCGTCG ACATCTACTG CTACCGATCC ACTGCGGGCA CCTCCGCCTG CGACGACTTC GCCTTGACGC AAACCCCGGC AACCGTGGCC AACCCCGACT TCGAAACAGG AACCCTGGAC GGCTGGGCCG CCTCCTATAA CGCCGACATC ACCACCACCA ACCCCCGCGC CGGCACCTAC GCCGGACAAC TCAACGCCGC CACCGGCGAC AACGCGTCCA TCGAACAGGT CGTCACCGGC CTGACGCCCA ACACCGCCTA CACCCTCACC GGATGGGTCC GCACCGACGG AAACACCACC TACCTGGGAG CCAAGCAGTA CGACACCGCT GGCAGCGTCA CCGACGCCAA CACCACAGCC ACCGGCTGGA CCCAACTCAC CGACCAGTTC ACCACCGACG CCACCGGAAC CAGCGTCGAC ATCTACTGCT ACCGAGACAC AGCAGGCACC TCAGCCTGCG ACGACATCAA CCTCACCAAG GACTAG
|
Protein sequence | MREHRNSRIR RLRNAIVLVA APALAITAAP IAAAGAAAAA RSDQPAAAAP AAVQGPSAAA AAKPAPAAPD PQAALRKAMA VAHAKAEQTK KQVVVEAANT AHDTMYANPD GSFNLTQDTD AVRTDVTGSW GPIDTTLVRA ADGSVHAKAA VLGITFSGGG TGALARLDDG GHRATLTWPA ALPKPVLSGS SALYPNVYPD TDLRVTADAE GIREVLIVKT AAAAANPALS SIALGTTTSP DLKLGTNVNG AIVATDAAGT AVFGTGTALM WDSSTSTTTA LARSAAPAQQ SADGPTASSA DGPGSAAQTA TMPESLTGST IKVQPVKDLL TGKNTKFPVY IDPTAYAPLQ NWAEIHSGNP GTSYFDAEGG SQTEVMRAGW YGASGIVRSM FEYNVDTSLI PAGVDPSTMS AQFTVTSQYC SGNPHLMDVV AIDRFFPGVT WQTALHDANG NPLGANAYAG YTSPWNGQDD TVISGGNGVY PVDSHGATQC SGLGGIGYKL QFNDSNQVQN VYEHGGATVD VGLKFHDERD QTGLWAFFRH DWEDGSPEYD PQNPGSAQLS ITFWESPRVV GTSITPTPTA GGPCDTTGNA HGYVRKSIGN TITVNTEIED IDPSIPLQPL FFLNDMTNFA NGTGSIIGPA VNSADKDHPG SDGWYHATQS ATYSQQTAAL HQDPSLMLND GDTYTAGLGA IEDLSHPRSS NSEPGLNNSN DECFFTSALT SPDKPEISST AFPIIGQPAV SVAGAGGSFT ISGKTSGVPI DHYDWALNTS ASMVGNANNG GGTVTGVGYT SGTSTLTLPV GRTTFGENTL WIRAVDVAGN YSSVSQYDFY LPGNPKATTT LGDVTGRGVP DLVLVTPDAN GNAHLVVQPG NSDPAIPPTS LPAGLQQAAI EAAPASAAPD GHDWSNTLIS HRGAARGVPV DDLYAYSLTT HALYYYLNSA VFGAAVPSDQ FSRSHQVVVT RPTCDPAVDS RNKCASYDNA DWSKVKQILA FGSVTGQKAG TFAGRTNLIT VEDDGNGGTN LWLFQPAGGD QVTTPKLIGS SNLTGGYSAL PGWNWANVDL IAPGTIPGDA SGGTLPDLWA RDRTTGTLWQ FDNKTSNGVE DPTSLGNLNA AHAVGRVGTA TTQGSYPTSS YLTLIGAGSP TVSTDGHGTL TEGGPGAYPA LWGTGQNGKL TLLPGSAGGP ITTSDGPATA TLADGASRNA WTPVKQVTTL DGHDPTTSTG PIQIGWTQGT NNGGGPLCLD LPGGNAANGS PLQQYRCLNN ANQTWTFTDD GSIRWTANQH KCLTIGSTWA NNGGTANTSP LQISDCVTVT NPADPQLGAI SSLQRFEVRQ SPGITGWSQL YNPASGRCLD NGATTNSGTD PWLWDCGNGL QQAWLVPEAA GSTQRAEAEL LYNISRTGTG GLGPQADCCG VSWSGGAQEM FNNTLPSAVL TLPWYVPYKG TYRVVPTMTS ATDYGKVTLT VDAGAPNQQT LARTYDAYNP AVAVNPVDFG TIDLAAGGLH TFTFTLNGTN AASTGNRYNI GVDTLQLVPT TSTAPVASET VTPVSSVGQP ITIDDSATNP GAASITAYAI DFGDGTSATS PTPNVATHAY STPGTYPVKL TVTDDNGASA STTKQVIVLS SAPVANGDFE RGDLSGWSAS YNSAVTTDSP HSGTYAGQIN APAGGNGSVE QVVSGLKPNT SYTLTGWVRT DGGATILGTK EYDAADDDTG ATTAATGWTQ LSNQFTTGAT NTSVDVYCYR PTAGASACDD FTLLATPAAG AVGNPDFETG NLAGWNESYN AGVTTTNPHG GTYAGQINAP 
TGGNGSIEQV VTGLTPNTSY TLTGWIRTDG GATILGTKDY DADPGDDTGA TTTNTGWTQL TSQFTTGTNS TSVDIYCYRS TAGTSACDDF ALTQTPATVA NPDFETGTLD GWAASYNADI TTTNPRAGTY AGQLNAATGD NASIEQVVTG LTPNTAYTLT GWVRTDGNTT YLGAKQYDTA GSVTDANTTA TGWTQLTDQF TTDATGTSVD IYCYRDTAGT SACDDINLTK D
|
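The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It assumes the sequence is given 5' to 3' on the coding strand (as displayed), that whitespace between the 10-base blocks is not significant, and that translation table 11 uses the standard amino acid assignments with alternative start codons such as GTG read as methionine at the first position. The start codon set below is a partial list chosen for illustration, and all names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (standard code, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: partial list of start codons, enough for this record (GTG start).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence from the record.
    print(translate("GTGCGGGAGC ACCGAAACAG"))  # MREHRN
```

The translated prefix matches the first six residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field.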