Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2737 |
Symbol | |
ID | 8334086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3135667 |
End bp | 3139113 |
Gene Length | 3447 bp |
Protein Length | 1148 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644955886 |
Product | Ricin B lectin |
Protein accession | YP_003113492 |
Protein GI | 256391928 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.285354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.294912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGCTC GCCATAGGTT GTCGCGGTTT ATCGCCGCGG CGGCCGTATT TTCTCTCGCC CCGCCTCTGG TGGTGGCGAT CGGTTCTGGG ACGCCCGCGC TCGCGGCGAC CACGACCACC GCTTGGCAGA ATGGTTCCTT CGCACAGAAC GTGAGCGGGA TCGTCTCACG GTCCAATGTG GTGATCGGTA AGGCGAACAC CGCGGCGACG CAGTTCCTGC CGCTGGGCAA CGGCTCGCTG GGTGTGGCCG AGTGGGCGGC GAACGGATTC ACCGCGCAGC TCAACCGCAG CGACACGATG CCGAACCGGC TCTCGCCGGG GCAGGTCGAC ATCCCCGGCC TGTCGGCGAT GACCTCGGCG TCGAACTTCG TGGGCTACCT CGACGTGTAC AACGGGGTGC TGCACGAGTC TGGCGGCGGG ATGAGCCTGA CCGCGTGGGT GCCGGCGGGC AAGGACGAAC TGGTCGTCGA CGTGACCGGT GCGAATCCGG GGACGCGGCA GACAGCGAGC GTGAACCTGT GGAGCGGTCG CGGGCCGACC GCGTCGGCGT CCGGGTCCAT CGCCAGCCTG GCGCAGACCT GGGTGGACAA TTCCCAGACC GGATACTCGG GCAAGACCTT CGGCGCGATG GCCGCGATCA CCGCCGGCGG CTCCAACGTG ACCGCCTCGA CGTCCGGCTC GACGCAGGCG CTGGTCTCGT TCAACCCGAA CAGCGACGGC ACGTACCGCG TGATCGTCGC CTCGCCGTCA TGGGCCGGCG GCAACGCGAA CAGCACCGCG TCCTCCCTGA TCGGCAGCGA CACCGGCGCG AGCGAGGCCT CGCTGCTGGC GACGCAGTCC GCGTGGTGGA ACACCTACTG GGCCAACAGC GGCCTGATCG AGGCGAACTC CTCGGACGGC ACCGCGCAGT ACATGGAGAA TCTGCACACG CTGTACCTGT ACTTCGAGGC CGGGACCATG CACTCGGGCC AGTATCCGGG CAGCCAGGCG GGCCTGGCCG ACCTGTTCAA CTTCAACCAG GACCACCAGG CCTGGTATCC GGCCGGGTAC TGGTTGTGGA ACCTGCGCGG CCAGATCCAG GCGAACCTGG ACTCCGGCGA GTTCGCGCAG AACATTCCGA TCTTCGACAT GTATCTCAAC GACCTGCCGG CGATCCAGTC GTGGACCGGC GCGCAGATGA ACGGCAAGCC GGGCGCGTGT GTGCCGGAGA CGATGCGGTT CAACGGCAAC GGCTATTACT GGGGCGGCAG CATCACCAAC GACGCCTCGT GCGCGGTGGC CTCCAGCCCT GGCTTCAACG CCGAGACGAT CACCAGCGGC GCGGAGATCG CGCTGTGGGT GTGGCAGCAG TACCAGGACA CGGGGGATGT CAACTTCCTG CAGAAGTACT ATCCGCTGCT GCAGCAGACC TCGACGTTCC TGCTCGCCTG GCAGTCGGTC GGCTCGGACG GCTACCTGCA CGCGGTCGCG AACGCGCACG AGACGCAGTG GCAGGTGCAG GACCCGACCA CCGACATCGC CGCCGACCAG GCGCTGTTCA CCGCCACGGT GAACGCCGCG ACGCGGCTGA ACACGGACTC CTCGCTGGTC TCCCAGCTGC GTGGCGCGCT GACGCACATT CAGCCCTACG CGCGGACTGA CGAGAACAGC CACAGCCAGT TGCTCGGCCC GTCCGCGGAC TCCTCGGGCA CGGACGTGAT CGGCACCTCC TACCAGCCGA CCGCGGCGAC GCACAACGTG GAGAACCTCG GCCTGGAACC GGTGTGGCCC TTCGGCGTGA TCAGCGACAA CACGGTCGTC AACGGCGACA ACCTGACCGC ACTGGCGGAC CGCACCTACC AGCACCGGCA GAACGTCAAC AACCCGGACT GGACCTACGA CTCGATCCAG GCCGCGCGCC TGGACATGTC CTCCGAGGTG GCCAACGACC TGGTGGCCAG CACCAAGAGC TACCAGGTCT ACCCCTCCGG TCTCGCCGCG TGGAACCCGG GTTCGGTGGA CGAGCCCTAC ATCGAGCAGA TCTCGAACGT CGCCGCGACG CTGGACGAGG CGTTCGCGAC CGACTACGAC GGCACGGTGC GCTTCGCTCC GGCGTGGCCT TCGGGCTGGG ACGGTTCGGG CCGCGTGTAC ATCCAGGGCG GCTCGAAGGT CGACGTCCAG GTCGAGGGCG GCGTGCTGGC CACCGCCGCG ATCGAGGCCG GGTCGAGCGG GACCATGAGT GTGCGCAATC CCTGGTCCGG CCAGCAGGCG CAGGTGGTCA ACGGCTCGAC CGGCGCCGTC GTGGTCGCGG CCACCAATGC CGCGACGCTG AGCGTCCCGG TCACCGCCGG CCAGAGCTAC CTGGTCGAGC AGCCGGCCAC CCCGACGACC TCGCTGCCGT TCGCCCAGGT GACCGGAACC GCGGCGCGCG GGTTCCGGCA GTTGGGCAGC GTGAGCATCG GCCTGGGCGG CAACACCCTG CCCGCCGGCA ACACGGTGAC CGTCACCAGC CCCGGCAGCC AGTCCGGCAC CGTGGGCACG GCGATCAGTG CCCTGCAGAT CCACGCGACT GACTCCGCCT CGGGCCAGAC GCTGTCCTAC AGCGCCGCCG GCTTGCCGCC TGGGCTGTCG ATCAGCTCTT CGGGTCTGGT CAGCGGGACG CCGAGCGCGT CCGGGACCTT CACCGTCACC GTCACGGCGA CCGACTCCAC CGGCGCGTCC GGAGCGGCGT CGTTCACCTG GACGGTCGGC GGCGGCAGCG GGAACGTGGT GTCGGTGACG AATCCGGGCA GCCAGTCCGG GACAGTCGGT ACGGCGATCA GCGGTCTACA GATTCAGGGC ACTGATTCGG CGGGCCAGAC GCTGACGTAC ACGGCCGGTG GTCTTCCCAC CGGGCTGTCG ATCTCCTCGT CCGGTCTGAT CAGCGGGACG CCGAGCGCGT CCGGGACCTT CACCGTCACC GTCACGGCGA CCGACTCCAC CGGCGCGTCC GGGGCGGCGT CGTTCACCTG GACGATCAGT GGTGGCACCA CCGGATTCCC GGGTGGTTAT CACAGCCTGG TCGTGGCGAA GAGCAGCCTG TGCCTGGACG TGTTCGGCAA CACCAGCACC GCCGGTGCGG CCATCGACCA GTACACCTGC AACAGCCAGA GCAACCAGCA GTTCCAGTTC CTCCCGATCG CCAACGGCTA CGGTGAACTC CAAGCCCAGA ACTCCGGGCA AGACGTGACC GTCGCCAACA GCTCCACCGC CCAAGGCACC CCAGACATCG TCCAGCAGCC GGTCAACGGC GCTGCGGCAA GCCTGTGGCT GCCCCAGCAG CAGTCCGACG GCTCCTGGCA GTTCAAGAAC CAGAACAGCG GACTGTGCCT GGACGTCTAC GGCAACGGAA GCACCACCGG CCAGCAACTC GACCAATGGC CGTGCAAGAA CGCACCCGGA ACCAACCAGG ACTTCAACCC CCGCTGA
|
Protein sequence | MSARHRLSRF IAAAAVFSLA PPLVVAIGSG TPALAATTTT AWQNGSFAQN VSGIVSRSNV VIGKANTAAT QFLPLGNGSL GVAEWAANGF TAQLNRSDTM PNRLSPGQVD IPGLSAMTSA SNFVGYLDVY NGVLHESGGG MSLTAWVPAG KDELVVDVTG ANPGTRQTAS VNLWSGRGPT ASASGSIASL AQTWVDNSQT GYSGKTFGAM AAITAGGSNV TASTSGSTQA LVSFNPNSDG TYRVIVASPS WAGGNANSTA SSLIGSDTGA SEASLLATQS AWWNTYWANS GLIEANSSDG TAQYMENLHT LYLYFEAGTM HSGQYPGSQA GLADLFNFNQ DHQAWYPAGY WLWNLRGQIQ ANLDSGEFAQ NIPIFDMYLN DLPAIQSWTG AQMNGKPGAC VPETMRFNGN GYYWGGSITN DASCAVASSP GFNAETITSG AEIALWVWQQ YQDTGDVNFL QKYYPLLQQT STFLLAWQSV GSDGYLHAVA NAHETQWQVQ DPTTDIAADQ ALFTATVNAA TRLNTDSSLV SQLRGALTHI QPYARTDENS HSQLLGPSAD SSGTDVIGTS YQPTAATHNV ENLGLEPVWP FGVISDNTVV NGDNLTALAD RTYQHRQNVN NPDWTYDSIQ AARLDMSSEV ANDLVASTKS YQVYPSGLAA WNPGSVDEPY IEQISNVAAT LDEAFATDYD GTVRFAPAWP SGWDGSGRVY IQGGSKVDVQ VEGGVLATAA IEAGSSGTMS VRNPWSGQQA QVVNGSTGAV VVAATNAATL SVPVTAGQSY LVEQPATPTT SLPFAQVTGT AARGFRQLGS VSIGLGGNTL PAGNTVTVTS PGSQSGTVGT AISALQIHAT DSASGQTLSY SAAGLPPGLS ISSSGLVSGT PSASGTFTVT VTATDSTGAS GAASFTWTVG GGSGNVVSVT NPGSQSGTVG TAISGLQIQG TDSAGQTLTY TAGGLPTGLS ISSSGLISGT PSASGTFTVT VTATDSTGAS GAASFTWTIS GGTTGFPGGY HSLVVAKSSL CLDVFGNTST AGAAIDQYTC NSQSNQQFQF LPIANGYGEL QAQNSGQDVT VANSSTAQGT PDIVQQPVNG AAASLWLPQQ QSDGSWQFKN QNSGLCLDVY GNGSTTGQQL DQWPCKNAPG TNQDFNPR
|
| |