Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3803 |
Symbol | |
ID | 8335156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4300030 |
End bp | 4302861 |
Gene Length | 2832 bp |
Protein Length | 943 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644956942 |
Product | Ricin B lectin |
Protein accession | YP_003114545 |
Protein GI | 256392981 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.307116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACTG CGTCACTCCC CGGTGATTCG TCCGGGCCCG GGTCGGGGGT GGATCGCCGC TCGCTGCTGA AGCTCGGCGG GGTGCTCGGC GCCTCCCTCG CGATCAGCGG GCTGCCGGTC TTCACCTCGA AGGCGCGGGC CGTGACCCGG CCCGGCGCGC TGAGGCTGGT CCCGCCGGCC ACCGCCACCC GGCTCTGGTA CACCTCGCCG GGCAGCGCGG CGAACATCAT GCAGGAGGGA CTGGCGGTCG GGAACGGGCG GCTTGGGGCG ATGGTGACCG GCGACCCGGC GTCGGACGCG CTTTACCTGA CCGACATCAC GCTGTGGACC GGGGGCGCCA ACGCCAGCCT CGGCAGCGAC GGCCAGTTTC CGTACGGCGT TGACAACTTC GGTTCGTACC AGTCGCTGGC CCAGGCGCAC GTCTCCGTCC CGGCGCACGC GCTGTCGGCC GTCTCCGGCT ATCAGCGGGT GCTGGACCTC AGCAACGGGT ATGTGTCAGC GTCCTACCAG TACAACGGCG TCACGTACAC GCGTGAGGTC TACTCCAGCA ACCCGGACGA CGTGCTCATC GTGCGGCTCA AGCAGAGCGG CGGCGGCAGC TACACCGGCA GCGTCTCGCT CAGCGGCACG CACGGCGAGA CCACCAGCGC CGATCCGGCA GGTCTCACCG CCTCGTTCAG CGGCACGCTC GCCAACGGCC TGAAGTACGC GTGCGCGGTC GCGGTGACCG CCACCGGCGG CCGGGTCGCG GTCTCCGGCA GCAGTGTCAC CTTCACGTCG TGCAGCGAGG TGATCCTCGT GTTCTCCGGC GGGACGAACT ACAAGCCGGA CCGCACTGTC GGCTACAAGG ACAGCACCCT GATCCCGCTG TCCGTCGCCG TCGCCAAGGC CCACGCCGTC GCCGGCGTCA GCGGCGACTC GCTGCTGGCC ACCCATGTCG CGGACTACCA GGCGCTGTAC AACGCCACGA CCGTGAACCT CGGTACCTCC AGCACCGCGC AGCGCGCCAT GGACACCCCC TCTCGGCTCA CGGCGCGCGC TGCGTCCGGT GCGGCGCCGG ACCCGGAGCT GGAGGCGTCC TACCTGCAGT TCGGCCGGTA CCTGGCCATC ACCGGCTCAC GGGGGTCGCT GCCGACGAAC CTGCAGGGAC TGTGGCTGGA CAACAACAAT CCGGCGTGGA TGAGCGACTA CCACACCGAC ATCAATGTCC AGATGAACTA TTGGCTGCCC GACCGGGCCG GGCTGGGATC CTGTTTCGAC GCGTTCGCGA ACTACTGCGT CGCGCAGCTG CCGGGGTGGA CGGCCACGAC CCAGAGCCTG TTCCAGAGCT CGACCAACGG GTTCCGGAAC AGCAGCGGCA AGGTGGCCGG CTGGACCGTC GCGATCAGCA CCAACACCTG GGGCGGCGGC GGATGGTGGT GGCACCCGGC CGGCAACGCC TGGCTGGCCA ACGCCCTGTA CAGCCACTAC GAGTTCACGC TGGACGCCGG CTACCTGCAG CGGATCTACC CGCTGCTCAA GGGCGCCTGC CAGTTCTGGC AGGCCCGGCT CGTCACGGAT CCCGGCACCG GCAAGCTGAT CGACGACGCC GACTGGTCGC CCGAGCACGG GCCGACGAAC GCCAAGGGCA TCACCTACGC CCAAGAACTG GTGTGGCAGC TCTTCCAGAA CTACACCGCG GCGGCGGCGA AGCTGAACCA GGACGCCGCC TACGCCGCCA CCATCGCCGG CCTCCAGGCG AACCTCTACC TGCCGCAGGT CAGCCCCACC ACCGGCTGGC TCGAGGAGTG GATGACGCCG GACAACCTCG ACACCTCCGA CCTCACCCAC CGTCACCTGT CGCCGTTGGT CGGCCTGTTC CCCGGCGACC GCGTCACCGC CGACCAGAGC CCGGCGGCGC TGCTCACCGG CGTCACCAAC CTGCTGACCG CGCGGGGCAT GAACTCGTTC GGCTGGGGCA TGGCCTGGCG CGCGCTGTGC TGGGCACGGC TGAAGAACGC GGGCATGGCC TATCAGGCGG TGACGACCGT GCTGCGGCCC TCGGTGAACT TCAGCAACGG GGCGGCGATC AACCTCTTCG ACATGTACAG CTTCGGCAGC AGCTCGGTGT TCCAGATCGA CGCCAACTTC GGCACCCCCA GCGCGATGAT CGAGATGCTG GTCTACCACC GCCCGGGCCT GGTGGAGCTG CTCCCCGCGC TCCCGGACGC CTGGTCGGTC GCCGGCAGCG TCACCGGCGT CCCGGTCCGC GGCGCGATGG CGCTGGACAT GGCATGGTCC GGCGGGCAGG TGACCACGGC GACCCTGCAC GGCACGCCCG GCGCCGGCAC CACCGTGAAG TTCGGCGCCT GGTCGCAGGC GGTGACCATC GGCAGCGGCG GCACCGTCAC GGTCGTCCCG CCTCCGCGCG CGACGGTCTT CAACCTGGTC AATCGCCGCA GCGGCAAGGC GATCGACGTT CCCGGATCGT CGACGACCGC CGGCACCGCC CTGATCCAGT ACACGCTGCA CAATTCGCCG AACCAGCAGT GGAAGTTCGC CCCGGCCGCG ACCGGCTACA CGGTCACGAA CATCAACTCC GGCATGGTCG CCGACGTCAA CGGCGGAAGC ACCGCCGATG GCACCGCCAT CGTCCAGTGG CCCGCCAACT CGGGCACGAA CCAGGAGTGG ACGCTCGCCG ACGCTGGCAA CGGCTACGTC AAGCTGGTGT GCGTCCGCAG CGGCAAGGTG CTGGGGGTCA GCCAGGACTC GACGTCGGAC CTGGCCGGCA TCACCCAGCA GACCGACACC GGCGACATCA GCCAGCACTG GCAGCGCATC GCAGTGCGCT GA
|
Protein sequence | MSTASLPGDS SGPGSGVDRR SLLKLGGVLG ASLAISGLPV FTSKARAVTR PGALRLVPPA TATRLWYTSP GSAANIMQEG LAVGNGRLGA MVTGDPASDA LYLTDITLWT GGANASLGSD GQFPYGVDNF GSYQSLAQAH VSVPAHALSA VSGYQRVLDL SNGYVSASYQ YNGVTYTREV YSSNPDDVLI VRLKQSGGGS YTGSVSLSGT HGETTSADPA GLTASFSGTL ANGLKYACAV AVTATGGRVA VSGSSVTFTS CSEVILVFSG GTNYKPDRTV GYKDSTLIPL SVAVAKAHAV AGVSGDSLLA THVADYQALY NATTVNLGTS STAQRAMDTP SRLTARAASG AAPDPELEAS YLQFGRYLAI TGSRGSLPTN LQGLWLDNNN PAWMSDYHTD INVQMNYWLP DRAGLGSCFD AFANYCVAQL PGWTATTQSL FQSSTNGFRN SSGKVAGWTV AISTNTWGGG GWWWHPAGNA WLANALYSHY EFTLDAGYLQ RIYPLLKGAC QFWQARLVTD PGTGKLIDDA DWSPEHGPTN AKGITYAQEL VWQLFQNYTA AAAKLNQDAA YAATIAGLQA NLYLPQVSPT TGWLEEWMTP DNLDTSDLTH RHLSPLVGLF PGDRVTADQS PAALLTGVTN LLTARGMNSF GWGMAWRALC WARLKNAGMA YQAVTTVLRP SVNFSNGAAI NLFDMYSFGS SSVFQIDANF GTPSAMIEML VYHRPGLVEL LPALPDAWSV AGSVTGVPVR GAMALDMAWS GGQVTTATLH GTPGAGTTVK FGAWSQAVTI GSGGTVTVVP PPRATVFNLV NRRSGKAIDV PGSSTTAGTA LIQYTLHNSP NQQWKFAPAA TGYTVTNINS GMVADVNGGS TADGTAIVQW PANSGTNQEW TLADAGNGYV KLVCVRSGKV LGVSQDSTSD LAGITQQTDT GDISQHWQRI AVR
|
| |