Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4995 |
Symbol | |
ID | 8336349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5714971 |
End bp | 5718021 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958094 |
Product | Ricin B lectin |
Protein accession | YP_003115696 |
Protein GI | 256394132 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.214567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0224239 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACGC CCCGTGCTCC GTTCCGCCGG CTCGCCGCGC TCGCCGCGGT GGTCCTGGCC GCCGGAACCG TCAGTCTTAC CACCAGCTCG CCACCGGCCG CCGCGGCGGT GTCGTTCACC ACGACGGTGG TGAACCAGGC GTCCGGCGAC TGCGCCGACG ATCCGAACTC GGCCACCACC ACCGGAACGC AGCTGATCCA GTACTCCTGC AACTCCGGCA CGAACCAGAA CTGGACGTTC ACCCCGGTCT CCGGCACCGG CGCCACCTAC ACCGTCGCCA ACGGCGCCTC GGGACTGTGC GTCGACGTCT CCGGACGCTC CACCGCCGAC AACGCCCAGA TCATCCAGTG GACCTGCAAC GGCCAGACGA ACCAGCAGTT CACCCTCCAG CCGGTCGGCA ACGGCTTCAC CCTGGCCGCC GTCCACTCCG GCAAGTGCGT CGCCCCCACC GGGGACACCA CCGCCAACGA CACCGTCCTG GTCCAGTTGC CCTGCACCAC CGCCGGCACC CGGGTCTGGA ACCTGCCCGG CTACACCAGC GGCGGAAGCG GCAGCAACAC CGTGACCGTC GCCAATCCCG GGACGCAGAC CTCGAAGGTC GGCGCGGTGG TCTCCCTGCA GATGGCCGGG AGCGACTCGG CATCGGGCCA GACGCTGACC TACTCCGCGA CCGGTCTGCC GGCCGGGCTG TCGATCAGCG CGTCGGGTCT GATCTCCGGC ACCGCGAGCA CTGCCGGCAG CTCGACCGTC GCGGTCACCG CGAAGGACTC CACCGGCGCG TCCGGCAGCA CCTCCTTCGG CTGGACGGTG AACACCGCCG GCGCGGCGTA TCAGGGCGTT CCCGCGGTCG CGCCCAACGC GTGCGGGAAC TCCTCGCTGC CCAACGCCTA CGGCACCAAC TTCCCAACGC CCAGCGACCC CTACGGCCAG GGCTACTCCA ACCAGACCGC GCTGGGCTGG GACGGCAACT ACTGGCCGGT GTTCCAGTAC CTCTCAGGGT CCTTCTTCGC ACGCGGCGTG CCGACGACGT ACAACGCCAA CGGGACCACG GTCTGCGGCG CCATGTACTC CTTCTCGATC TACAACCACA ACGGCAACCG TCCGGCGCAG TCGGTGACCT GGACCCAGGA GTCCGGATAC CTGCCGGCGA TGACGACCTC GTTCAGCACC GGCTCGATGG CCGTGGCCAT CAAGGAGTTC GAGGACAAGG CCACCATCGG CGGCCACCCG GTCGGCGTGG TGTACGCGCG GGTGTCGGTG ACGAACAACG GGTCCGCAGC GCTCACCCAG GACCCCGGCG CGTCCGGGCC CAACCTGGTG CGGCTGACGA GCACCTCGCT GAACACGGTC CAGCCCGGGG CGACCAGCAA CCACGACTAC ATCGTCGCCG TCGACAACTT CGGCTCCGGC GCGGCGCTTC CCACCGGGAG CACGCTGTCG TCCGGCGCCC CGGACTTCAC CACCGCCGAC AGCCAGATGG CGGCCTACTG GAACGGCCGG ATCGGCGAGA CCGCGTCCTT CTCGCTGCCC GACGTCGCGC TCCCGAACAC CGGCGGTCTG GCCAACCCCG GGACCGCGAT GACCAACGCC TACAAGGCCG GCACCGTCTA CATCCTGATG ATGCAGATCG GCGAGGCGCA GTTCTCCGCG GCGAACAACT ACTTCTGGCT GCTGAACCAC GACGTCCCCG GCGAGCTCAA CGCCCGGCTG GAGACCGGTG ACTTCCACGA CGCGCAGAAC CTGCTGCTGA CCGCGCGGAT ATCGCAGGCC ACGAACTTCG ACGAGCACGG CGCGAACTGG TACTTCGACG GCGACTGGAA GACGCCCTCG ACCTGGGCCT ACTACCTGGC CAAGACCAAC GACACGGCGT TCGTCTCGCA GTACTTCCAC GACGACGCCG GCGGCTCCAG CCCGTGGGGT CCGAGCCTGT ACACGATCAT GCACGGCATC TACCAGGGCC AGCTCGCCTC CGACGGGGCG CTGGCGACCA GCTTCGACAA CGATTCCTCG GGCCGCTGGC TGTTCGACGA CTACTCGGCG CTGGAGGGCC TGGCGGCGTA CAAGTACATT GCGACGCGCA TCGGCAACAC GGCAGAGGCG CAGTACGCCG ACGGCGCGTA CACCAAGCTG CTGAACGCGA CGAACACCCT GCTGTCCCGC AACGAGTCGG CCAACGGGTT CTCCTACCTG CCGTGCACGG TCGACCAGCC GAATTCGGCC AACCGCTGCA ACACCTACAA CGACGCCAAC TGGGCCTCGC CGGTGTGGGT CGGACAGAAT CAGTGGTCGA CGATGCTCAT GGGCGGCACG CTGTCCGGTA TCGCGGGCGA CCCGGCGCAG GGCGACGCGA TGTACAAGTG GGGCTTCGCG CGTCTGTCCG CGAACGGTCT GCCGTATCCG ACCTTCGGCG CGTTCAACGG CTACTCGACC GCGTACAACA CCGCCTACGC CAGCGACGGC TTGTACGGCA CGTCCTACCG CGACCTGCCG ATCACCAGCT ACGCCTGGCA GATCGCGACC ACCACCGGCG GCCCGAACGC CTGGTGGGAG GCCAACGGTA CGGGCCCGGA CAGCGGCAAC CCCTGGATCG GCAACCATGC CGGCCCGGAG TTCGGCGCCT GCCCGTACGC CTGGCCGATC TCCGCTCAGC AGGAGGGACT GCTGCAGTCG ATCGCGGCCG AGGGCTTGTC CTCGACCGGT TCTGGCCCGT ACACCTACAC GAGGCCGCTG ATCATCGGCC GCGGCGTCCC GAACGCCTGG ATCGCCAACG GCCAGAAGGT TTCGGCGTCG AACCTGACCA CCTCCTATGA CACGAACAGC GGTGCCCGGA GCACGTACGG CGTGTCGATC TCGACCTCGA CGTCCAGCGG CACGCGCGTG GTCACGGTGA CGCTGTCCGG AACGCTGCCG GCCGGCGCGG TGTCGGTGCA GCTGCCGGTC TTCACCTCGG TGGGGGTCAC GGCGGTCAGC GGCGGGTCGT ACAACTCCTC GACGCACTCG GTGAGTGTGA ACTCCGGAAC GACCACAGTG GCCATCACCC TGGCCGGCTG A
|
Protein sequence | MRTPRAPFRR LAALAAVVLA AGTVSLTTSS PPAAAAVSFT TTVVNQASGD CADDPNSATT TGTQLIQYSC NSGTNQNWTF TPVSGTGATY TVANGASGLC VDVSGRSTAD NAQIIQWTCN GQTNQQFTLQ PVGNGFTLAA VHSGKCVAPT GDTTANDTVL VQLPCTTAGT RVWNLPGYTS GGSGSNTVTV ANPGTQTSKV GAVVSLQMAG SDSASGQTLT YSATGLPAGL SISASGLISG TASTAGSSTV AVTAKDSTGA SGSTSFGWTV NTAGAAYQGV PAVAPNACGN SSLPNAYGTN FPTPSDPYGQ GYSNQTALGW DGNYWPVFQY LSGSFFARGV PTTYNANGTT VCGAMYSFSI YNHNGNRPAQ SVTWTQESGY LPAMTTSFST GSMAVAIKEF EDKATIGGHP VGVVYARVSV TNNGSAALTQ DPGASGPNLV RLTSTSLNTV QPGATSNHDY IVAVDNFGSG AALPTGSTLS SGAPDFTTAD SQMAAYWNGR IGETASFSLP DVALPNTGGL ANPGTAMTNA YKAGTVYILM MQIGEAQFSA ANNYFWLLNH DVPGELNARL ETGDFHDAQN LLLTARISQA TNFDEHGANW YFDGDWKTPS TWAYYLAKTN DTAFVSQYFH DDAGGSSPWG PSLYTIMHGI YQGQLASDGA LATSFDNDSS GRWLFDDYSA LEGLAAYKYI ATRIGNTAEA QYADGAYTKL LNATNTLLSR NESANGFSYL PCTVDQPNSA NRCNTYNDAN WASPVWVGQN QWSTMLMGGT LSGIAGDPAQ GDAMYKWGFA RLSANGLPYP TFGAFNGYST AYNTAYASDG LYGTSYRDLP ITSYAWQIAT TTGGPNAWWE ANGTGPDSGN PWIGNHAGPE FGACPYAWPI SAQQEGLLQS IAAEGLSSTG SGPYTYTRPL IIGRGVPNAW IANGQKVSAS NLTTSYDTNS GARSTYGVSI STSTSSGTRV VTVTLSGTLP AGAVSVQLPV FTSVGVTAVS GGSYNSSTHS VSVNSGTTTV AITLAG
|
| |