Gene Information |
Locus tag | Caci_4874 |
Symbol | |
ID | 8336228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5543842 |
End bp | 5546991 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957973 |
Product | Ricin B lectin |
Protein accession | YP_003115575 |
Protein GI | 256394011 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
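The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS is unspliced and ends with a single stop codon.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(5543842, 5546991)
print(gene_len, protein_len)  # 3150 1049
```

The result matches the Gene Length (3150 bp) and Protein Length (1049 aa) fields listed in the record.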
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.524665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.633916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCGC GCGGCATCAG GTCGCGGACG GCACTGTGGC TCTCCGCTCT GGTCGCCGTC TCAGCGGCCG CATACGGGAC GCAGGCCGCT GCGACGCCGA GTGTCGCGCG TGCGGCCGTG CAGACCACCT TGTACGCCTC CCCCACTGGG TCGGGATCTT CCTGCTCCCA GACCTCGCCG TGCGCGCTGG CCGAGGCGCG CAGCGTCGTC GAGGGCATGA ACGGCTCGAT GACCGGCGAC ATCGTGGTGT ACCTGACGGG CGGCACCTAC AGGCTCACCA GCACCTTCGC TCTGGGACCG CAGGACTCCG GGACCAACGG GCACACCGTG TACTGGGAGG CTCTTCCGGG CCAGACACCG GTCTTCGACG GAGCGCAGCA GGTGACCGGG TGGTCGCAGT ACAACTCGGG CCTGAACATC TGGCGCGCGC CGGTGGCGGC GGGTACGCGG GCCGGGGACC TGTGGGTGGA CGGCGCGCGG GCCTCGGTGA CCAAGAGCGC GATCAACCCC GGCGGGTTCT CCCAGTCGGG CGCGTCGTTC ACGACCTCCG ACGGCTCGTA TCGCTCGTGG ATCGATCCCA CCGAGGCGGA GATCGTCGAT GACAACCCGT GGCGGCAGCT GCACTGCCCG CTGGCCGGCA TCACCGCCAA CGGCGGCGGC TCCAGCCTGA ACGTGAACCA GGCCTGCTAC AACGCCACGT TGAGCAACAC CGGCTTCCCG TTCAACGGCG CGGGCTACCC GACGCTGAAC CACATCACCT GGATCGAGAA CGCCTACGCG TTGCTGAACC AGCCGGGACA GTGGTTCCTC GACAGCGCCG GCGGCAGCCT GTACTACATC CCGCTGCCGG GACAAAACAT GTCCACCGCC GACGTCGAAC TGCCCGTGGT CCAGGATCTG GTCGACGTGC AGGGCACCCC GGGACACCTG ACCCCGATCA ACGACACGGC GTCAGGGATC GCCTATTCCG GATCCGGCTG GGGCTCCTCG ACCGGCCGCG GGCTCGGCGA CTTCCAGAAC GACGTGCACT CCACCCAGAC CAACGGCGAC TCGGTCAGCT ACACCTTCAC CGGCAGCGGC ATCACCGCGC TGACGGAGCT GAACAGCGAC GAAGGCAGCA TCGGGGTCTA CATCGACGGC ACGCTCAACC AGACCGTCAG CGCCGCCACC TCGGGCCAGC GAACCGCCGA GGACGCGGTG GTGGCCGTCA GCGGACTGAC GCCCGGCAGC CACACGATCA AGCTGGTCAA ACAGAGCGGG ACCTGGATGC TGCTCGACGG CTTCGTGGTG ATCCCGACCG CCGTCCAGCC GGCCCACGAC ATCACCTTCT CCGGGATCAC CTTCCAGCAC AACACCTGGC TCACCGAGCT CTCCCAGGGC TACCCGGAGA ACCAGACCGG CGTCATGTGG AGCGAGACCA ACCCGTGGAA TCAGGTCAAG GACCCCGGGA TGATCAACGT CGAGCGCGGC AACCACATCA CCTTCGCCGG CGACACCATC GCGCACACCG GCGACGCCGG CGTCGACTTC GGCAACGGGA CCCAGAACTC CACGCTCAGC ACCAGCAGGA TCCTGGACAC CGCCGTCAAC GCCGTGCAGG TCGGCGAGGT CGACGACTAC TACCTGACCG ACACGGCGCT GATGACCTCC GGGGACACCG TCACCGGCAA CTTCATCGAA CACAGCGGTG TGGTCTTCGA AAGCACCAGC GGCATCCTGG TCGGCTACAC CCGCAATGTG ACCGTCTCCC ACAACGACGT CGGCTACTCG GCCGCGCAGG GCATCTCCGT GGGCTGGGGC 
TGGGGCTACG CCTCGCCCTA CTGCTCCGGT TGCGCCCACG GCTATGACTA CGCCGGGGGG AACCAGGTCT CGTACAACTA CGTCTACGAC AACGGCCTGG GCGACGGAGC CGGCAACACG GTGATGGCCG AATGCATCTA CACCCTGGGC GGCCAGGGCG ACGGCAACGG CTCGGTGTGG TCGACCCTGA CCGGCAACGT GTGCCAGGAC CAGTACATCT ACAGCAACTG GGGCACCATC GCCCACGACG AAGGCAGCTC CTACTGGCAG GACCGCGACA ACGTCGTGCG CTGGTCGGGC CAGGACTGGA TGTACTACGA CCAGCCCACC GTCAACAACA TCACGGTCGG TCCCACGAAC TACTCCGACA ACGCCGGCTA CCGGGCCGTC GTCCCGAACA ACACCAGCTT CACCCAGGCG GCCATCGTTC CCGACGGCCA GTGGCCCGCC GGCGCGCAGG CCGTCATCAC CGCCGCCGGA CCGCCCGCGC AGGTCGCGCC CCTCACCGGA ACCCTGGACG ACACCTGCCT GTGCCTGAAC TACACCGGGT CCTCCTGGAC GTGGAGCGGC GACCGCAGAC TCGGCGACTT GGACAACGGA ATACACCAAG CCACCGGCAA CGGTGACAGC TTCTCCGTGC AGTTCACCGG CACCGGCGTC TCCTGGATCG GGGAGAAGAG CGGCAGCGAG GGCACGGCGG AGATCTACGT CGACGGTGCC GACAAGGGTT CGGTCAACGC CAACAGTTCC CCCACGCAAG CCCAGCAGAC GCTCTACAGT GTCGCCGGGC TGGCCGGCGG CACCCACACC CTCAAGGTCG TCAAGACCGG CGGCACCTAT CTCCAGGTCG ACGCCGTCAA CATCACCGGG ACGGCCGTCG TCACCGGCGG CTCGGGCGGT ACCGGCGGGG GCACGGGCAC CTATCCGACC GGCTACCACG CCCTGACCAT CGCCAGCAAT AACCTGTGCC TGGACAACTA TGGCGCTGGT TCCACCGCCG GCGCGATCAT CGACCAGTGG TCGTGCAACA GCGGGACCAA CCAGCAGTTC CAGTTCGTCC CCACCTCCGG CGGATACGGC CAGCTCCAGA TCGAGAACTC CGGCCAGGAC GTCACAGTGT CCGGCGGCTC GGCCTCCCAA GGGGTGGCGG GCATCGTGCA GCAACCGGTC AGCACGTCGA CCGCCGCCCA ATGGCTGCCC CAGCAGCAGT CCGACGGCTC CTGGCAGTTC AAGAACCTGA ACAGCGGCCT TTGCCTGGAC GTGTACGGAG CGAGCAGCAG CCAAGGCCAG CAACTCGACC AGTGGCCGTG CAAGAACGCA CCGGGGACCA ATCAGGACTT CAAGCCCTGA
|
Protein sequence | MKARGIRSRT ALWLSALVAV SAAAYGTQAA ATPSVARAAV QTTLYASPTG SGSSCSQTSP CALAEARSVV EGMNGSMTGD IVVYLTGGTY RLTSTFALGP QDSGTNGHTV YWEALPGQTP VFDGAQQVTG WSQYNSGLNI WRAPVAAGTR AGDLWVDGAR ASVTKSAINP GGFSQSGASF TTSDGSYRSW IDPTEAEIVD DNPWRQLHCP LAGITANGGG SSLNVNQACY NATLSNTGFP FNGAGYPTLN HITWIENAYA LLNQPGQWFL DSAGGSLYYI PLPGQNMSTA DVELPVVQDL VDVQGTPGHL TPINDTASGI AYSGSGWGSS TGRGLGDFQN DVHSTQTNGD SVSYTFTGSG ITALTELNSD EGSIGVYIDG TLNQTVSAAT SGQRTAEDAV VAVSGLTPGS HTIKLVKQSG TWMLLDGFVV IPTAVQPAHD ITFSGITFQH NTWLTELSQG YPENQTGVMW SETNPWNQVK DPGMINVERG NHITFAGDTI AHTGDAGVDF GNGTQNSTLS TSRILDTAVN AVQVGEVDDY YLTDTALMTS GDTVTGNFIE HSGVVFESTS GILVGYTRNV TVSHNDVGYS AAQGISVGWG WGYASPYCSG CAHGYDYAGG NQVSYNYVYD NGLGDGAGNT VMAECIYTLG GQGDGNGSVW STLTGNVCQD QYIYSNWGTI AHDEGSSYWQ DRDNVVRWSG QDWMYYDQPT VNNITVGPTN YSDNAGYRAV VPNNTSFTQA AIVPDGQWPA GAQAVITAAG PPAQVAPLTG TLDDTCLCLN YTGSSWTWSG DRRLGDLDNG IHQATGNGDS FSVQFTGTGV SWIGEKSGSE GTAEIYVDGA DKGSVNANSS PTQAQQTLYS VAGLAGGTHT LKVVKTGGTY LQVDAVNITG TAVVTGGSGG TGGGTGTYPT GYHALTIASN NLCLDNYGAG STAGAIIDQW SCNSGTNQQF QFVPTSGGYG QLQIENSGQD VTVSGGSASQ GVAGIVQQPV STSTAAQWLP QQQSDGSWQF KNLNSGLCLD VYGASSSQGQ QLDQWPCKNA PGTNQDFKP
|
| |
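The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the amino-acid assignments of translation table 11 (the same as the standard genetic code; the tables differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 30 nucleotides of the record are used here rather than the full sequence.

```python
from itertools import product

# Compact codon table: codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGAAGGCGC GCGGCATCAG GTCGCGGACG"
print(translate(prefix))  # MKARGIRSRT
```

The translated prefix agrees with the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value comparable to the GC content field, assuming that field is computed over the gene itself.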