Gene Information |
Locus tag | Caci_6907 |
Symbol | |
ID | 8338273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7978271 |
End bp | 7979680 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644959994 |
Product | Ricin B lectin |
Protein accession | YP_003117585 |
Protein GI | 256396021 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00437458 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000226989 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGATCG CGCTAGCCTT CGCCCTGTTG ATCGGGTCGG TTGGCCTGAC CTCCATGGCC GCTTCCCCCG CCTACGCCAC CACGTCGCAG TTCCATGGGG TGAACTGGGC CGATCCGGAC GACAACTTCA TCACCGGCCC GAACATCCCA GTCGGGCTGT CGGCGTCGGA CAACTACGCG ACCGTGTACG CGAAGTCCAC GGCGATCCTC AAGGGCTTCC AGGCCATCGG GGCCAACACG GTGCGCATGG GGATCAACGC CGCGACGACC TCCGGATCAT GGTGGAACAG CTACACGGCG GCCCTGGACG CCGCAAGCGA CTTGGGTATG AACGTCGTCA TCGCTCCGTG GCTCCAGAAC GGCAAGGTCA GCGACACGGC GTCGTTCTAC GCCATGTGGA ACACGGTCAT CAACAAGTAC GGGTCGAAGA GCAACTTCTA CTTCGACATC ATGAACGAGC CCTACGCGTA CAGCGCCACC GACCTGACCA ACTTCGAGGC TGACTGGCTT GCGCACTACC CCAATCTGCC GCGGGGCCGC GTGATAGTCC CGGGCACTTG GGACGACGAG TCCCTGTGTG CTGAAGGAGC TGATTCCCGA CTCGCGGGGA CCTTGCTGTC CATCCACATC TACGGCATGT TCGGGAACTC GCACACGACC GAGGCTGCTT GGGTCACGAA CTTCGAGAAC AACATGTGCG GCTACGCGAG TCGCGCTGTG CTGACGGAGT TCGGAGTGGC GATGAACACC GGCGCCTATT ACGACGGTGC CAGAGACGGC AACAATGATG TCTCCTACCT GTACGGGATC ACCGATACGG TCCGCAAGCT GGGCATAGGA TCCATCCTCT GGACGGGCGT GAAGCAGGCC GACCAGAGCG TGGGTCCAGG GCCGTGCGAG AACGCATCCT GCGCGATCAC CTCGGTCAGC GGGAGCGGAA CGAACCTCTC GCTGAGCGTG ACCAACCGGT CTGGTCTCGA CAGGATCCAG TACGGATGGG GAGGCGGAAA CTCCGGCGCC GTCAGTACCG CGGTGCTTCG AGCAACCGCA TCCAACCGGT GCCTGGATGT GCCGAACGCC TCCACCACCA ACGGCACTCA AACTGAGATC TGGGACTGCA ACGGCGGCAG CAACCAGTCC TGGACGTTGA ATTCTGCCAA GTCCCTTGTC GTCTACGGCA ACAAGTGCCT CGATGCCGCC AACGCTGGTA CTGCGCCCGG CACACCGGTC ATCATCTGGG ACTGCAACGG CGGCACGAAC CAACAGTGGA CGATGAATAG CAACGGCACT GTCACCGCTG TTCAGTCCGG TCTCTGCCTC GACGTCACCA AGGGCGGAAC CTCTAACGGC ACAGCGATCG AGCTGTGGAC CTGCAACGGC GGCAGCAATC AGAAGTGGGC GCGCCAGTAA
Protein sequence | MTIALAFALL IGSVGLTSMA ASPAYATTSQ FHGVNWADPD DNFITGPNIP VGLSASDNYA TVYAKSTAIL KGFQAIGANT VRMGINAATT SGSWWNSYTA ALDAASDLGM NVVIAPWLQN GKVSDTASFY AMWNTVINKY GSKSNFYFDI MNEPYAYSAT DLTNFEADWL AHYPNLPRGR VIVPGTWDDE SLCAEGADSR LAGTLLSIHI YGMFGNSHTT EAAWVTNFEN NMCGYASRAV LTEFGVAMNT GAYYDGARDG NNDVSYLYGI TDTVRKLGIG SILWTGVKQA DQSVGPGPCE NASCAITSVS GSGTNLSLSV TNRSGLDRIQ YGWGGGNSGA VSTAVLRATA SNRCLDVPNA STTNGTQTEI WDCNGGSNQS WTLNSAKSLV VYGNKCLDAA NAGTAPGTPV IIWDCNGGTN QQWTMNSNGT VTAVQSGLCL DVTKGGTSNG TAIELWTCNG GSNQKWARQ
| |
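The coordinate and length fields in this record are related by simple arithmetic, and a short check can confirm they agree. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). Both assumptions are consistent with the values shown, but the record does not state them.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring spaces and any characters other than A, C, G, T."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the Caci_6907 record above.
    length = gene_length(7978271, 7979680)
    print(length)                  # 1410
    print(protein_length(length))  # 469
```

`gc_percent` can be applied to the gene sequence field as displayed, since it skips the spaces between the 10-base blocks; its result can then be compared against the GC content field.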