Gene Information |
Locus tag | Caci_3742 |
Symbol | |
ID | 8335095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4223659 |
End bp | 4224783 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644956882 |
Product | Ricin B lectin |
Protein accession | YP_003114485 |
Protein GI | 256392921 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2755] Lysophospholipase L1 and related esterases |
TIGRFAM ID | |
| |
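The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not part of the record: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS (the record says "Is gene spliced | No"), and a final codon that is a stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and that the last codon
    is a stop codon, so it does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(4223659, 4224783))  # (1125, 374)
```

The result matches the Gene Length (1125 bp) and Protein Length (374 aa) fields. The strand does not affect this calculation, since only the span of the feature matters.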
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00367387 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.957672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTAGAC AAAAGCGGTG GTTCAACGCG ATGGTGTCGG CCGGCGCCGC GCTGGCCATG ACCATCGGGC TGGTGGGGAT CGCCGGAACC CAGAGCGCGG CCGCGGAATC CAACGGCGGC GTCAAAGTGA TGCCGCTGGG CGACTCGATC ACCGACGGGA TCACGGTGTC GGGCGCCTAT CGCACGGGCC TGTGGCAGCG CTTCGTCGCC GGCGGCTACA AGGTCGACTT CGTGGGCTCG CTGTCCGGCG GCCCGGCGGC CCTGGGAGAC CACGACCACG AAGGCCACTC CGGCTGGCGC ATCGACCAGA TCGACGCGAA CATCACCGGC TGGCTCCAGA CCTACCAGCC GCACACCGTC CTGCTGCACA TCGGCACCAA CGACATCCTG CAGAACGACG ATGTCTCCAA CACCCCCAAC CGGCTCTCCG GCCTCATCGA CCACATCACC GCCGCCGACC CGGGCGCCGA GGTGTTCGTC GCCCAGATCA TCCCGCTGGC CAACTCCGGG CAGAACGCCC AGGTGCGCAC CTACAACGCG GCGATCCCCG GCATCGTGTC GAGCAAGGTC TCCGCCGGCA AGCACGTGCA CCTGGTCGAC ATGTACGACG CGTTGACCAC GTCCGACCTG GCCGACGGCG TCCACCCCAC CGCCGCCGGC TACGACAAGA TGGCCGCGGT CTGGTACTCG GCGCTCCTGT CGGTGTCCGG CAGCATCGGC CAGCCCGGCA GCACCGGCGG TGGCGGCGCC ATCGTCGGAA CCGCCTCCGG CCGCTGCGCG GACGTCCCCA ACTCCACCCA GACCCTCGGC ACGCAGGTCC AGCTGTGGGA CTGCAGCGGC GCGACGAACC AGCAGTGGAC CGCCACCTCC GCCGGCGAGC TGCGCGTCTA CAGCGGCGAC TGCCTCGACG CCTACGGCAA GGGCACCACT CCCGGCACCA AGGTCGCGAT CTGGTCCTGC AACGGCCAGA CCAACCAGCA ATGGCGTCTG AACGCCGATG GAACCATCAC CGGCGTCCAA TCAGGACTCT GTCTCGACGC CACGGGAGCC GCCACCGCCA ACGGCACCCT GCTGGAACTG TGGACGTGCA ACGGCCAGAG CAACCAGCAA TGGACGCGGA GGTAG
|
Protein sequence | MRRQKRWFNA MVSAGAALAM TIGLVGIAGT QSAAAESNGG VKVMPLGDSI TDGITVSGAY RTGLWQRFVA GGYKVDFVGS LSGGPAALGD HDHEGHSGWR IDQIDANITG WLQTYQPHTV LLHIGTNDIL QNDDVSNTPN RLSGLIDHIT AADPGAEVFV AQIIPLANSG QNAQVRTYNA AIPGIVSSKV SAGKHVHLVD MYDALTTSDL ADGVHPTAAG YDKMAAVWYS ALLSVSGSIG QPGSTGGGGA IVGTASGRCA DVPNSTQTLG TQVQLWDCSG ATNQQWTATS AGELRVYSGD CLDAYGKGTT PGTKVAIWSC NGQTNQQWRL NADGTITGVQ SGLCLDATGA ATANGTLLEL WTCNGQSNQQ WTRR
|
| |
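The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is only an illustration under stated assumptions: the helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, the codon table is the standard amino acid assignment that NCBI translation table 11 shares with table 1, and the start codon set is the one listed for table 11. An alternative start codon such as GTG is rendered as M, which is why the protein begins with M even though the gene begins with GTG.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11 (assumption stated in the lead-in).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS given in coding orientation, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate_cds("GTGCGTAGAC AAAAGCGGTG GTTCAACGCG"))  # MRRQKRWFNA
print(translate_cds("TGGACGCGGA GGTAG"))  # WTRR
```

The two fragments reproduce the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.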