Gene Information |
Locus tag | Caci_3804 |
Symbol | |
ID | 8335157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4303239 |
End bp | 4304732 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956943 |
Product | Ricin B lectin |
Protein accession | YP_003114546 |
Protein GI | 256392982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
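The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, although both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4303239
END_BP = 4304732
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the CDS includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the table: the span covers 1494 bp, which is 498 codons, or 497 residues plus one stop codon.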
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.404288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.817018 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATCGT CGTCCGAGCA CAACCCAGAT CGCAAAAGCC TGGCCGCCAA GCTCCGAATT CTCGCGGCGG GCGCGCTCGC CGCCACGAGT CTGGTGGCGG CCGGGCAGAT CCCGGCGCAC GCGGCCACCG CCGCGGCCGC TTCCACCTCC CAGTTCAAAG GCGTGAACTG GGCCGACCAG CGCGACAACT TCGTCAACGG CGTCCTGTAC CCCTCCGGCC TCAACGCCTC CGACACCCAC GCCTCCGCCT CGACGGTCGC GGCCCAGGTC GTGGGCCAGC TCGACACGAT CACCGGCGCG AACACCGTCC GGATGCCGAT CAACGAGCCG ACCGTCTCGA CCTACTGGAG CACCTACACC GGCGCGATCG ACGCGGCGCT CACCAAGGGC AAGGTGATCC TCGCCTACTG GGCCTACAGC GGCGGGAAGC CCACCAGCAC GGCGGCGTTC AACCAGATGT GGGACACCGT CGTCGCCTCG TACGGGAGCA ACCCGAACGT GTACTTCGAG GTCATCAACG AGCCTTACGG CTACAGCTCC ACGGACTTGA ACAACTTCTA CAACACCTGG CTGACCAGGT ATCCCGCCGT CCCGCGCGGT CAGGTCATCC TCGACGGTAC GGGCGACGCC ACGAACATCG CGGGAGTCGC CGGCGACAGC CGGCTGGCGA ACACGCTGCT CGCAGTGCAC TACTACACGT TCTTTGCCGG AACATCCACG AACGAGTCCG ACTGGGCGAA CGGCATCGCG AACGAGATCG GCAGCTATGC GAGCCGGACT GTCGCCACCG AGTGGGGCGC GCCGATGAGT CCCGGCAGCA AGAACGGCGT CCACTACGAC ACGATCAACT ATGACGTGCC GGGCGGGAAC TTCTTCGACG CCTACGTCCG GGGCGTCAGC AGCGAGCTGC GCAAGCTCGG CGTCGGCAGC GTGTACTGGC CGGGGCTGCG TGACGGCGAC TGGTACAGCC TGACCAGTAA GACCGGTACC GGTGCGTCGA TCGCGCTGAC GCTGGTGAAC GCCTCCGGGC TGGACCGGCT GCAGTACGCG TGGGGAATCG GCAACGGCGG TGGCGGCGGC GGGACGTACG ACCAGATCCG TGACGTGGCC ACCGGCCTGT GCGTCGACGG TCTGGGCAGT ACCACAGTCG GTACCAATGC CAGCCAGTCC AGCTGCGTCA CAGGCGACAC CAACCAGGAG TGGACCATCG TGAGCAGCGG GGGTTACGTC CGTATCCAGA ACCGCGCCAC CGGCCTGTTC CTCGACGGCA TGGGCCGCAC GACCAACGGT TCAGCAGCCG GTCAGTACAG CAGCTCCACC AGCAACAACC AGCAGTGGAC CGAGGTGAGC ACCGCCGGCA GCGCCCGCTT CCAGAACCGC GCGACGGGCT TGTACCTCGA CGGCATGGGC CGCACCTCCA ACGGCTCCGA CCTCGGCCAA TACGCCGGCA GTACCAGCAC CAACCAGCAG TGGACTCTGG TATCCGCGAG CTGA
|
Protein sequence | MRSSSEHNPD RKSLAAKLRI LAAGALAATS LVAAGQIPAH AATAAAASTS QFKGVNWADQ RDNFVNGVLY PSGLNASDTH ASASTVAAQV VGQLDTITGA NTVRMPINEP TVSTYWSTYT GAIDAALTKG KVILAYWAYS GGKPTSTAAF NQMWDTVVAS YGSNPNVYFE VINEPYGYSS TDLNNFYNTW LTRYPAVPRG QVILDGTGDA TNIAGVAGDS RLANTLLAVH YYTFFAGTST NESDWANGIA NEIGSYASRT VATEWGAPMS PGSKNGVHYD TINYDVPGGN FFDAYVRGVS SELRKLGVGS VYWPGLRDGD WYSLTSKTGT GASIALTLVN ASGLDRLQYA WGIGNGGGGG GTYDQIRDVA TGLCVDGLGS TTVGTNASQS SCVTGDTNQE WTIVSSGGYV RIQNRATGLF LDGMGRTTNG SAAGQYSSST SNNQQWTEVS TAGSARFQNR ATGLYLDGMG RTSNGSDLGQ YAGSTSTNQQ WTLVSAS
|
| |
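The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (the tables differ in which codons are permitted as starts). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 and last 24 bases of the gene sequence above are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence listed above.
FIRST_BASES = "ATGAGATCGT CGTCCGAGCA CAACCCAGAT"
LAST_BASES = "TGGACTCTGG TATCCGCGAG CTGA"

if __name__ == "__main__":
    print(translate(FIRST_BASES))
    print(translate(LAST_BASES))
```

The two fragments translate to the first ten residues (MRSSSEHNPD) and the last seven residues (WTLVSAS) of the protein sequence listed above, with the final TGA read as the stop codon.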