Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4641 |
Symbol | |
ID | 8335995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5277274 |
End bp | 5278998 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644957741 |
Product | Ricin B lectin |
Protein accession | YP_003115343 |
Protein GI | 256393779 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0200277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.347937 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACGC CCCGCCCCAC GCCCCGCCGG TTCGCTGTCG CCGCCCTGGC CTGCACTCTG GCAGCTCTAC CCCTGACATC CAGCACCGCT CACGCGACGC CAACGCCACC GCCCTCAACA GCCCACACGG CCACCGCTGC CAGTACTGCC GCCACCCTGG CCGCCACACC CCCGATGGGC TGGAACGACT GGGCCCACTA CCAGTGCTCC GTCGACGAAT CCACGGTCGT CGCGAACGCC AACGCGCTCG TCAGCAGCGG CCTGGCCGCC AAGGGCTACA AGACCGTCAC GGTCGACGAC TGCTGGATGG CCTCCTCCCG CGACTCCGGC GGCACCCTGG TCGCCAACTC CACGAAGTTC CCCCACGGCA TGGCCTGGCT CGGCAGCTAC CTGCACAGCA AAGGCCTGAA CTTCGGCATC TACGAAGACG CCGGCTCCTC CACCTGCGGC GGCTATCCCG GCAGCGGCCA ACCCCAAGGC GGCGGCGCGG ACCACTTCGC GCATGACGCC GCGACCTTCG CCTCCTGGGG TGTGGACTAC CTGAAGCTCG ACGGCTGCAA CGTCTACATC CCGAGCGGGG AAAGCACCGA GCAGGCGTAC CACAACGCCT ACACCGCCGA GTCGACCGCG CTGGCGAACG CCGGCCGCCC GATCGTGTTC TCCGAGTCCG CGCCCGCGTA CTTCCAGAGC GGCGAGTGGG GCAATCCCAC CTGGTTCGAC GTCCTGGGCT GGGTCGGCCA GCTCGGGCAG CTGTGGCGCG AAGGGTACGA CATCGCCACG TACAACAGCG GCAACCCCAC CGCCAGCCGC TGGTCCTCGG TGATGTCCAA CTACGGCTAC AACCGCTGGA TCGCCCGCTA CGCCCACCCC GGCAACTGGA ACGACCCCGA CTTCCTCATC GCCGGCGACC CTGGCCTGAC CGCTGAGGAG TCCCGCAGCC AGGTCGCCTT GTGGGCGATG ATGAACGCCC CGATGATCCT GTCCTCGGAC GTCGCCAACC TCAGCGCCGA CGGCCTGGCC GCCCTGGGCA ACACCGACCT GATCGCCCTG GACCAGGACA GCGCGGGCCG CCAGGCAGGC GTGGTCTCCA CCAACGGCAC CACCGACGTC CTGGCCAAGC CGCTGGCCAA CGGCGACCGC GCCGTAGCGG TCCTGAACCG CGGCAGCGCG TCGCAGAACG TGTCGACCAC CCTGGCCTCG ATCGGCCTGC CCAACTGCAC GGCCAGCGCC AAGAACCTGT GGACCGGCAC CACCACGACC AGCAGCACCC TGACCGCGAC CATCCCCGCC CACGGCACCG CCATCTGGCG CCTGGCCCCC TCCGCCGGCT GCGCAGCCGC AGTGCCGACC GGCGAGATCG TCGGCAACGG CGCCAAGTGC GTGGACGTCA CCGGCAGCGG CACCGCCAAC GGCACGGCGG CGATCCTGTA CACCTGCACC GGCAACGCGA ACCAGTCCTG GACGCGCCCC GGCAACAGCA GCATCCAGAC CCTCGGCAAG TGCCTCACTG CCAACGGCAC CACGGCGGGC AGCACGGTCG TGATCTCCGC CTGCATCGGC GCCAGTGCAC AGCAGTGGAC GGCGCAGGCT GACGGGACGG TCGCGAACGG CGCGTCCGGC CTGTGCCTGG ACGTCTACGG CGGCGGCAGC GCCGACGGCA CGAAGCTGGA CACCTGGACG TGCGGCAGCC ACCAGGCGAA CCAGACCTGG GCGATGCCGA GCTGA
|
Protein sequence | MSTPRPTPRR FAVAALACTL AALPLTSSTA HATPTPPPST AHTATAASTA ATLAATPPMG WNDWAHYQCS VDESTVVANA NALVSSGLAA KGYKTVTVDD CWMASSRDSG GTLVANSTKF PHGMAWLGSY LHSKGLNFGI YEDAGSSTCG GYPGSGQPQG GGADHFAHDA ATFASWGVDY LKLDGCNVYI PSGESTEQAY HNAYTAESTA LANAGRPIVF SESAPAYFQS GEWGNPTWFD VLGWVGQLGQ LWREGYDIAT YNSGNPTASR WSSVMSNYGY NRWIARYAHP GNWNDPDFLI AGDPGLTAEE SRSQVALWAM MNAPMILSSD VANLSADGLA ALGNTDLIAL DQDSAGRQAG VVSTNGTTDV LAKPLANGDR AVAVLNRGSA SQNVSTTLAS IGLPNCTASA KNLWTGTTTT SSTLTATIPA HGTAIWRLAP SAGCAAAVPT GEIVGNGAKC VDVTGSGTAN GTAAILYTCT GNANQSWTRP GNSSIQTLGK CLTANGTTAG STVVISACIG ASAQQWTAQA DGTVANGASG LCLDVYGGGS ADGTKLDTWT CGSHQANQTW AMPS
|
| |