Gene Information |
Locus tag | Caci_6873 |
Symbol | |
ID | 8338239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 7940082 |
End bp | 7941851 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644959962 |
Product | Ricin B lectin |
Protein accession | YP_003117553 |
Protein GI | 256395989 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
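The coordinate and length fields above agree with one another if the start and end positions are read as 1-based and inclusive, which is the usual convention for such records. A minimal Python sketch of that arithmetic follows. The function names are illustrative and not part of any database API, and the code assumes an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(7940082, 7941851)
print(length, protein_length(length))  # 1770 589
```

The result matches the Gene Length (1770 bp) and Protein Length (589 aa) fields listed in the record.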
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.783664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCGC CGATCACTTT GCGGCCACCC GGCACCGGAG GCATCCGGCG CCGCGCTTGG TGGTCCCTGC TCGCGGTCAC CGCCCTGCTG GCCTCCCTGC TGTACGGGGC CGCGTTCTCC ACCGCACCGT CGGCTCACGC GGAGTCCAAC GGCGCCGCGG TGTCACCCCA GTCGGGGTGG AGCAGTTGGA GCTTCATCCG CAAGAACCCG ACCGAGGCGG CCATCGAGGC GCAGGCCGCC GCGATGGCCT CCACCGGCCT GGTCAGCCAC GGTTTCTCCC TCATCAACAT CGACGACTTC TACTACCTGA GCCCCGGGTC GCACGTCGAC TCCTACGGGC GCTGGGTGGT GGACACCGCC AAGTTCCCGC ACGGTATGAA GGCGGTCGCG GACTACGTGC ACGGCAAGGG TGAGAAGTTC GGCATGTACC TGACGCCGGG CATCCCGGTG GCGGCGTACA ACCAGAACAC CCCGATCCAG GGCACGTCGT ACCACGCCCG CGACATCGTC TCCGACACCA CGGACTACGA GAAGAACTAC AACTTCGCCA ACGGCGCGAT GTACTTCATC GACTACGCCA AGAACCCCGC CGCGGCGCAG GCGTTCGTCA ACTCGTGGGC GAATCTGCTC GCCTCCTACG GCGTGGACTA CCTCAAGATG GACGGCGTCG GCACGTCCGA CAAGGCCGAC GTGCAGCACT GGTCCCAGGC GCTCGCCCAG TCCGGGCGCA CCATCCAGTT CGAGCTGTCC AACTCCCTGG ACATCAACCA GGGCCCGTTC TGGAAGCAGT ACGCCAACGG CTGGCGGGTC GGCGGCGACG TGGAGTGCTA CTGCAGCACC CTGACCAACT GGAGCAACGT CTCCAAGCGG TTCAACACCC TGCCGGCGTG GGTGCAGTAC GACAGCCCCG GCGGCTGGGG CGACCCGGAC TCGGTCGAGG TCGGCAACGG CTCGCAGGAC GGTCTGACCC CCGACGAGCG GCGCACCCAG CTCAGTCTGT GGGCCATCTC CGCGGCACCG CTGACCCTGG GGACCGACCT GACCCACATG GACTCCGGCG ATCTGGCTCT GCTGACCGAC GACGAGGTGC TCGGCGTCGA CCGTGCCGGA CACCCCGCGC ACCCGGTCGC GCAGGGCGCA ACGCAGCAGA CCTGGTGGGC GTACAACGGC GACGGCACCT ACACCGTCGG CCTGTTCAAC CTCGCCGGCT CGGCCGCCAA CGTCACCGCG AACTGGTCCG ACCTCGGCTT CACCGGCGGC GCCGCGGTCC GCGACCTGTG GAGCCGCACC AACCTGGGCT CCTCGTCCGG GAAGTTCACC GCCTCCGTGC CCGCCCACGG CTCCCGGCTG CTGCGCGTCA CCCCGGCCTC CGGGCACGGA ACGATCGGGG CGCCCATGGT CAGCAAGCTC TCCGGGCGCT GCGCGGACTC CCCGCAGGGC ACCACGTACA ACGCCGTCCA GGTGCAGGTC TGGACCTGCA ACGGCGGCCC CAACCAGAAC GTCACCTACA ACTCCTCGTC GAAGACGCTG GTGCTGGAAG GCAAGTGCTT CGACGCGCAC AACAACCAGA AGACCGCCGG GACGCACGTC GAGATCTACG ACTGCAACGG CGGCGCCAAC CAGCAATGGA CCAAGAACAG CAACGGCACC ATCACCGGCG TCCAGTCCGG CCTGTGCCTC GACGTCACCG GCGCCACCAA CCCCAACGGG TCGGGCCTGG AGCTGTGGAC GTGCAACGGC GGCCAGAACC AGCAGTGGAC GCTCGGGTGA
|
Protein sequence | MNPPITLRPP GTGGIRRRAW WSLLAVTALL ASLLYGAAFS TAPSAHAESN GAAVSPQSGW SSWSFIRKNP TEAAIEAQAA AMASTGLVSH GFSLINIDDF YYLSPGSHVD SYGRWVVDTA KFPHGMKAVA DYVHGKGEKF GMYLTPGIPV AAYNQNTPIQ GTSYHARDIV SDTTDYEKNY NFANGAMYFI DYAKNPAAAQ AFVNSWANLL ASYGVDYLKM DGVGTSDKAD VQHWSQALAQ SGRTIQFELS NSLDINQGPF WKQYANGWRV GGDVECYCST LTNWSNVSKR FNTLPAWVQY DSPGGWGDPD SVEVGNGSQD GLTPDERRTQ LSLWAISAAP LTLGTDLTHM DSGDLALLTD DEVLGVDRAG HPAHPVAQGA TQQTWWAYNG DGTYTVGLFN LAGSAANVTA NWSDLGFTGG AAVRDLWSRT NLGSSSGKFT ASVPAHGSRL LRVTPASGHG TIGAPMVSKL SGRCADSPQG TTYNAVQVQV WTCNGGPNQN VTYNSSSKTL VLEGKCFDAH NNQKTAGTHV EIYDCNGGAN QQWTKNSNGT ITGVQSGLCL DVTGATNPNG SGLELWTCNG GQNQQWTLG
|
| |
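The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs light cleanup before any analysis. The Python sketch below uses hypothetical helper names (none of them come from the source database). It joins the blocks, computes a GC fraction, and checks that a sequence has a whole number of codons and ends in one of the stop codons of translation table 11 (TAA, TAG, TGA). Only short example strings are evaluated here; no result for the full record is claimed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def is_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# The first two blocks of the gene sequence, as printed in the record.
first_blocks = normalize_sequence("ATGAACCCGC CGATCACTTT")
print(first_blocks)                   # ATGAACCCGCCGATCACTTT
print(gc_fraction("ATGAACCCGC"))      # 0.6
print(is_complete_cds("ATGAAATGA"))   # True
```

To apply it to the full record, pass the entire Gene sequence field through `normalize_sequence` first and then compare the `gc_fraction` result with the GC content field.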