Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7221 |
Symbol | |
ID | 8338589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8390902 |
End bp | 8392806 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644960302 |
Product | Ricin B lectin |
Protein accession | YP_003117891 |
Protein GI | 256396327 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.243667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAAAT CGTTCCAAGG CTCCACCGCG AGACGCCGGA TCCGACTCGT GATCCGGGCG GTCTCGGCGG CGGCGCTCGC CATGTCCGGC TTCGCGCTCG CCGACGTCCC CGCGCACGCC GCGAACGAGT CGGTCAACGT CTGGCTGACC AGCACCAACG ACTCCGCCGG CCGGAACGTC ACCCGCGGTC TGCAACAGCA GGCGGCGGTC TCCTTCGCCT CCGGCTCCGG CAGCGGCGGC CAGGTGGTCA CCGTCAACGA GAACACGCAC TACCAGCAGT TCACCGGGGC CGGCGCGTCG TTCACGGACA CCGCGGCGTA TCTGATGAAC AGCAGCGGCG CGCTCAGCGC GTCGACTCGC AATACCCTGA TGACCAACCT GTTCAGCCCC ACGTCCGGGA TCGGTCTGGA CTTCCTGCGC AACCCGCTGG GCGCCTCGGA CCTCGCGCGC TACAGCTACA CCTTCGACGA CATGCCCGCC GGGCAGACCG ACCCGAGCCT GGCGAAGTTC TCGATCGCCC ACGACCTGGT CGACGTGCTG CCGCTGACCA AGCAGGCGCA GCAGCTCAAC CCGGGTCTGA AGGTCATGGC CTCGCCGTGG ACCGCCCCAC CGTGGATGAA GGACAGCGGC GCGTACAGCC AGGGCTACCT CCAGTCGCAG TACTACGCCG CCTACGCGCA GTACTTCGTG AAGTACATCC AGGCCTACCA AGCGCAGGGC GTGCCGATCA ACTACGTGTC GGTCCAGAAC GAGCCCACCT GCTGCTCGGG GTATCCCTCG ATGCAGTGGA ACGGATCGGG CCTGGACTAC TTCACGGCGA ACGACCTGCT TCCGGCGTTC CACTCCGCGG GTCTGTCGAC GAAGGTCCTG GCGCTTGACT GGAACCCGGA CAGCTACGCC TCGTACGGCG CCCCCACCGT CGACGACGCG ACCGTCCGCA ACGACCCGAA CTTCGGCGGC ATCGCCTGGC ACGGGTACGA GGGCAGCGTC ACCACCCAGA CGGACATCCA CAACCAGTAC CCGAACGTGG ACGCCTACGA CACCGAGCAC TCCGGCGGCA CCTGGATCGG CAACCAGCAG CAGGAGGACA TGAACAACAT CATCGACTAC ACCCGCAACT GGGGTAAGTC GGTGGTGAAG TGGTCCCTGG CGGTGGACCA GAACATGGGC CCGCACAACG GCGGCTGCGG CACTTGCACC GGCCTGGTCA CGGTCCACAA CGGCGACTCG CGCTCCGGCC AGGTCGACTA CAACATCGAG TACTACGACA TGGGCCAGCT CACCAAGTTC GTGAAGCCCG GCGCCTACCG CATCGACTCC ACGGCGAACT CGAGCGTCCC GAACGTCGCC TGGCAGAACC CGGACGGGTC CAAGGCGCTG GTCGCGTACA ACGAGTCCGG CAGCACCCAG ACGCTGACGG TGAACTGGGG CAACGAGCAC TTCAGCTACT CCCTGCCGGC GCAGACCTCC GCGACGTTCA CCTGGAACGG CACGCAGGGC ACCGGCGGCG GCACCGGCAC CCCGACCGGC CAGATCAGCG GCTACGGCGG CAAGTGCGTC GACGTCGCGG GCGCCAACCC GGCGAACGGC ACCGCGGTCC AGCTCTACGA CTGCAACGGC ACCGGTGCGC AGCAGTGGAC GGTCGCCTCG AACGGCTCGC TGCAGTCCCT CGGGAAGTGC ATGGACGTGA CAAGCGCGGG GACGACGAAC GGAACGAAGG TGCAGCTCTA CGACTGCAAC GGAACCGCGG CGCAGCACTG GACGCACCAA GCCAACGGAG AGTTGGTGAA CGCCGGCTCC GGACGCTGCC TGGACGCCAC GGGCCCGAGC TCGGCGAACG GCACCCGGCT GCAGATCTGG GACTGCACGG ACGCCGCGAA CCAACAATGG AACCTACCGT CGTGA
|
Protein sequence | MPKSFQGSTA RRRIRLVIRA VSAAALAMSG FALADVPAHA ANESVNVWLT STNDSAGRNV TRGLQQQAAV SFASGSGSGG QVVTVNENTH YQQFTGAGAS FTDTAAYLMN SSGALSASTR NTLMTNLFSP TSGIGLDFLR NPLGASDLAR YSYTFDDMPA GQTDPSLAKF SIAHDLVDVL PLTKQAQQLN PGLKVMASPW TAPPWMKDSG AYSQGYLQSQ YYAAYAQYFV KYIQAYQAQG VPINYVSVQN EPTCCSGYPS MQWNGSGLDY FTANDLLPAF HSAGLSTKVL ALDWNPDSYA SYGAPTVDDA TVRNDPNFGG IAWHGYEGSV TTQTDIHNQY PNVDAYDTEH SGGTWIGNQQ QEDMNNIIDY TRNWGKSVVK WSLAVDQNMG PHNGGCGTCT GLVTVHNGDS RSGQVDYNIE YYDMGQLTKF VKPGAYRIDS TANSSVPNVA WQNPDGSKAL VAYNESGSTQ TLTVNWGNEH FSYSLPAQTS ATFTWNGTQG TGGGTGTPTG QISGYGGKCV DVAGANPANG TAVQLYDCNG TGAQQWTVAS NGSLQSLGKC MDVTSAGTTN GTKVQLYDCN GTAAQHWTHQ ANGELVNAGS GRCLDATGPS SANGTRLQIW DCTDAANQQW NLPS
|
| |