Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4904 |
Symbol | |
ID | 8336258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 5587771 |
End bp | 5589255 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958003 |
Product | Ricin B lectin |
Protein accession | YP_003115605 |
Protein GI | 256394041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.146094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATAC GAGCAAGGCG CAGCACCGTC AGCGCCCTCA TCGCCGCAGT GACGACCGCC CTGTCCCTCG TGGGTCTGTC CCAGACCTCG GCGCAGGCCG CGACGTCGCA GTTCTCGCCC GGGCAGACGT GGAACGACAC CTCCGGCACC GCGCTGCAGA TGCACGGCCT CGGCATCGTC AAGGTCGGCT CCACCTGGTA CGGCTTCGGC GAGGACAAGA CCGGGGAGAA CTCGGGCAAC GCCGCCTTCC AGGACATCCC GTGCTACAGC TCCACCGACC TGTCGCACTG GACGCTGCAA GGCAAGGCCC TGACCAGGCA GACCAGCGGC GACCTCGGAC CGAACCGCGT CGTGGAGCGC CCCAAGGTCC TGTTCAACGC CAGCACTAAC ACCTTCGTGA TGTACATGCA CATCGACAGC GCCAGCTACG GCGAGGCGAA GGTCGGGGTC GCGACCAGCA GCACACCGTG CGGCCCGTAC AGCTACCGGG GAAGCTTCCA GCCGCTGGGC CGTCAGAGCC GCGACATCGG CTTGTTCCAG GACACCGACG GCACCGGCTA CCTATTGTCC GAGGACCGCG CCAGCGGTCT GCGCGTCGAC AAGCTCTCCG CGGACTACCT CAGCGTCGTG AGCGCGGGCG GCAGCGGCGG CAGCGTGGCG CTGTTCGCCG ACTACGAGGC GCCCGCGATG GTCAAGACGA ACGGGACCTA CTTCGTCCTC GGCTCGCATC TGACCGGCTG GAACCTCAAC GACAACGTGT ACGCCACGGC GACCTCGCTG TCCGGCTCCT GGTCCTCCTT CAAGGACTTC GCCCCCGCCG GCACCAACAC CTACCAGACG CAGACCGCGA ACATCATCCC GGTCTCGGGA AGCGCCGGCA CGTCCTACAT CTACGCCGGC GACCGCTGGA ACCCGAACAA CCTCGGGGGA TCACAGCTGG TCTGGCTGCC CCTGACGCTG TCCGGCACGA CGGCGAACGT CGGATGGCAG AACTCGTGGT CCCTCGATGT CGCCGCCGGC ACCTGGTCCG GCAGCTCGAA TCCGGCGTCG GGCTCGACCC ACCACCTCAC CAACGCGAAC AGCTCGATGG TGATGGACGT CAGCGGCGGT TCCACCGCGA GCGGCGGCGC GGTCATCCAG TGGGCCGGCC ACGGCGGCAC CAACCAACAG TGGACTCTCC ACCAGGTCGC AGGGAACGTC TACACCCTCA CCAACCAGAA CAGCGGCCTG TGCCTGGAGG TACCGAACCG CTCCACCGCC ACCGGCACCG CGCTCGACCA GTGGACCTGC GGCGGCGGCA GCAACCAGCA GTGGGCCCTG GACCCGGTCG GCAGCTACAC CTCCTCCAGC GACGCCAGCT ATGAGCTCAC CAACCTGAAC AGCGGCCTGG TCGCCGACGT CTCCGGTGGT TCCACCGCCC AGGGCGCGCA GGTGATCCAG TGGACGACCA ACGGTCAGGC GAACCAGACA TGGACGTTGT CGTGA
|
Protein sequence | MPIRARRSTV SALIAAVTTA LSLVGLSQTS AQAATSQFSP GQTWNDTSGT ALQMHGLGIV KVGSTWYGFG EDKTGENSGN AAFQDIPCYS STDLSHWTLQ GKALTRQTSG DLGPNRVVER PKVLFNASTN TFVMYMHIDS ASYGEAKVGV ATSSTPCGPY SYRGSFQPLG RQSRDIGLFQ DTDGTGYLLS EDRASGLRVD KLSADYLSVV SAGGSGGSVA LFADYEAPAM VKTNGTYFVL GSHLTGWNLN DNVYATATSL SGSWSSFKDF APAGTNTYQT QTANIIPVSG SAGTSYIYAG DRWNPNNLGG SQLVWLPLTL SGTTANVGWQ NSWSLDVAAG TWSGSSNPAS GSTHHLTNAN SSMVMDVSGG STASGGAVIQ WAGHGGTNQQ WTLHQVAGNV YTLTNQNSGL CLEVPNRSTA TGTALDQWTC GGGSNQQWAL DPVGSYTSSS DASYELTNLN SGLVADVSGG STAQGAQVIQ WTTNGQANQT WTLS
|
| |