Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0664 |
Symbol | |
ID | 3970603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 723072 |
End bp | 724052 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637923780 |
Product | cellulose biosynthesis protein CelD |
Protein accession | YP_530555 |
Protein GI | 90422185 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.250925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTCT ACGCATCAGA GCATTATCTC GACGCCGTCG CGGCGGTCTA TTTCAAGGGC CAGCGCGCGC GCATCGAAGA CGTACAAATC GGCGACGAGG TGCTTCGGCT TCTCGTGGTC AATGACAAGC GTGTCATTAC GCGACATCAG TTCTTGGATT TCCACCAGCC GCTGCTTGAA ACCGAGACTC GCGGAGCGAC CCGCAACGGC CGGTACGCGC CCGCCGTGGC GCGCCGGGTG ATCGAGCGGA CGCAGTGGGA TCCGGCGCAG TTTCCCGGCC TGGAGCTCGC GCCCTATGTC GACTGGTCGC AGTTTCCCGA ATACGACGAC TACAAGGCCT ATCTGCTGAA GCACCACAAG AGCCTGGTGC GCGATCGCGA GCGCCGCGGG CGCAGCCTCG CCGCGGCGCA TGGCGAACTG GTGTTCACCA TGAACGACAC GCAGGCCGAC GTGTTCGACG CCGCGCAGCG TTGGAAGAGC CGGCAGCTGC GCGACAGCGG CCTGGCGGAT TATTTCGCCG CCCCGCACAC CATGGAGTTT CTCGAGGCAT TGCGCAGCCG CGGCCAATTG GTCGCCTCGA CGCTGCGCGC CTCCGGCCAA TTGTTGTCGC TGTGGATCGG CTTCGTGCAC GACCGCACCT GGTCCGGCTG GATTTTCACT TATGATCCGG CGTTCCGGAA ATACTCGGTG GGACACCAGC TGCTCAGCTT CATGCTGAGC GAGAGCCACC GCCTCGGCCA CCGCGAGTTC GATTTTTCGA TCGGCAGCGA GGACTACAAG ATGATCTACG CCACGCACGG GCGCGTGCTG GGATCGATCG GCCAGCCGCC GCTCGGCCAG CGGTTGATCG GCTACGCCAA GGACGAATTG CGGGATCGGA CCCCGAAGCT GTTCGACGCC GCGCGGAATC TGAAGAAGCG GATCGACGGC ACGCTGCCGA CTCAGCTGGT GGCGGGCCAG ACCGGCCCGG CCAAGGCGTG A
|
Protein sequence | MNFYASEHYL DAVAAVYFKG QRARIEDVQI GDEVLRLLVV NDKRVITRHQ FLDFHQPLLE TETRGATRNG RYAPAVARRV IERTQWDPAQ FPGLELAPYV DWSQFPEYDD YKAYLLKHHK SLVRDRERRG RSLAAAHGEL VFTMNDTQAD VFDAAQRWKS RQLRDSGLAD YFAAPHTMEF LEALRSRGQL VASTLRASGQ LLSLWIGFVH DRTWSGWIFT YDPAFRKYSV GHQLLSFMLS ESHRLGHREF DFSIGSEDYK MIYATHGRVL GSIGQPPLGQ RLIGYAKDEL RDRTPKLFDA ARNLKKRIDG TLPTQLVAGQ TGPAKA
|
| |