Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3294 |
Symbol | |
ID | 4023803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3646023 |
End bp | 3647297 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637963497 |
Product | cellulose biosynthesis protein CelD |
Protein accession | YP_570419 |
Protein GI | 91977760 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.421998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0110847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGCC CGAATATCCG GCGCAGGATC GACAGCGAAC CGGCGCGTTG TGGGACGGAT TCGGCCCAGA CACAACCCGC ACCACAACCG ACCGATCGCC CCGCTGCCGA TCCATCCGCG ACCGGAACCG CCCTGCCCGC CGCGCTCTCG ATCGCTGTGG CGGCGAAGCT GTCGGACCTC CCGTTCTGGC CCGGGATCAG TGAAGCGACG CCCGGCCATC GCTTCGTGTT TCAATGCGCC GACGTGCTCG AGGTCTGGCG CGACACCATC GGCGCGGCGC AGCGCGTGAA GCCGCTGTTC GTCGCCGTGA CCACACAGGA CGGCGCACCG CTGCTGCTGC TGCCGCTCGG GATCGAACGC CGCGACGGAC TGCGCGTGCT CGGCTTCCTC GACGGCACCG TCAGCGACTT CAACGCGCCG ATCGTATTCG CGCCCGCACA GCATTGGGAC GGCGCTGCGA TGCGTCGGCT GTGGGACGAT CTGCAGCCGC ATCTGCCGGC CTACGACATC GCGATCTTCC GCAAGATGCC CGAGACCATC GACGGGATCG CCAATCCGTT CCGGCATCTG GCGACCGCGC GCAGCGCCCA CGGCACGCAT ACGCTCAAAC TGCCGGCGGA CTGGTCGGCG GCTGCGCGCG ACATCCTGCC CGATCTCTCG GATTCGCGCC GCAGGCTGCG CAAGCTCGAC CGACTCGGCG CGACGGAGTT CCGCATCGCC GACAATGTCG ACGAGGCGAT CGCCTTCACG ACGGCGGCGA TGGCGATGAA AGGCCGCCGG CTGGTCGACA CGATCGGCGT CGACCGCTTC GCCAGTGAAC CCGGCTACGC CGATTACTAC GTCGAGATCA CCCGCCGGCT GTTCGCAACC GGCGCCGTCC ACGTCTCTGC GCTGATGATC GACGGCACGC CGCGCGCAGC GCATTGGGGC TTCGTGTTCG CCGGCCGGTT CTATCATCTG CTGACGGCGT TCGACGTCGA CGCGGCGTGG CGGCCCTACG CGCTCGGCCG GATGCACAAT GAATTCCTGA TGGAATGGAG CGCCAATGCG GGGCTCGCGA CGTTCGATTT CGGCGTCGGC GACGAGCCGT ACAAGACCGC CTACAGCAAC GACTATCAGC AACTCGCCGA TGCGATCCTG CCGCGCACGC TGATCGGCCA CGCCTATGGA TGGCTGGTCG ACCTGCGGCG CTATTCCGCG CGCACGATCC GCGCCTCCGC TTTCGCCACG ACGGCCGAGC GCATCCGCCG CGATCTCAAC AAATGGCGGA ATTGA
|
Protein sequence | MHGPNIRRRI DSEPARCGTD SAQTQPAPQP TDRPAADPSA TGTALPAALS IAVAAKLSDL PFWPGISEAT PGHRFVFQCA DVLEVWRDTI GAAQRVKPLF VAVTTQDGAP LLLLPLGIER RDGLRVLGFL DGTVSDFNAP IVFAPAQHWD GAAMRRLWDD LQPHLPAYDI AIFRKMPETI DGIANPFRHL ATARSAHGTH TLKLPADWSA AARDILPDLS DSRRRLRKLD RLGATEFRIA DNVDEAIAFT TAAMAMKGRR LVDTIGVDRF ASEPGYADYY VEITRRLFAT GAVHVSALMI DGTPRAAHWG FVFAGRFYHL LTAFDVDAAW RPYALGRMHN EFLMEWSANA GLATFDFGVG DEPYKTAYSN DYQQLADAIL PRTLIGHAYG WLVDLRRYSA RTIRASAFAT TAERIRRDLN KWRN
|
| |