Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2698 |
Symbol | |
ID | 4023196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3015160 |
End bp | 3016362 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637962897 |
Product | hypothetical protein |
Protein accession | YP_569828 |
Protein GI | 91977169 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.402222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.567569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA TGGCCGTGCT GAGCGAAGGC CGGTCCGCGG GCCACGCGGC GGCGTCGGCG CAGCCCGGCC GGATTGCGCG CGTCGAAATC GTCCGCGAGA TGGCGGCGGC GGAAGCGATC TGGCGTTCGC TCGAGCAACC CGAGCAGTTC TTCACGCCCT ATCAGCGCTT CGATTTCCTC GATGCCTGGC AGCGGCACGT CGGCGTTGCG GAACAACTTG AACCCTTCAT CGTGGTGGCG CGTGATGCCG AGCTTCGGCC GTTGATGTTG CTGCCGCTCG GACTCGAGCG CCGCTTCGGC CTGCGCATCG CGCGCTTCCT CGGCGGCAAA CATGCCACGT TCAACATGCC GCTGTGGCGT CGCGACGCGG CGCAGACTGC CGATGCGCGC GAACTCGATG CGCTCATCGC GGGCCTGCGC GCACAGCCGG ACGGCGCCGA CGTGCTGGCG CTCCGCCAGC AGCCACTGCG CTGGCGCGAC CTCGCCAATC CGCTGGCGCA ACTACCGCAT CAAGCCTCGG TCAACGAATG TCCGGTGCTG TTGCTCGATC CCGCAGCGTC CCCGAGCGAT CGCATCAGCA ATGCGTCTCG CCGCCGTCTC AAGACCAAGG AAAAGAAGCT GCAGGCGCTG CCCGGCTATC GTTACAGTCA GGCGACCAGC GACGACGATG TTCGGCGCGT GCTCGATGCG TTCTTCCGGA TCAAGCCGGT CCGGATGGCG GCGCAGAAGC TGCCGAACGT GTTCGCCGAT CCCGGCGTCG AGGATTTCAT CCGCCGGGCG TGCCAGACCG AACTCGCCGG CGGCGGCCGT GCGATCGAAA TCCACGCGCT GGAATCCGAC GACGATATGA TCGCGATGTT CGCCGGCGTC GCCGACGGCC ACCGCTATTC GATGATGTTC AACACCTACA CGTTGTCCGA AGCCGCGCGC TACAGCCCCG GCCTGATCCT GATGCGCTCG ATCATCGATC ACTACGCCGA GCTGGGCTAC AGTCGGCTCG ATCTCGGCAT CGGCTCCGAC GATTACAAGA AGCAGTTCTG CAAGGATGAC GAGCCGATCT TCGACAGCTT CGTCGCCCTG ACGCCGCGCG GCCGGATTGC GGCTTCGGCG ATGGCGTCGA TCGACCGCGC CAAACGCACG GTCAAGCAGA CCCCTGCCCT GATGCAGATG GCGCAGGCGC TGCGCGGCGC GCTGTATCGC TGA
|
Protein sequence | MTMMAVLSEG RSAGHAAASA QPGRIARVEI VREMAAAEAI WRSLEQPEQF FTPYQRFDFL DAWQRHVGVA EQLEPFIVVA RDAELRPLML LPLGLERRFG LRIARFLGGK HATFNMPLWR RDAAQTADAR ELDALIAGLR AQPDGADVLA LRQQPLRWRD LANPLAQLPH QASVNECPVL LLDPAASPSD RISNASRRRL KTKEKKLQAL PGYRYSQATS DDDVRRVLDA FFRIKPVRMA AQKLPNVFAD PGVEDFIRRA CQTELAGGGR AIEIHALESD DDMIAMFAGV ADGHRYSMMF NTYTLSEAAR YSPGLILMRS IIDHYAELGY SRLDLGIGSD DYKKQFCKDD EPIFDSFVAL TPRGRIAASA MASIDRAKRT VKQTPALMQM AQALRGALYR
|
| |