Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4555 |
Symbol | |
ID | 3912372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5149067 |
End bp | 5150086 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886459 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_488149 |
Protein GI | 86751653 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.999711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.705605 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACT TCGAAACGTC CGGCGAGGCG GCCTCGATCG TCTCGGTCAC CGACCGCGTC CGCGAGGTCG CGATCGGCGC GCCGGTGGGG GCGGTGCATT TCCTCGGCGA CACCGCGGTG TTCATCGGCG CGGAGGAAAA CGCGACCTTT GCCAAGTCGG ATGGCGAGAG CTCGACCGTC GCGCTGCACG GCGGCGCCGT TCTCAGCTCG GTCACCGACG GCAAGCGCAT CGTCAGCGGC GGCGACGACG GCAAGGTGAT GGCGCTCGAC GCGGGCGGCA AGGCCGAGCT GTTCGCCACC GACGCCAAGC GGCGCTGGAT CGACAATGTC GCGCTGCATC CGGACGGTGC GGTGGCGTGG TCGGCGGGCA AGATCGCTTA CGTCCGCGCG CCGAAGGCCG AGGAGAAATT CTTCGAGGTG CCGTCGACGG TCGGCGGCCT CGCTTTCGCG CCGAAGGGAA TGCGGCTCGC GATCGCGCAT TACAACGGCG TGACGCTGTG GTTTCCGAAC ATGGCGGCCA ATGCCGAAAT GCTCGAATGG GCCGGCTCGC ATCTCGGCGT GATGTTCAGC CCGGACAACC GTTTTCTGGT CACGTCGATG CACGAGCCGG CGCTGCACGG CTGGCGGCTC GCCGACGCCA AGCACATGCG GATGACCGGC TATCCCGGCC GGGTGCGGTC GATGGCCTGG ACCTCGGGCG GCAAGGGCCT CGCTACCTCG GGCGCCGACG CCGTGATCGT CTGGCCGTTC GCCAGCAAGG ACGGGCCGAT GGGCAAGCAG CCGGCGATGC TGGCGCCGCT GCAGGCGCGC GTCAGCATGG TGGCGTGCCA CCCCAAGCAG GACATCCTCG CCACCGGCTA CAGCGATGGC ACCGTGCTGA TGGTGCGGCT CACCGACGGC GCCGAGATCC TGGTCCGGCG CAACGGCACG CCGCCGGTCA CGGCACTGGC GTGGAATGCG AGCGGCAGTC TGCTGGCATT CGCCGACGAG GAGGGTGCAG CGGGACTGCT GACGCTGTAA
|
Protein sequence | MSDFETSGEA ASIVSVTDRV REVAIGAPVG AVHFLGDTAV FIGAEENATF AKSDGESSTV ALHGGAVLSS VTDGKRIVSG GDDGKVMALD AGGKAELFAT DAKRRWIDNV ALHPDGAVAW SAGKIAYVRA PKAEEKFFEV PSTVGGLAFA PKGMRLAIAH YNGVTLWFPN MAANAEMLEW AGSHLGVMFS PDNRFLVTSM HEPALHGWRL ADAKHMRMTG YPGRVRSMAW TSGGKGLATS GADAVIVWPF ASKDGPMGKQ PAMLAPLQAR VSMVACHPKQ DILATGYSDG TVLMVRLTDG AEILVRRNGT PPVTALAWNA SGSLLAFADE EGAAGLLTL
|
| |