Gene RPB_4555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4555 
Symbol 
ID3912372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5149067 
End bp5150086 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content69% 
IMG OID637886459 
ProductWD-40 repeat-containing protein 
Protein accessionYP_488149 
Protein GI86751653 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.999711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.705605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT TCGAAACGTC CGGCGAGGCG GCCTCGATCG TCTCGGTCAC CGACCGCGTC 
CGCGAGGTCG CGATCGGCGC GCCGGTGGGG GCGGTGCATT TCCTCGGCGA CACCGCGGTG
TTCATCGGCG CGGAGGAAAA CGCGACCTTT GCCAAGTCGG ATGGCGAGAG CTCGACCGTC
GCGCTGCACG GCGGCGCCGT TCTCAGCTCG GTCACCGACG GCAAGCGCAT CGTCAGCGGC
GGCGACGACG GCAAGGTGAT GGCGCTCGAC GCGGGCGGCA AGGCCGAGCT GTTCGCCACC
GACGCCAAGC GGCGCTGGAT CGACAATGTC GCGCTGCATC CGGACGGTGC GGTGGCGTGG
TCGGCGGGCA AGATCGCTTA CGTCCGCGCG CCGAAGGCCG AGGAGAAATT CTTCGAGGTG
CCGTCGACGG TCGGCGGCCT CGCTTTCGCG CCGAAGGGAA TGCGGCTCGC GATCGCGCAT
TACAACGGCG TGACGCTGTG GTTTCCGAAC ATGGCGGCCA ATGCCGAAAT GCTCGAATGG
GCCGGCTCGC ATCTCGGCGT GATGTTCAGC CCGGACAACC GTTTTCTGGT CACGTCGATG
CACGAGCCGG CGCTGCACGG CTGGCGGCTC GCCGACGCCA AGCACATGCG GATGACCGGC
TATCCCGGCC GGGTGCGGTC GATGGCCTGG ACCTCGGGCG GCAAGGGCCT CGCTACCTCG
GGCGCCGACG CCGTGATCGT CTGGCCGTTC GCCAGCAAGG ACGGGCCGAT GGGCAAGCAG
CCGGCGATGC TGGCGCCGCT GCAGGCGCGC GTCAGCATGG TGGCGTGCCA CCCCAAGCAG
GACATCCTCG CCACCGGCTA CAGCGATGGC ACCGTGCTGA TGGTGCGGCT CACCGACGGC
GCCGAGATCC TGGTCCGGCG CAACGGCACG CCGCCGGTCA CGGCACTGGC GTGGAATGCG
AGCGGCAGTC TGCTGGCATT CGCCGACGAG GAGGGTGCAG CGGGACTGCT GACGCTGTAA
 
Protein sequence
MSDFETSGEA ASIVSVTDRV REVAIGAPVG AVHFLGDTAV FIGAEENATF AKSDGESSTV 
ALHGGAVLSS VTDGKRIVSG GDDGKVMALD AGGKAELFAT DAKRRWIDNV ALHPDGAVAW
SAGKIAYVRA PKAEEKFFEV PSTVGGLAFA PKGMRLAIAH YNGVTLWFPN MAANAEMLEW
AGSHLGVMFS PDNRFLVTSM HEPALHGWRL ADAKHMRMTG YPGRVRSMAW TSGGKGLATS
GADAVIVWPF ASKDGPMGKQ PAMLAPLQAR VSMVACHPKQ DILATGYSDG TVLMVRLTDG
AEILVRRNGT PPVTALAWNA SGSLLAFADE EGAAGLLTL