Gene RPB_0788 details


Gene Information

Locus tag: RPB_0788
Symbol:
ID: 3909276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 883066
End bp: 884331
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 64%
IMG OID: 637882680
Product: putative OpgC protein
Protein accession: YP_484410
Protein GI: 86747914
COG category: [S] Function unknown
COG ID: [COG4645] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
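
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for RPB_0788.
length = gene_length(883066, 884331)
print(length)                  # 1266
print(protein_length(length))  # 421
```

Under these assumptions the computed values agree with the listed Gene Length (1266 bp) and Protein Length (421 aa).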


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.904469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.107967
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCG TCGTCTCGTC CGTCGCCGAA CAGACCGCAG GAGCGCCGCC GGGAGGCTTG 
CCGCCGACGG ACGCTGCCGC GCTGTCGTCG CCGCCGCTGC AGCCGGAGCT GCGCAAGCCG
GCGCCGAAGC GGGAATTGCG GCTCGATCTG TTCCGCGGCC TGGCGCTGTG GCTGATCTTC
ATCGATCATC TGCCGGCCAA TGTGCTGACC TGGCTGACGA TCCGGAACTA CGGCTTTTCC
GACGCCACCG AGATCTTCAT CTTCATTTCC GGCTACACCG CCGCTTTCGT CTACGGACGG
GCGATGCGCG ATCAGGGCGT GGTGGTGGCG TCGGCGCGGA TCATGAAGCG GGTCTGGCAG
ATCTATGTCG CCCACGTGTT TCTGTTCACG ATCTTCCTCG CCGAGATCTC CTACGTCGCC
ACCAGCTTCC AGAACCCGCT CTACACCGAG GAAATGGGCA TCCTGGATTT CCTCAAACAG
CCCGACGTCA CCATCGTGCA GGCGCTGCTG CTGCGGTTTC GCCCGGTCAA TATGGACGTG
CTGCCGCTGT ACATCGTGCT GATGTTCTTC CTGCCGCCGA TCCTGTGGAC GATGCGGCGC
TCGCCCGATC TGGCGCTGGC GCTGTCGACC GCGCTCTATG TCGCGACCTG GCAGTTCGAC
CTGCACCTCA CCGCCTATCC GAGCGGCGTC TGGGCGTTCA ATCCCTACGC ATGGCAGTTG
CTGTTCGTGT TCGGCGCCTG GTGCGCGATG GGCGGCGCGC AGCGGCTGTC GCGGGTGCTG
GCGTCGAACA TCACGCTGGG GTTGTCGGTG GCCTATCTGC TGGCGGCGTT CTTCATCGTG
CTGACCTGGC ACATGCCGCA GCTCTATCAC ATCCTGCCGA AATGGCTCGA GCAGTGGATG
TACCCGATCG ACAAGCCCAA TCTCGACGTG CTGCGGTTCG CCCACTTCCT GGCGCTGGCG
GCGATCACCG TGCGGTTTCT GCCCCGCGAC TGGCCCGGCC TCAATTCGGT CTGGCTGCGG
CCGATGGTGC TGTGCGGCCA GCATTCGCTG GAAATCTTCT GCCTCGGCAT CTTCCTTGCA
TTCGCGGGCT ACTTCATATT GGCCGAGATC TCCGGCGGAG CGGTGATGCA TTTCTTCGTC
AGTCTGGCCG GCGTCGTTAT CATGTCCGCC TCGGCATGGC TGCTTTCGTG GTACAAGAAC
GCGGTGGCGA AGGGCGGCAA TCAGAAAACA AGCCCGGATG CCGACCTCGC AGGGGGGGAT
GCATGA
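
The GC content field of a record like this one can in principle be recomputed from the gene sequence. The sketch below is an illustration only and is not the database's own method: it assumes GC content is the fraction of G and C bases over all bases, ignoring the spaces and line breaks used for layout. The function name `gc_content` is hypothetical, and the demo uses only the first line of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a nucleotide string, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content listed in the record.
first_line = "ATGACTGCCG TCGTCTCGTC CGTCGCCGAA CAGACCGCAG GAGCGCCGCC GGGAGGCTTG"
print(f"{gc_content(first_line):.0%}")
```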
 
Protein sequence
MTAVVSSVAE QTAGAPPGGL PPTDAAALSS PPLQPELRKP APKRELRLDL FRGLALWLIF 
IDHLPANVLT WLTIRNYGFS DATEIFIFIS GYTAAFVYGR AMRDQGVVVA SARIMKRVWQ
IYVAHVFLFT IFLAEISYVA TSFQNPLYTE EMGILDFLKQ PDVTIVQALL LRFRPVNMDV
LPLYIVLMFF LPPILWTMRR SPDLALALST ALYVATWQFD LHLTAYPSGV WAFNPYAWQL
LFVFGAWCAM GGAQRLSRVL ASNITLGLSV AYLLAAFFIV LTWHMPQLYH ILPKWLEQWM
YPIDKPNLDV LRFAHFLALA AITVRFLPRD WPGLNSVWLR PMVLCGQHSL EIFCLGIFLA
FAGYFILAEI SGGAVMHFFV SLAGVVIMSA SAWLLSWYKN AVAKGGNQKT SPDADLAGGD
A