Gene RPB_0050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0050 
Symbol 
ID3908136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp51434 
End bp53662 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content68% 
IMG OID637881931 
Productdiguanylate cyclase/phosphodiesterase 
Protein accessionYP_483673 
Protein GI86747177 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAAA TCCGGCTGAA GACAAGCAAG ACATCGAAAA GATGCACGGT CGCGCGTAAG 
CCAAAGCCAA GCCGCTCCGC GACGAAAGGC CCCAAGCCAT TGCCGCGTCC GAAGCGGCCG
TCGCGCGCCG CGGCGGTCGC CGATCGCGAG CGCGCCGTCG CGGTCGCGCT GGACGAGGCC
CGCAAGTCGC ACGAGCGGCT GCGCGAGGCG ATCGACATCC TGCCGCAGGG CATCGTGTTT
CTCGACTCCG AGGGCCGCTA CATCCTCTGG AACAAGCGCT ACGCCGAGAT CTACAAGGCC
AGCGCCGATC TGTTCAAACC CGGCGCGCGG ATCGAGGACA CGATGCGTGT CGGCGTCGCG
CGCGGCGACT ATCCGGAGGC GGTCGGCCAC GAGGAGGAAT GGATCGCCGA GCGGATGCGG
CGGCTGTTCG ACCCGGGGCA CAGCCACGAG CAGGTGCTCG CCGACGGTCG CTGCATCCGG
ATCGAGGAAC GCACCACCTC CGACGGCGGC GTGATCGGCC TGCGCGTCGA CATCACCGAG
CTGAAGCAGC GCGAGGCGTC GTTCCGGCTG ATGTTCGAGG GCAATCCGGT GCCGATGATC
GTGTGCTCGC TGAAGGACGA GCGCATCGTC GCGGTCAACG ACGCCGCGGT GCTGCATTAC
GGCTATTCGC CCGCCGAATT CGCCGGGTTG ACGATCCGTC GCCTGCAGGC GTTCGGGACC
GAGTTGCCAT GGTCCGGCGA GCAGACCAAC GAGGAGCGCG CGGCGCGGAC CTGGACGCAC
GTCAAGGCCG ACGGCTCGCT GATCGATCTG GCGGTCTATG CGCGCCAGCT CACCTACAAT
GGCGAGCCGG CCGTGCTGCT GGCGCTGATG GACATCACCG AACGCAAGCG CGCCGAGATG
CGGCTCGCCT TCATGGCGCA TCACGACGGC CTGACCAAGC TGCCGAACCG CAGCCTGCTG
CGCAAGCGGC TCGACGAATT GCTGGCCAAG ACCCGGCGCA CCGGCGACAA GATCGCGGTG
CTGTTCATCG GCGTCGATCA TTTCAAGGCG GTGAACGACA CGCTCGGCCA CGCCATCGGC
GACAAGCTGC TGCGCGGCAT CGCGCGGCGG CTGCGTTCGA CGCTGCGCGA GGAGGATCCG
CTGGCGCGGC TCAACTCCGA CGAATTCGCG GTGATCCAGA CCGGGATCAA ACGACCCGAA
GACGTCACGC TGCTGGCCAA GCGGCTGCTG AATGCGATCG CCGAACCGTA TCTGCTCGAG
GGCCATTCGG TGGTCGCCGG CGCCAGCATC GGCATCGCGG TCGCGCCGCT CGACGGCGAC
GATTCCGAGA AGCTCTTGAT GAACGCCGGC ATGGCGCTGT CGCGTGCCAA GACGGATTCG
CGTGGCAGCT TCAGCTTCTT CGAGCCGGAC ATGGACGCCC GCGCGCAGTC CCGCCGCAAG
ATCGAGACCG AGCTGCGCGC GGCGTTGCGT CACGAGGTGC TGCGGCCGTA CTACCAGCCG
CTGATCGCTC TGAGCGGCGG CGGCGTCACC GGCTGCGAGG CGCTGGTGCG CTGGCCGCAT
CCGGAGCGCG GCATGGTCTC GCCTGGCGAA TTCATCCCGG TCGCCGAGGA TACCGGCCTG
ATCAACGCGA TCGGCGCGCA GGTGCTGCGC GCGGCCTGCC GGGACGCCGC GCGCTGGCCC
GGCGACATCA GCGTCGCGGT CAATCTGTCG CCGCTGCAGT TCCGCGTCGG CAATCTGATG
GCGACGGTGA TGGATGCGCT GAAGCAATCC GGCCTGCCGC CGCGCCGGCT CGAACTCGAA
ATCACCGAGA CGCTGCTGCT GGAGAAGAGC AGCCAGGTGA TCGCGACGCT GCACGCGCTG
CGCGCGCTCG GCGTCCGGAT CTCGATGGAC GATTTCGGCA CCGGCTATTC GTCGCTCAGC
TATCTGCGCA GCTTTCCGTT CGACAAGATC AAGATCGACC AGTCGTTCGT CCGCGGCGTC
AGCGACAATC GCGAAGCCCA GGCGATCGTC CGCGCCATCA TCAGCCTCGG CATGGGGCTC
GGCGTCACCA TCACCGCCGA AGGCGTCGAG ACCGAAGCCG AACTGAACTG GCTGCGCGCG
GAGGGCTGCC ACGAGGCGCA GGGCTTCCTG TTCAGCCCGG CGCGGCCGAA CGACGAGCTC
AGGGACCTGC TCGATCGGCA GGGCACGGCC GGCGGTGCGT TCCCGGTCAG CGCGTCGCGC
GTCGCGTAG
 
Protein sequence
MPQIRLKTSK TSKRCTVARK PKPSRSATKG PKPLPRPKRP SRAAAVADRE RAVAVALDEA 
RKSHERLREA IDILPQGIVF LDSEGRYILW NKRYAEIYKA SADLFKPGAR IEDTMRVGVA
RGDYPEAVGH EEEWIAERMR RLFDPGHSHE QVLADGRCIR IEERTTSDGG VIGLRVDITE
LKQREASFRL MFEGNPVPMI VCSLKDERIV AVNDAAVLHY GYSPAEFAGL TIRRLQAFGT
ELPWSGEQTN EERAARTWTH VKADGSLIDL AVYARQLTYN GEPAVLLALM DITERKRAEM
RLAFMAHHDG LTKLPNRSLL RKRLDELLAK TRRTGDKIAV LFIGVDHFKA VNDTLGHAIG
DKLLRGIARR LRSTLREEDP LARLNSDEFA VIQTGIKRPE DVTLLAKRLL NAIAEPYLLE
GHSVVAGASI GIAVAPLDGD DSEKLLMNAG MALSRAKTDS RGSFSFFEPD MDARAQSRRK
IETELRAALR HEVLRPYYQP LIALSGGGVT GCEALVRWPH PERGMVSPGE FIPVAEDTGL
INAIGAQVLR AACRDAARWP GDISVAVNLS PLQFRVGNLM ATVMDALKQS GLPPRRLELE
ITETLLLEKS SQVIATLHAL RALGVRISMD DFGTGYSSLS YLRSFPFDKI KIDQSFVRGV
SDNREAQAIV RAIISLGMGL GVTITAEGVE TEAELNWLRA EGCHEAQGFL FSPARPNDEL
RDLLDRQGTA GGAFPVSASR VA