Gene RPD_0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0437 
Symbol 
ID4020903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp504585 
End bp506261 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content66% 
IMG OID637960622 
ProductGGDEF domain-containing protein 
Protein accessionYP_567576 
Protein GI91974917 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGCA TCGCCATCAT TCGCAAGAAA TCGCAGCTCC GGGGGATGCT CGGAATCCGC 
GCCAGGCTGG TGGTTCTGGC TCTGATTCTG GTCTGCCCGC TGATGCTCGA TCGCGTGCGT
CTGCTCGAAA ACACGCGCAC CACCCAGATT GCCGCGGTGG CCGACAGTGT CGCAACGCTG
GCGCAGCGCG CCACCGATGC ACAGCGCGAA GTGATCTCCT CGGTCGAGGC GGTGCTGAAA
TCGGCGGCCT ATATCCACGC CGCGGCGTCG CAGATCGGGC GCAGCTGCAG CATCCTGCGC
GCCAGCCTGC GGGTCGATCT GCCCTCGATC CGGATGCTGT CGGTGGCGGA TTCCAGCGGC
ATCATCCGCT GCTCCACCTC GTCGATGTTC GTCGGCTCGG ACGTCAGCAA CCGCGCTTAT
TTCCGCAAGG CGTTCGAGAC CCACGATTTC GTCGTCAGCG ACTTCGTGAT CGGACAACAG
AGCCGCCAGG GCACGATCCT CGCGGCCTAT CCGGTGTCGG CGGTCGACAC CGGCGAAGAA
GCCGTGCTGA TCGCCGGCAT GAATCTCGAT TGGCTGTCGG ACATCATGGC CAATCTCGCC
GGCCGCCCCG GGATCAACGT CGCGCTGATC GACGGCGAAG GCACCGTGCT GGCGACGCCG
CCCGACCACC GCAGCCTGAT CGGCCGCAAA CTCGATCAGT CGACGCTGGA TTTCGCGCTC
GGCGCCCGCC CTCTGGAGCG CGAACAGGCG ACCACCTCCA TTGATTCCGG GCAGACGCTG
TCGGTGTCGC GGATACCCGG CACCAATGCG CGGCTGGTGG TGACGCTGGA CGAGAACATC
GTCTCGGCGG CGATCAGCCG CGACATCCGC ACCGCCTATC TGCAATTGGC GCTGGTCTGC
CTGCTCGTGC TGCTCGGCGC GCTGATCGCC GCCGAACGGC TGATCGTGCG GCCGATCTCG
GTCCTGACCT CGATCGCCAA CCAGTTCGCG CAGGGCGACT GGTCGGCGCG CGTCACAAGG
GCACGGCTGC CGGCGGAATT TCTGCCCTTG GCGCGCGCGT TCAACGCGAT GGCCGCGCAG
CTCGGCCAAC GCGAGCGCGA ACTGGTCGCA AGCAACAACA GCCTGACCGT GATGGCGTCG
ATGGATCTGC TGAGCGGGCT CGCCAACCGG CGCGGCTTCC AGAGCCGGCT GGATTTCGAA
TGGATGCGCG GCCTGCAGAA CGGTCATACC GTGGCGCTGC TGCTGCTCGA CGTCGATTAT
TTCAAGTCGT TCAACGATAG CTACGGCCAC CCGGAGGGCG ACGCCTGCCT GTCCCGGATC
GGCGAGACCC TCGCCACCGT CGCCAACGAC ACCGGCGGCT TTGCGGCGCG CTATGGCGGC
GAGGAGTTCT GCCTTCTGTT GCCGGATACC GACGCCGACA CCGCCTATCA GGTCGGCGAA
ATGGTCCGCG CCACCGTCGA ACGGCTGGAC GTGCCCCACC ACACCAGCTT GTTTCAGCGC
GTCACCGTCA GCATCGGCGT CGCGTCCACT TCGCCGACCG AGGACGCCAG CCCGGCCGAG
TTGATCGAGG CCGCCGACGC CGCGCTCTAT GCCGCCAAGG GCCGCGGCCG CAACACCGTC
GTCGAACACG GCTTCATCCG CGCCACCGAC ACGATATCGA TGGCGATCGC CAGCTGA
 
Protein sequence
MSRIAIIRKK SQLRGMLGIR ARLVVLALIL VCPLMLDRVR LLENTRTTQI AAVADSVATL 
AQRATDAQRE VISSVEAVLK SAAYIHAAAS QIGRSCSILR ASLRVDLPSI RMLSVADSSG
IIRCSTSSMF VGSDVSNRAY FRKAFETHDF VVSDFVIGQQ SRQGTILAAY PVSAVDTGEE
AVLIAGMNLD WLSDIMANLA GRPGINVALI DGEGTVLATP PDHRSLIGRK LDQSTLDFAL
GARPLEREQA TTSIDSGQTL SVSRIPGTNA RLVVTLDENI VSAAISRDIR TAYLQLALVC
LLVLLGALIA AERLIVRPIS VLTSIANQFA QGDWSARVTR ARLPAEFLPL ARAFNAMAAQ
LGQRERELVA SNNSLTVMAS MDLLSGLANR RGFQSRLDFE WMRGLQNGHT VALLLLDVDY
FKSFNDSYGH PEGDACLSRI GETLATVAND TGGFAARYGG EEFCLLLPDT DADTAYQVGE
MVRATVERLD VPHHTSLFQR VTVSIGVAST SPTEDASPAE LIEAADAALY AAKGRGRNTV
VEHGFIRATD TISMAIAS