Gene Rsph17025_4279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4279 
Symbol 
ID5086457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009431 
Strand
Start bp37982 
End bp40792 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content70% 
IMG OID640485837 
Productdiguanylate phosphodiesterase 
Protein accessionYP_001170431 
Protein GI146280275 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG CCTGCGACCC TTCGATCTGC GAGACGGAAC CCATCGCAAC GCCCGGCGCG 
ATCCAGCCGC ACGGCGCGCT GGTCACGGCG CAGGCCGACA GCGGCCTCGT CGCCCACGCC
AGCGCCAACC TGGCCGAGAT CCTCGGCCTG CCGGCGGCCG CGGCCCTGGG CCGTCCCATC
TGCGACGTGA TCGGGCACGT CAACGAGGTC CTGCTGCGCG AGGCGCGCCG TCGCGGCTCC
GAGACACTGG AGTCGATCGG ATCGCTGCGC CGGGCGGACG GGCGGCTGCT GCATCTCCAT
GCCTTCCAGT CGGGCGACTA CATGTGCCTC GACCTCGAGG CGGTGCGCGA CGAAGACGAC
CGGCTGCCGC CGGGGCTCAC CCAATCGGTG ATCGAGACCT TCTCGGCCGC CGTGACGCAG
GTCGAGCTGT GCGAGCTTGC GGTGCATGGG CTGCGGCAGG TGACGGGCTA TGACCGGGTG
ATGGCCTATC GCTTCGGACC GGACGGCCAT GGCGAGGTCA TCGCCGAGGA CCTCAGGCCG
GATCTCGAGC CCTACCTCGG CCTGCGCTAC CCGGCCTCGG ACATTCCGCA GATCGCGCGG
GCCCTCTACC TGCGCCAGAG GGTGGGCTCG ATCGCGGATT CGCACTACGA GCCGGTGCCG
CTGCTGGGCC ATCCCGGGAT CGACGCCCTC GACCTGACGC ACAGCGCCCT GCGCAGCGTC
TCGCCGATCC ACCTCGAATA CATGCGGAAC ATGCACACGG CCGCCAGCCT GACCATCGGA
CTGGCCGATG GCGACAGGCT GTGGGGGATG CTGGTCTGCC ACAACATGAC CCCCCGGACC
GCCGGCCCCG AGCGTCGGGC GGCGGCGGGC ATGATCGGTC AGGTGGTCTC GCTGCTGCTG
AGCCGGCTGG GCGAGGTCGA GAATGCCGCC GAGAAGCTGG CCCGGCAGGC GACGCTCTCG
ACGCTGGTCG ATCGGCTGTC GATCGGGGAC ACGCTGGCCG CGGCGATCAC CGCGGCCGAT
CCGCTGCTGC TCGAACTGGT CGGGGCCAGC GGGGCCGTCG TCCGACTGGA AGGGCAGGAA
CTGCATTTCG GGCGGATACC GCCCGCCGAC GCGGTGCGCA GGGCCCTGGA GATCCTGGGC
CCCACCCGGC CGCCAGAGAT CCTGGCCATC GACGATGTCA CCCTTCGCCA TCCCGAACTG
CCGGAACTGG CCGCGGCGGG AAGCGGCCTC CTCCTGCTCC CCCTGCCCTC GGGCGACGGA
GACCTGATCG CCTGGTTCCG CCCCGAGCAT GTGCAGACCA TCACCTGGGG CGGCAATCCG
GCCGAACATG GGACCTGGAA CCCGCAAACC CGGCGGATGA GCCCGCGCGC CTCGTTCGAC
GCCTGGAAGG AGACGGTCAC CGGCCGCTCG CTTCCCTGGA CCTCGATCGA ACTGGCCTGC
GCCCATGACC TGGGTGAGGC CATCACCGCC GAGATGGCGC AGCGCACCCG GGCGGCGCTC
GCGCGCCTGC GCCACTACGA TCCGCTGACG GGCCTTGCGA ACCGCAGCTT CCTCCAGGAA
TGTCTGGCCA ACGCCACGCT GGCCGGCGGG CCGGATGTGG CCCTGCTCTT CATCGACCTC
GACCGGTTCA AGGCGGTGAA CGACTCCATG GGCCACGGCG TGGGCGACGG GCTGCTGATC
GAAGTGGCGC ACAGCCTCGT GGCGAGCGTG CGTCCCGAGG ATCTGGTGGT CCGGCTGGGC
GGCGACGAGT TCGTGGTCCT GTGCCACGGG CTGGACAGGG CCGCGGTGAC GGGACTGGCC
GAGCGGTTGC GGCAAGTGCT CGATCAGCCC TTCGAGGTCA CCGGCCGCAA GTGCCACATC
TCGGCCAGCA TCGGCATCGC GATGTCCGAC AGCATCGGCG AGCTCGATCT GGTGCGGGCG
GCCGACATTG CGATGTATGC GGCCAAGAAG AACGGCGGCA ACCGGGGCGA GCTGTTCCGT
CCCTCGCTCT ACGAAGAGAC GACGCAGCTT GTCGAGCTTG ACAACGACCT GCGCAATGCG
CTCGAGAACG GCCAGTTCCA TCTGGCCTAC CAGCCGATCT TCGCCCTGGC CCCCGGGACG
GAGCGGCTGG TGGGCTTCGA GGCCCTGCTG CGCTGGGAGC ATCCGCACTA CGGCTCCCTT
CAGCCCGGGG TCTTCATCCC GATGGCGGAA AAGCTGGGTC ACATCCATGT GCTGGGCGAC
TGGGCGCTGC GCAACGCCCT GCGGCAGGTG CAGGCGTTCC GGTCGGCCGG CCCGGAGCTG
GATCTGAAGA TCAATGTGAA CGTCTCGCCC CTGCAGCTGG CGAAGCCGGG GTTTGCCGCG
CGTCTGACGG ACATGCTGGG GCAGATGCCG GACCTGCCGT CCCACGCGCT CTGCCTCGAG
ATCACCGAGA CCTGCCTGAG CGACGAGGCC GTCTCCGAGG CTCTGGGCGC GATCCGGGCG
CTGGGGGTTC AGGTCGCCAT CGACGACTTC GGCACGGGCT TCTCGTCGCT GGCCTGCTTG
CGCCGCCTGC CCGCGGATAT CGTGAAACTG GACCGCGCCT TCCTGAAGAA TTCCAGCGCC
GAGCCGCAGG ACGACAGGTT CTTCGCGGCG GTCAACAGCC TCATCCATGC GGTGGATCTG
AAGGTGGTGC AGGAGGGGGT CGAGACGCCC GCACAGATCG ACTTCATTCG CGCCACGGGG
GCCGATTTCG CCCAGGGCTT CCACCTCGCC CGCCCTCTCT CCATCCCGGC CGCCCTCGAC
CTGATCTCGG CCTCGCGTCC GGGCCGGCCG CGCCCGCCGG ATCGGACGTG A
 
Protein sequence
MTAACDPSIC ETEPIATPGA IQPHGALVTA QADSGLVAHA SANLAEILGL PAAAALGRPI 
CDVIGHVNEV LLREARRRGS ETLESIGSLR RADGRLLHLH AFQSGDYMCL DLEAVRDEDD
RLPPGLTQSV IETFSAAVTQ VELCELAVHG LRQVTGYDRV MAYRFGPDGH GEVIAEDLRP
DLEPYLGLRY PASDIPQIAR ALYLRQRVGS IADSHYEPVP LLGHPGIDAL DLTHSALRSV
SPIHLEYMRN MHTAASLTIG LADGDRLWGM LVCHNMTPRT AGPERRAAAG MIGQVVSLLL
SRLGEVENAA EKLARQATLS TLVDRLSIGD TLAAAITAAD PLLLELVGAS GAVVRLEGQE
LHFGRIPPAD AVRRALEILG PTRPPEILAI DDVTLRHPEL PELAAAGSGL LLLPLPSGDG
DLIAWFRPEH VQTITWGGNP AEHGTWNPQT RRMSPRASFD AWKETVTGRS LPWTSIELAC
AHDLGEAITA EMAQRTRAAL ARLRHYDPLT GLANRSFLQE CLANATLAGG PDVALLFIDL
DRFKAVNDSM GHGVGDGLLI EVAHSLVASV RPEDLVVRLG GDEFVVLCHG LDRAAVTGLA
ERLRQVLDQP FEVTGRKCHI SASIGIAMSD SIGELDLVRA ADIAMYAAKK NGGNRGELFR
PSLYEETTQL VELDNDLRNA LENGQFHLAY QPIFALAPGT ERLVGFEALL RWEHPHYGSL
QPGVFIPMAE KLGHIHVLGD WALRNALRQV QAFRSAGPEL DLKINVNVSP LQLAKPGFAA
RLTDMLGQMP DLPSHALCLE ITETCLSDEA VSEALGAIRA LGVQVAIDDF GTGFSSLACL
RRLPADIVKL DRAFLKNSSA EPQDDRFFAA VNSLIHAVDL KVVQEGVETP AQIDFIRATG
ADFAQGFHLA RPLSIPAALD LISASRPGRP RPPDRT