Gene RPB_4638 details

Gene Information

Locus tag: RPB_4638
Symbol:
ID: 3912455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5242565
End bp: 5244391
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 64%
IMG OID: 637886542
Product: diguanylate cyclase/phosphodiesterase
Protein accession: YP_488232
Protein GI: 86751736
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
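
The coordinates and lengths listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5242565
END_BP = 5244391


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the sketch prints 1827 and 608, matching the Gene Length and Protein Length fields.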


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGTG CAGCCCCCCA CGAAATGTCC GGCGACGCGA CCGAATCGGA GCCCAGCGCC 
TCCGCAGGGA CTGCGTCCGG TGGGGATCTG CCGGCGTTTG CCGAAGACAT CGACTTTCAG
CTTCTGAAAG ACATCGTCCG CGGGCTCCCA TCCTTGGTCA CGGTACAAGA TCACCAGGGC
GAGTTTCTGA TCGTTAACGA TGCCGCCGCC GCCCGCTTCA ACCGGCCGGC CGGTCCAGGC
GCGACCGCCA CGCCCTGCGT CGCGCTCGAA CAGCGCCGTG CCGACGGCTT GGCTTTGCTC
GCCACCGGCC GCGGCGCGAT CTGCGAGGAA CGCGCCGGCG ACGGCGCGAC GGCGCGCACG
TTCCTGACCG CGCATCGGCC GATCGAGATC GGCGGGCGCC GGCTGCTGCT GTCGAGCTCC
CTCGACATGA CCGAACAGAA GGCGATCGAG GACGACCTGT TTCGCCGCGC CTATTTCGAC
GAGTTGACGG GGCTGCCGAA TCGCAGCGTG ATCGAACATC GCGCCGACAC CTTGCTGCGC
GCCGGGGGAA CGCCGAGCCG GTTCGCGCTG GCGTTCCTCG ACATCGATAA TTTCAAGCAC
ATCAACGATT ATTACGGCCA CGCGGTCGGC GATGCATTGT TGGTCGGGGT CGCCAAACGG
CTCGGCCTCG AACTGCGCGA GACCGACATG CTGTCGCGCA TCAGCGGCGA CGAATTCCTG
CTGTTGCTGA ATCCGATCGA GAGCCCGGCC GAGGTCGCCG ACCACATCGA TTTCCTGCTG
CAGCGACTGA AGGCGCCGTT CTACATCGAC GGGTCGGAAT TGTTCGCTTC GGCTTCGACC
GGCATCAGCC TGTATCCCGA GCACGGCCAC ACCTTCGACG CCTTGCGCCA GAACGCCGAC
ATTGCGATGT ACCGGGTCAA GATCGCGACG AAGGGCGCGG CCGCGATGTT CGACGTCGCG
ATGGAGCGCG AGGCGCTGGA ACGCATGCGC ATCGAGCAGT CACTGCGCCA GGCGATCCTC
GACAGGCGTT TCTGCTGCGC ATTCCAGCCC AAGGTCGACA TCCGGACCCG CGAGATCAAG
GGGATCGAGG CACTGGTCCG GCTGCGCGAT GACGACGGCG TCATCCAGGC GCCAGGCACT
TTCGTCGATC TGGCGGTCGA ACTCGGGCTG ATCGACGAAC TCACCCACCT GGTCCTCGCC
GAGATCATGA AATCGATCGA CCTGATCAAT GACGCATTCG GCACCCACGC CAGCATCAGC
ATCAACGTCG CGGCCAAGCA GGCCGGCAAT TCCCATTTCA TGCAGTCGTT TGCGGCAGCG
CTCGCGGCGA CCGAGTGCCC GTCGCGCTTC ATCATCGAGG TGACGGAAGA CGCTTTCGTC
ACGAAAAGTC ACTTCCAGAG CGAAATTCTG CCGATGCTGC GCGAGATCGG CGTCGGCATC
TCGATCGACG ATTTCGGCAT CGGCTACTCG TCACTGTCGG CGCTGGCGGA CATCACCGCG
GACGAAATCA AGATCGATCG ATCCTTCATC ACGGACATCC ACAAGCGGCC GCGCAGCCAG
GGCATCCTCA GGGCGATCGA GTCGCTGAGC GAGGCGCTCG GCATGACGGT GATTGCCGAA
GGCATCGAGA CCTTCGAAGA ACTCGCCTAT CTGCAGACTA TGACCCGGAT CCGCTACGCA
CAGGGCTATT ATTTCGCAAA GCCGGTGTTC CTCGAGGAGT TGAAGCCCTC GGCGCAAGGC
TTCGAGGGCG GGCGTCCACG CGAGTCAGGT CGCCAGATCG ACCAGGCGCG ACCGGTGATT
TCGCGCACCA ATGCCGGTCG CAGTTAG
 
Protein sequence
MSSAAPHEMS GDATESEPSA SAGTASGGDL PAFAEDIDFQ LLKDIVRGLP SLVTVQDHQG 
EFLIVNDAAA ARFNRPAGPG ATATPCVALE QRRADGLALL ATGRGAICEE RAGDGATART
FLTAHRPIEI GGRRLLLSSS LDMTEQKAIE DDLFRRAYFD ELTGLPNRSV IEHRADTLLR
AGGTPSRFAL AFLDIDNFKH INDYYGHAVG DALLVGVAKR LGLELRETDM LSRISGDEFL
LLLNPIESPA EVADHIDFLL QRLKAPFYID GSELFASAST GISLYPEHGH TFDALRQNAD
IAMYRVKIAT KGAAAMFDVA MEREALERMR IEQSLRQAIL DRRFCCAFQP KVDIRTREIK
GIEALVRLRD DDGVIQAPGT FVDLAVELGL IDELTHLVLA EIMKSIDLIN DAFGTHASIS
INVAAKQAGN SHFMQSFAAA LAATECPSRF IIEVTEDAFV TKSHFQSEIL PMLREIGVGI
SIDDFGIGYS SLSALADITA DEIKIDRSFI TDIHKRPRSQ GILRAIESLS EALGMTVIAE
GIETFEELAY LQTMTRIRYA QGYYFAKPVF LEELKPSAQG FEGGRPRESG RQIDQARPVI
SRTNAGRS
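
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last stretches of the gene. This is a minimal sketch, not the pipeline that produced the record: the codon table is written out by hand in the conventional TCAG order, and the helper name `translate` is hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard assignments are used here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 27 nucleotides of the gene sequence above.
head = "ATGTCCAGTG CAGCCCCCCA CGAAATGTCC"
tail = "TCGCGCACCA ATGCCGGTCG CAGTTAG"
print(translate(head))
print(translate(tail))
```

The two printed fragments, MSSAAPHEMS and SRTNAGRS, correspond to the start and end of the protein sequence listed above. The final TAG codon is a stop codon and contributes no residue.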