Gene RPD_4416 details


Gene Information

Locus tag: RPD_4416
Symbol:
ID: 4024941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4887560
End bp: 4888666
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 637964625
Product: metal-dependent phosphohydrolase
Protein accession: YP_571533
Protein GI: 91978874
COG category: [T] Signal transduction mechanisms
COG ID: [COG2206] HD-GYP domain
TIGRFAM ID:
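
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the record. With the values from this record it prints 1107 and 368.

```python
# Values copied from the record above.
START_BP = 4887560
END_BP = 4888666


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```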


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCAG TAGCCCGACA ATCCGCGCCC CAGTCCGAAC CGCGACGGCG GCTGCTGCTG 
GCGTCCGATC GCCGAGATCA GAGCGCCGAC CTCGCCCGGA TCCTGGCCGG CATCGCCGAG
ATCGAAACCA TCTCCACCGC GCAACTACCC GACGTGCCGT CGCAGAACCT GTCCGGGATC
GTCGTCGACA TCAATCTGCG GTCCGCGGAA AGCGTGCAGA TGGTCCGGCG CAAGCTGTTG
GGCGGCGGCT ATCAGCCGAT CCCGCGGCTG TTCGTGCTCG CCGACGAATT GCATCATGGC
TCGATGCAGG CCTGGGCGCT CGGTGCCACC GACACCATCG CGCGTCCGTT CGACCCGCGT
GACCTGCTGG CGCGGATCCG TGCTGCGTTC CCCGATCCCT CGGAGACGAC CGAAGCCGCG
CGCGCCGAGG CAATGAGCAA GGGCGTCGCG GCCGCGCACA GCGTATTGGT CAAGATCTTC
GACCGGCTGC CGGCGGGCCA GCCTTTGACG TATCACGACG TGATCCGGGC CGAGGCGCCG
ATCCTCAAGG CGATCAAGCG CTCGTCGCTG CGCGAATGGC TCGCCGTCGT CGGCCGTCAC
CACAACGAAA GCTACCGACT CGCTTTGTTC GCGACCGGCT ATGCCGTCGC CTTCGCCCAG
CATCTCGGTA TGCGCGAGGA AGATCAGCGT CGTCTGACCC GCGCTGCGCT GCTGTACGAC
GTCGGCAAGG CGTTCGTCGA CGTCGGTGTG CTCGACGATC TCGACGGTCT GCAGGGCGAA
CGCTTGCACA AATTCCGCGA GCATCCGCGC CGAGGCTACG AAGCGCTCGC CGCCGAGGGC
AGCTTTCCGC GAGAGACCCT CGATGTGATC CTGCATCATC ACGAGCTGCT TGACGGCTCG
GGCTATCCCG ATGCGCTGCA TGGCGACCAG ATCAGCGACA TCGTCCGCAT CACCACCATC
GTCGACATCT TCACCTCGCT GGTGGCGCCG CGCAAAAATC ACGTCCCGCT GATGCCGTTG
CACGCGTTCT CCCGGATGGA ATCGATGGGC GACAAGATCG ATCAGCGCCT GCTGCAGGCG
TTCCGCCCGG TCCCGCTCGG CGGCTAG
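
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and it is not necessarily how the GC content field in this record was produced. Only the first line of the sequence is used as an excerpt here; pasting the full sequence works the same way.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGAATGCAG TAGCCCGACA ATCCGCGCCC CAGTCCGAAC CGCGACGGCG GCTGCTGCTG"
seq = clean_sequence(excerpt)
print(len(seq), f"{gc_fraction(seq):.1%}")
```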
 
Protein sequence
MNAVARQSAP QSEPRRRLLL ASDRRDQSAD LARILAGIAE IETISTAQLP DVPSQNLSGI 
VVDINLRSAE SVQMVRRKLL GGGYQPIPRL FVLADELHHG SMQAWALGAT DTIARPFDPR
DLLARIRAAF PDPSETTEAA RAEAMSKGVA AAHSVLVKIF DRLPAGQPLT YHDVIRAEAP
ILKAIKRSSL REWLAVVGRH HNESYRLALF ATGYAVAFAQ HLGMREEDQR RLTRAALLYD
VGKAFVDVGV LDDLDGLQGE RLHKFREHPR RGYEALAAEG SFPRETLDVI LHHHELLDGS
GYPDALHGDQ ISDIVRITTI VDIFTSLVAP RKNHVPLMPL HAFSRMESMG DKIDQRLLQA
FRPVPLGG
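
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table by hand, assuming that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAATGCAG TAGCCCGACA ATCCGCGCCC"))  # MNAVARQSAP
```

The printed residues match the first block of the protein sequence, and translating the final line of the gene sequence gives FRPVPLGG followed by the TAG stop codon.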