Gene RPB_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3197 
Symbol 
ID3910998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3655601 
End bp3656677 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content70% 
IMG OID637885099 
ProductPpx/GppA phosphatase 
Protein accessionYP_486804 
Protein GI86750308 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.366219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.365881 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAGG AGACGCGGCT CCGCGACGGC CTGATGCCCG GCGGAGACAC GCAGCATGAG 
CACGGGTGTG TTGCGAACGC CGAGGAAGAG CATGGGTCGA CGGCGCCGCA TGGCGGCGTC
TATGCCGCGC TCGACCTCGG CACCAATAAT TGCCGGCTGC TGATCGCGCG GCCGACCGGC
GACGGCTTTC GCGTGGTCGA TTCGTTTTCC CGGATCATCC GGCTCGGCGA GGGCGTCTCT
GCCACGGGGC GGATCAGCGA TGCGGCGATC GCTCGGGCGA TCTCCGCGCT GTCGATCTGC
CGTGACAAGA TCGATCAGCG CAAGGCGAAG CGGCTGCGGC TGATCGCGAC CGAGGCCTGC
CGTGCCGCCG TCAATGCCGA TGCGTTCTGT GACGCCGTCG CACACGCCAC CGGCATCCGT
CTCGAAATCA TCGATCGCGA GACCGAGGCG CGGCTGGCGG CGATCGGCTG TTCGCCGCTG
GTCGATACCG CCGGGCGCGG CGCGATCCTG TTCGATATCG GCGGCGGCTC CAGCGAATTG
GTGCGGCTCG CGCGCGATCC GGCGCGGCCG GACCTGCCGC CGCGGATCCG GGCCTGGATG
TCGATTCCGC TCGGCGTGGT GACGCTGGCC GAGCAGTTCG GCGGCAAGGT GGTGACCGCG
GACAGCTATG CGGCGATGAT CGCGGAGGTC GCCAGGCACG TCGCGCCGTT CGCGGCCGCG
CATGGCGGCG ACCTCGGCGG CCTGCATCTG CTCGGCACCT CGGGCACGGT GACAACGCTC
GCGGGGCTGT ATCTCGACCT GATCCGCTAC GATCGCCGCC GCGTCGACGG CATCTGGATG
AGCGACGCGG AACTGACCGC GACGATCGAC CGGCTGCGCG GCATGAGCTA TCACGATCGC
GCCCAGAACC ATTGCATCGG CGCCGAGCGC GCCGACCTGG TGCTGGCCGG CTGCGCCATC
CTCGACGCGG TGCGTGCGGC GTTCCCGCTG CCGCGGCTGC GGGTCGCCGA TCGCGGCCTG
CGGGAGGGCA TGCTGGTCGA AATGATGCGC GAAGACGGCG TGCCGGGCGT GGCCTGA
 
Protein sequence
MDEETRLRDG LMPGGDTQHE HGCVANAEEE HGSTAPHGGV YAALDLGTNN CRLLIARPTG 
DGFRVVDSFS RIIRLGEGVS ATGRISDAAI ARAISALSIC RDKIDQRKAK RLRLIATEAC
RAAVNADAFC DAVAHATGIR LEIIDRETEA RLAAIGCSPL VDTAGRGAIL FDIGGGSSEL
VRLARDPARP DLPPRIRAWM SIPLGVVTLA EQFGGKVVTA DSYAAMIAEV ARHVAPFAAA
HGGDLGGLHL LGTSGTVTTL AGLYLDLIRY DRRRVDGIWM SDAELTATID RLRGMSYHDR
AQNHCIGAER ADLVLAGCAI LDAVRAAFPL PRLRVADRGL REGMLVEMMR EDGVPGVA