Gene RPB_0805 details


Gene Information

Locus tag: RPB_0805
Symbol:
ID: 3909620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 919005
End bp: 922715
Gene Length: 3711 bp
Protein Length: 1236 aa
Translation table: 11
GC content: 68%
IMG OID: 637882698
Product: glycerophosphoryl diester phosphodiesterase
Protein accession: YP_484427
Protein GI: 86747931
COG category: [C] Energy production and conversion
COG ID: [COG0584] Glycerophosphoryl diester phosphodiesterase
TIGRFAM ID:
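
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS includes one terminal stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 919005
END_BP = 922715
GENE_LENGTH_BP = 3711
PROTEIN_LENGTH_AA = 1236


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent
# under the assumptions stated above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

If either comparison fails for another record, the likely causes are a different coordinate convention or a partial CDS rather than an error in the sequence itself.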


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATA TCGTTCTTCT CGCGCATCGC GGCAGCAACC CCTATCCGGA TCATTCGCGC 
GACGCCTATG TCTGGGCGAT CGACTGCGGC GCCGACTTCA TCGAGCCGGA TCTCTATCTG
ACCAAGGACG GCGTGCTGGT CTCCAGCCAC GACAACCACA ATTATTCCAA TCTGAGCTAC
GCCGAGGCGA AAGCCCTCGA GCCGTCGCTG CTGACGTTCG GCGAGATCAT CGAGCTCGTG
AAGGCGATGT CGATCGAGAC CGGCCGCGAC ATCGGCATCG TTCCCGAGAC CAAGAGCACC
GACTACGCCA CCAGCGAAGC CGTGATCAAG GAGTTGATCG CCCACGACTT CACCGATCCG
GACCGGGTCG TGATCCAGAG TTTTGCGTCG ACCAATTTGC AGCAATTGCA CGACACCATC
ATGCCGCAAT ACGGCGTCGA CATCCCGCTG GCCTATCTCG GCAGCGGCAT TGCGAATCCG
GGCCAGATCG CGACCTTTGC GGACTACGCC GCGCCCAGCG TCGGCTCGTT CACCGCGGCC
GACGTCGCGG CCGCGCATGC CGCCGGCCTC AAGGTGGTGG CCTGGACGAT TCTCGGGGCC
CGGTCCGACA TCCAGAGCCT GATCGACATG GGCGTGGACG CGGTCTTCGT CGACGATACC
CGGCTCGCCC GCGCCAGCAT CGAGGCGATC GCCGGCGCCA ACGTCGTCTA CGGAACGCCG
GAAATCGACG GCGCCTCCGG CACCGCCGGC AACGACGTGG TCTACGCCAT GCAGGGCGAT
GACATCGTCT GGTCCGGGGC CGGCGACGAT CTGGTCTATG GCGACGGCGG CGACGACGCT
CTGTTCGGCG GCGCCGGCAA CGACATCCTG GTCGGCGGTT CGGGCACCGA TCTGCTGTCC
GGCGACGCCG GTCGCGACGT TCTCGACGGC GGCGCCGGCA ACGATGTCGT GCTGGCGAGC
GGCGACACCG TGCTGTTCCG CCGTGGCTCG GGCATCGACC TTGTCGCGCT CGACGCCGCC
AGCAGCATCG ACTTCCAGGA CATCGACTCG CGCGCCATCA CGGTGATACG CGACGGCGCC
GATCTGATCG TCCGCATCGG CGACGACGCG CTGGTGATCC GTAACGGCGC CGGCAATGCC
GCGAGCCTGC CCGGCGCGGT GAGTTTTGCC GACGGCGTGA CGCTCACCGC CACCGAGCTG
CTGGCGCGCG CCACGAGCGG CACCGACGCC GGCGTCACCG CCGCGCTGCC GGCGCTCGAA
CAGCTGCTCG CCGCTGCGCC CGATCTCGCC GTCGAGCCCC CGGTGGTCGT CGAGACCAAC
CTCATCGTCA ATGGCGGCTT CGAGGATCTG ACCGGGGCCA ACAACGGAGC GAGTTGGGGC
TATCGCAACA CCAATCCGGC CGGCGTCATT CCCGGCTGGG TCAACCGCGG TGACACCCGC
GCGGAAGTCC ACAAGGATAC GGTCGGCGGC ATCGGCGCGG CGGAAGGAAC CTATTGGTTC
GACCTGGAAG GCGCGCCCAC CAACGCCAAA CTGGTGCAGA CCGTCGCCGG CGTCGAACAG
GGCGCGACCT ATCAGCTCAG CTTCAGGATC GCCGACACCG ACACCGCGCA GACGACCGAC
TCCGTCAAGG TCTATTGGGG CGGCGAACTG ATCTATACGG GAACGCCGAA GAACAAGTGG
CAGGAGATCA CCATCGACGT GATCGGCGGC GACGGTGACG GCTTCAACAC GCTGACCTTC
GAAAGCGTGA CGCCGAGTCC GAACGGCGCC GGCGTGGCGC TCGACGACGT GGCGCTGATC
CGGCTGCAGG AGAGCCCCAA TCTGATCGTG AACGGCAGCT TCGAGGACCT CACCGGCGCC
AACAACGGCA ATTGGAGCGG CGATTGGGGC TACCGCAACA ACAGCGGCGT CATTCCGGGT
TGGACCCAGG TCGAAACCTC CGCCGGCGGT CGCGCCGAAC TGCACTTCGA CACCCAGAAC
GGCGTGTCGG CCGCGGACGG CAATGTCTGG TTCGATATGG ACGGCAACGG CAACAACGCC
AGGCTGGTGC AGACCGTCGC CGGCGTCGAG GCCGGCGCCA CCTACCGGCT GACCTTTTCG
ATCGCCGACG CCGACGCCAG CACCACCGAT GACGGCGTGC GCGTCTATTG GGGCGGCCAG
GTCGTGTATG AAGGTGTGCC GACCAGCATC TGGCAGAAAA TCACGATCGA GGTCGTGGGC
AATGCCGGCG ACGGAACCAA TCAGCTGATC TTCCAGGGCA CCGAAACCAG CCTGAACGGC
TACGGCGCCG CGCTCGACGA TATTTCGCTG CGCAAGATCG CCGATGCGCC GCCGCCCAAC
ACCGCGCCGG TCGCGGCCGA CGACGGCGCT CCGGCGACCG ACTTTGGTGC GGCGCTGACC
ATCGCCGCCG CCACCTTGCT GGCCAATGAT ACGGATGCCG ACGGCGACGC GCTGGTGATC
CTGTCGGTGG CGGCCGGCGT CGGCGGCACG GTCGCGCTGG ACGCCGACCG CAATGTCGTG
TTCACCCCGG CCGAAGGCTT TTCGGGCGAG GCGTCGTTCA GCTATGTGGC ATCCGACGGC
CGAGGCGGCA CCGCCACGGC GGACGTCACC GTCGTGGTGG CGCGGCGGGT GCTCTCGGGC
ACGCCCGGCG ACGACGTGAT CATCAGCACG TCCGGCGACG ACGTGATCGA CGGTGGCGAT
GGCGTCGATA CCGTGAGCTA TGCGGCTTCG GCCGCCGGCG TCGACGTCGA CCTTGCGGCC
GGCGTCGCCT CCGGTGACGG CAACGATACG CTGTCGAGCA TCGAGTCGGT GATCGGCTCG
GCGCATGACG ACCGGCTGAG CGGCAACGAC GCCGCCAACC TGCTCGACGG CGGCGACGGC
GACGACATCC TGTCCGGCGG TCTCGGCAAC GACGTCCTCA ACGGCGGTCT CGGCAATGAC
ATCATCACCG GCGGCGCCGG TGACGACACC ATCGACGGCG GCGCGGGCTT CGACACGCTC
GACCTGTCGG AGGCCACCGG GGCGGTGACG CTCAATCTGG TGAGCGGCAC CGTCAGCGGC
GCCGGCATCG GCACCGATCA CTTCAGCTCG ATCGAGAGCT TCGTGTTCGG TAGCGGCAAC
GACGTTATCA CCGGCGGCAA CGGCGACGAC AGCCTCGACG GCGGCGCCGG CAACGACGCG
ATCGACGGCG GCAACGGCAA TGACACGCTC TCCGGCGGCG AAGGCAACGA CGCGATCGAC
GGCGGTTCGG GCAACGACAT CGTGGATGGC GGCCTCGGCA ACGACACGCT GAAGGGCGGT
TCGGGCAACG ACGTCATCGC GGCCGGCGAC GGCGACGACA ATGTCGATGC CGGCTCCGGC
GACGACATCG TCACCGGCGG TGCCGGCAAC GACACGCTGA AGGGCGGGTC GGGCGCCGAC
ATCATCACCG GCGGCGCCGG CAACGACATC CTGACCGGCG GTTCCGGCGC GGACGTCTTC
GTGTTCGCGG CCGGCTTCGG CAACGACACC GTCACCGACT TCGCCACCAC GGGGTCGTCG
GCCGATCTGC TGCAGTTCTC CAGCGACATG TTCGCCGACT TCGCCGACGT GATGGCGCAC
ACCGCGCAGG TCGGCAGCAG CGTGGTGGTC ACGCTGGACG CCGACACCAG CATCACGCTG
GCCAACGTCC AGATGACCTC GCTCGCCGCC GACGACTTCC GCTTCGTCTG A
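
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup, not part of the original record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record. Paste the full
# block here to compare against the Gene Length and GC content fields.
first_row = (
    "ATGTCTGATA TCGTTCTTCT CGCGCATCGC "
    "GGCAGCAACC CCTATCCGGA TCATTCGCGC"
)

seq = clean_sequence(first_row)
print(len(seq))
print(round(gc_fraction(seq) * 100))
```

The record reports GC content as a whole percentage, so a value computed from the full sequence would be compared after rounding.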
 
Protein sequence
MSDIVLLAHR GSNPYPDHSR DAYVWAIDCG ADFIEPDLYL TKDGVLVSSH DNHNYSNLSY 
AEAKALEPSL LTFGEIIELV KAMSIETGRD IGIVPETKST DYATSEAVIK ELIAHDFTDP
DRVVIQSFAS TNLQQLHDTI MPQYGVDIPL AYLGSGIANP GQIATFADYA APSVGSFTAA
DVAAAHAAGL KVVAWTILGA RSDIQSLIDM GVDAVFVDDT RLARASIEAI AGANVVYGTP
EIDGASGTAG NDVVYAMQGD DIVWSGAGDD LVYGDGGDDA LFGGAGNDIL VGGSGTDLLS
GDAGRDVLDG GAGNDVVLAS GDTVLFRRGS GIDLVALDAA SSIDFQDIDS RAITVIRDGA
DLIVRIGDDA LVIRNGAGNA ASLPGAVSFA DGVTLTATEL LARATSGTDA GVTAALPALE
QLLAAAPDLA VEPPVVVETN LIVNGGFEDL TGANNGASWG YRNTNPAGVI PGWVNRGDTR
AEVHKDTVGG IGAAEGTYWF DLEGAPTNAK LVQTVAGVEQ GATYQLSFRI ADTDTAQTTD
SVKVYWGGEL IYTGTPKNKW QEITIDVIGG DGDGFNTLTF ESVTPSPNGA GVALDDVALI
RLQESPNLIV NGSFEDLTGA NNGNWSGDWG YRNNSGVIPG WTQVETSAGG RAELHFDTQN
GVSAADGNVW FDMDGNGNNA RLVQTVAGVE AGATYRLTFS IADADASTTD DGVRVYWGGQ
VVYEGVPTSI WQKITIEVVG NAGDGTNQLI FQGTETSLNG YGAALDDISL RKIADAPPPN
TAPVAADDGA PATDFGAALT IAAATLLAND TDADGDALVI LSVAAGVGGT VALDADRNVV
FTPAEGFSGE ASFSYVASDG RGGTATADVT VVVARRVLSG TPGDDVIIST SGDDVIDGGD
GVDTVSYAAS AAGVDVDLAA GVASGDGNDT LSSIESVIGS AHDDRLSGND AANLLDGGDG
DDILSGGLGN DVLNGGLGND IITGGAGDDT IDGGAGFDTL DLSEATGAVT LNLVSGTVSG
AGIGTDHFSS IESFVFGSGN DVITGGNGDD SLDGGAGNDA IDGGNGNDTL SGGEGNDAID
GGSGNDIVDG GLGNDTLKGG SGNDVIAAGD GDDNVDAGSG DDIVTGGAGN DTLKGGSGAD
IITGGAGNDI LTGGSGADVF VFAAGFGNDT VTDFATTGSS ADLLQFSSDM FADFADVMAH
TAQVGSSVVV TLDADTSITL ANVQMTSLAA DDFRFV