Gene RPD_3707 details

Gene Information

Locus tag: RPD_3707
Symbol:
ID: 4024223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4136854
End bp: 4139862
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 68%
IMG OID: 637963911
Product: bifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_570829
Protein GI: 91978170
COG category: [C] Energy production and conversion
              [E] Amino acid transport and metabolism
COG ID: [COG0506] Proline dehydrogenase
        [COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase
TIGRFAM ID: [TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain)
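
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4136854, 4139862)
print(length)                           # 3009
print(expected_protein_length(length))  # 1002
```

Both values match the Gene Length and Protein Length fields of this record.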


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.571968
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGCCTG ATTCGCCCGA TCCTGCCTTC ATGGCTCCCT ATGCGCCCGA CGACGCTGTG 
CTGGCGGCGG CGTTGATCGG CTCGGTTCCG ACCGACGCGG GGCACGAAGC ACGGATCGAC
CGCACGGCGA CCGAGCTGAT CGAAGCGATC CGCGCCGGCG ACGATCGCCT CGGTAGCGTC
GAGCAGATGC TTCGCGAATT TGCACTGTCG ACCAAAGAGG GGCTGGCGCT GATGGTGCTG
GCCGAAGCGC TGCTGCGGGT GCCGGATGCT GCGACCGCCG ACGCTTTCAT CGAAGACAAG
CTCGCGCAGG GCGACTTCGC GCACCACCGC GTCAAGTCCG AGGCGCTGCT GGTCAACGCC
TCGGCCTGGG CGCTGGGCCT CTCGGCGCGC GTGGTGCAGT CCGGCGACAC GCCGCAGGGG
ACGCTGGCGG CGCTGGCGAA GCGGATCGGG GCGCCAGCGG TGCGCGCCGC GACGCGGCAG
GCGATGCGGC TGATCGGCAA TCACTTCGTA CTCGGCGAAA CGATCGACGC AGCGCTGCAC
CGTGCGCAAG CGGATGCGGC CGATGGCGTG CGCTATTCCT ACGACATGCT CGGCGAAGGT
GCACGCACGC AAGCCGATGC CGAGCGCTAT TTCGATTCCT ATGCACGGGC GATCGAGGCG
ATCGGCAGCA AAGCCGGCAA CGCGCCGCTG CCGTCACGGC CGGGCATTTC GGTGAAACTG
TCGGCATTGC ATCCGCGCTT CGAGGCGGTG AGCCGCGACC GCGTGCTGGC CGAGTTGACA
CCGCTTCTGA TCGACCTGGC GCGCAAGGCG AAAGCCCACG ACCTCGCCTT CACAATCGAC
GCCGAAGAGG CCGACCGGCT CGAACTCTCG CTCGACATCT TCGCCGCCTG CTGCGCCGAT
CCGTCGCTCG CCGGGTGGGA CGGCTACGGG CTCGCCGTCC AAGCCTATCA GAAGCGCGCG
GCGGCCGTCA TCGACTACGT TGACACGCTC GCGCAGCGAC TGAACCGGCG TCTGATGCTG
CGGCTGGTGA AGGGCGCCTA TTGGGACACC GAGATCAAGC GCGCGCAGGA GCGCGGGCTC
GACGACTATC CGGTGTTCAC GCGCAAGGCG ATGACCGATC TCAACTACAT GCATTGCGCG
GACAAGCTGC TTCGCTTGCG GCCGCGGCTA TTCCCGCAAT TCGCCAGCCA CAATGCGCTG
ACGGTCGCGA CTGTGCTCGA GCTCGCCGGC AATGGCGAGG GCTATGAATT CCAACGCCTG
CACGGCATGG GCGAGGCGCT GTATGCGCGG CTTCTCGCCG AATATCCCGG ATTGGTGTGC
CGGATCTATG CGCCGGTCGG CGGCCACCGC GACCTGCTCG CCTATCTGGT GCGTCGGCTG
CTGGAGAACG GCGCCAATTC GTCGTTCGTG GCGCAGGTCG GCGACGAGCA CGTAACGGTC
GCGGAATTGC TCCGACGGCC CGCCGAGATC ATCGGGCGGC CCGAGCAGGC GCGAAACCCG
CACCTTCCTT TGCCGCGCGA CCTGTATCGG CCGCAGCGTG AGAATTCGCG CGGCATCGAA
TTTGGCGATC GCGCGGCGCT GCAACGACTG GTGAGCGAGA TCGCTACAGT GCGGCAAGCG
CTGCCGAAGA TCACATCCGC AACACCAGAG ATCGCCGCAG CAGCCGTTGC AACCGCGCGA
ACCGGCTTCG TGGAGTGGAG CCGCACGCCG GCGGCGACGC GGGCAAGCGC GCTGGAGCGC
GCGGCAGATC TGTTGGACCG TCGACGCGCG CATTTCATCG CTCTGCTGCA GGACGAAGGC
CGCAAGACGC TCGACGATTG CATCTCGGAA GTGCGCGAGG CGATCGATTA CTGCCGATAC
TATGCGGCCG AGGGCCGCCG CCTGTTCGGC GACGGCATGG CAATGCCAGG ACCGACCGGC
GAGAGCAACA CACTGCGTCT GCGCGGACGC GGCGTCATCG TCGCAATCTC GCCGTGGAAT
TTTCCGCTGG CGATCTTTGT GGGCCAGATC GCCGCGGCGC TGATGGCCGG CAACTCGGTG
GTCGCCAAGC CGGCGGAGCA GACGCCGGTG ATCGCCGACG CCGCCGTGCA ACTGTTGCAC
GAGGCCGGCG TGCCGCACGC GGCGCTGCAA CTCGTACAGG GCGACGGCGC GATCGGCGCA
GCGCTGGTCG CGCATCGCGA CGTCGCCGGC GTGGTATTCA CCGGCTCGAC CGAGGCGGCG
CTGGCGATCA ACCGGGCGCT CGCCGCCAAG GACGGCCCGA TCGTGCCGCT GATCGCCGAG
ACCGGCGGCA TCAATGCGAT GATCGTGGAT GCGACGGCTC TGCCCGAGCA GGTCGCCGAC
GACGTGGTGA CTTCGGCGTT CCGCTCCGCC GGCCAACGCT GCTCGGCGCT GCGGCTGTTG
TTCGTGCAGG ACGATGTCGC GGACCGGATG ATCGAGATGA TCGCCGGCAG CGCCCGCGAA
CTACGCATCG GCGACCCGCG CGATCCAGCG ACGCAGATCG GCCCTGTGAT CGACGCCGAG
GCGAAGGCGA AGCTCGACGC GCGTATCGCC CGAATGACGC GCGAGGCGCG GGTTCATTTG
GCCGGCACCG CCCCGGTGGA CGGCAATTAC GTCGCGCCGC AGATTTTCGA ACTGCGCGAC
GCGACGCAGC TCTGCGAAGA AGTCTTCGGC CCGATCCTGC ATGTCGTGCG CTATCCGGCT
TCCGGCCTCG ACGACATGCT CGAAACGATC GCACGCAGCG GCTACGCACT GACGCTCGGC
ATCCACTCGC GGATCGACGA CACCATCGCA CGCATCATCG ATCGGCTCGC GATCGGAAAC
GTCTACGTCA ACCGCAACAT GATCGGCGCC GTCGTCGGCG TGCAGCCATT TGGCGGCTCG
GGACACTCCG GCACAGGCCC GAAGGCCGGC GGCCCGAATT ATCTGCCGCG CTTCGCGCTG
GAGCAGACGG TGTCGATCAA CACGGCCGCA GCCGGCGGCA ACGCTGCGCT GCTGTCGGGC
GGCGAGTAA
 
Protein sequence
MPPDSPDPAF MAPYAPDDAV LAAALIGSVP TDAGHEARID RTATELIEAI RAGDDRLGSV 
EQMLREFALS TKEGLALMVL AEALLRVPDA ATADAFIEDK LAQGDFAHHR VKSEALLVNA
SAWALGLSAR VVQSGDTPQG TLAALAKRIG APAVRAATRQ AMRLIGNHFV LGETIDAALH
RAQADAADGV RYSYDMLGEG ARTQADAERY FDSYARAIEA IGSKAGNAPL PSRPGISVKL
SALHPRFEAV SRDRVLAELT PLLIDLARKA KAHDLAFTID AEEADRLELS LDIFAACCAD
PSLAGWDGYG LAVQAYQKRA AAVIDYVDTL AQRLNRRLML RLVKGAYWDT EIKRAQERGL
DDYPVFTRKA MTDLNYMHCA DKLLRLRPRL FPQFASHNAL TVATVLELAG NGEGYEFQRL
HGMGEALYAR LLAEYPGLVC RIYAPVGGHR DLLAYLVRRL LENGANSSFV AQVGDEHVTV
AELLRRPAEI IGRPEQARNP HLPLPRDLYR PQRENSRGIE FGDRAALQRL VSEIATVRQA
LPKITSATPE IAAAAVATAR TGFVEWSRTP AATRASALER AADLLDRRRA HFIALLQDEG
RKTLDDCISE VREAIDYCRY YAAEGRRLFG DGMAMPGPTG ESNTLRLRGR GVIVAISPWN
FPLAIFVGQI AAALMAGNSV VAKPAEQTPV IADAAVQLLH EAGVPHAALQ LVQGDGAIGA
ALVAHRDVAG VVFTGSTEAA LAINRALAAK DGPIVPLIAE TGGINAMIVD ATALPEQVAD
DVVTSAFRSA GQRCSALRLL FVQDDVADRM IEMIAGSARE LRIGDPRDPA TQIGPVIDAE
AKAKLDARIA RMTREARVHL AGTAPVDGNY VAPQIFELRD ATQLCEEVFG PILHVVRYPA
SGLDDMLETI ARSGYALTLG IHSRIDDTIA RIIDRLAIGN VYVNRNMIGA VVGVQPFGGS
GHSGTGPKAG GPNYLPRFAL EQTVSINTAA AGGNAALLSG GE
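
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC content. The following is a minimal sketch using only the Python standard library. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from this record.
first_bases = "ATGCCGCCTG ATTCGCCCGA TCCTGCCTTC"
print(translate(first_bases))  # MPPDSPDPAF
```

The translated residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.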