Gene RPB_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3946 
Symbol 
ID3911753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4503337 
End bp4506348 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content71% 
IMG OID637885850 
Productbifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase 
Protein accessionYP_487550 
Protein GI86751054 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0506] Proline dehydrogenase
[COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase 
TIGRFAM ID[TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCTG ATTCGCCGCA TCCGGAATTC GACGCTGCCT ACGCGCCCGA CGACGCCACG 
CTCGCGGCGG CGCTGATCGC GGTGATGCCG CGCGACCCCG CGCGCGAGGC GCCGATCGAC
ACGACCGCCA CCGGACTGAT CGCAGCGATC CGTGCCGGCG ATCATCATCT CGGCAGCGTC
GAGGCGATGC TGCGCGAATA CGCGCTATCC AGCAAAGAGG GGCTTGCGCT GATGGTGCTG
GCCGAGGCGC TGCTGCGGGT GCCGGACGCG GCGACCGCCG ACGCCTTCAT CGAGGACCGG
CTGGGCCAGG GCGATTTCGC GCATCACCGC ATCAAGTCCG ATGCGCTGCT GGTCAACGCC
TCGGCCTGGG CGCTCGGCCT GTCGGCGCGC GTGGTGCATG CCGGCGACAC GCCGCAAAGC
ACGCTGGCAG CGCTGGCGAA GCGGATCGGC GCGCCGGCGG TGCGCGCGGC GACGCGGCAG
GCGATGCGGC TGCTCGGCAA TCATTTCGTG CTCGGCGAGA CCATCGACGA TGCGCTCGCC
CGCGCGCAGG CGGATGCCGC CGATGGCGTG CGCTATTCCT ACGACATGCT TGGCGAAGGT
GCGCGCACCC GGGCCGACGC CGAGCGCTAC TTCGCGTCCT ACGCGCAGGC GATCGACGCG
ATCGGCCGCG CCGCCGGCAA CGCGCGACTG CCGGCGCGGC CGGGCATCTC GGTGAAACTC
TCGGCGCTGC ATCCACGCTT CGAGGCGGTG AGCCGCGACC GCGTGCTGCG CGAGCTGACG
CCGCGCCTGA TCGAGCTGGC GCGCAAAGCC AAGGACCACG ACCTCGGCTT CACCGTCGAC
GCCGAGGAGG CCGACCGGCT CGAACTCTCG CTCGAGGTGT TCGCCGCCTG CCTCGCCGAT
CCTTCGCTCA AAGGCTGGGA CGGCTACGGC CTCGCGGTGC AGGCCTATCA GAAACGCGCA
TCCGCGGTGA TCGACTACGT CGATGCGCTG GCGCAGCGAC ACGATCGCCG GCTGATGCTG
CGGCTGGTGA AGGGCGCCTA TTGGGACACC GAGATCAAGC GCGCGCAGGA GCGCGGCCTT
GCCGACTATC CGGTGTTCAG CCGCAAGGCG ATGACCGACC TCAACTACCT GCATTGCGTG
CAGAAACTAC TCGGGCTGCG GCAACGGATC TTCCCGCAAT TCGCCAGCCA CAACGCGCTG
ACGGTCGCGA CCGTCCTCGA CCTCGCCGGC GACAGCGACG GCTACGAATT CCAGCGGCTG
CACGGCATGG GCGAGGCGCT GTATGCGCGG CTGCGCGCCG AGCATCCCGA GCTCACCTGC
CGGATCTACG CCCCGGTCGG CGGCCATCGC GATCTGCTCG CCTATCTGGT GCGGCGGCTG
CTGGAGAACG GCGCCAATTC GTCGTTCGTC GCGCAGGCCG GCGACGATAC CGTGCCGGTA
GAGGAGTTGC TGCGACGGCC GGAGGCGATC ATCGGGACGC CCGACAACGC CCGCCATCCG
CACATCCCGC TGCCGCGAGA CTTGTATCGG CCGCAGCGCG AAAACTCGCG CGGCGTCGAA
TTCGGCGACC GCGCCGCGCT GCGACGGCTG CTCAGTGAGA TCGACGCCGC GCGCCGGCCG
CTGCCGCGCG TCACTCCCGT AACGCCGGAG GATGCGGCGG CGGCGGCGGT GGCCGCCGCG
CGCGATGGCT TCGCCGCATG GAGCCGCACG CCTGCCGAGA CCCGCGCCGC CGCGCTGGAG
CGCGCCGCTG ATCTGTTGGA GCAGCGCCGC GCGAACTTCG TCGCGCTGCT GCAGGACGAA
GCCGGCAAGA CGCTCGACGA CTGCATCTCC GAGGTCCGCG AAGCGATCGA TTATTGCCGC
TACTACGCGG CCGAAGGTCG CCGGCTATTC GGCGACGGCG TCGCGCTGCC CGGCCCGACC
GGCGAGCGCA ACAGTCTGCG GCTGCGTGGC CGCGGCGTCT TCGTCGCGAT CTCGCCGTGG
AATTTTCCGC TGGCGATTTT CCTCGGCCAG ATCACCGCGG CGCTGATGGC CGGCAACGCC
GTGATCGCCA AGCCGGCGGA GCAGACGCCG GTGATCGGCG ACGCCGCCGT GGGGCTTTTG
CACGAGGCCG GCGTGCCGCG CGCGGCGCTG CAACTCGTGC AAGGCGACGG CGCGATCGGC
GCGGCGCTGG TGGCGCATCG CGACGTCGCC GGGGTCGTCT TCACCGGCTC GACAGACGTG
GCGCGCGCGA TCAACCGCGC GCTCGCCGCG AAGGACGGCC CGATCGTGCC CCTGATCGCC
GAGACCGGCG GCATCAATAC GATGATCGTC GACGCCACCG CGCTGCCCGA GCAGGTCGCC
GACGACGTCG TGACCTCGGC GTTCCGCTCC GCCGGCCAGC GCTGCTCGGC GCTGCGGCTG
TTGTTCGTGC AGGACGACGT CGCCGAGAGG ATGATCGAGA TGATCGCCGG CAGCGCCCGC
GAATTGAAGC TCGGCGACCC GCGCGATCCG GCGACGCATC TCGGCCCGGT GATCGACGCC
GAGGCCAAGG TCAGGCTCGA CGCGCATATT GCGGCGATGA CGCGCGAGGC GCGGCTGCAT
TTCGCAGGCG CCGCGCCGTC CACCGGCAAC TACGTGGCAC CGCACATCTT CGAACTGCGC
GACGCCGCGC AACTGACCGA GGAAGTGTTC GGCCCGATCC TGCACGTCGT GCGCTACAAG
GCCGCCCACC TCGACGCCGT GCTCGCTGGC ATCGCGGCGA GCGGCTACGC GCTGACGCTC
GGCGTGCAGT CGCGGATCGA CGACACGGTG GCGTGCATCG TCGACCGGCT CGCGATCGGC
AACGTCTACG TCAACCGCAA CATGATCGGC GCCGTCGTCG GCACGCAGCC GTTCGGCGGC
TCGGGCCTGT CCGGCACCGG GCCGAAGGCC GGCGGCCCGC ATTATCTGCC GCGCTTCACG
CTGGAACAGA CGGTGTCGAT CAACACCGCG GCGGCCGGCG GCAACGCTGC GTTGCTGGCC
GGCGACGAGT GA
 
Protein sequence
MPPDSPHPEF DAAYAPDDAT LAAALIAVMP RDPAREAPID TTATGLIAAI RAGDHHLGSV 
EAMLREYALS SKEGLALMVL AEALLRVPDA ATADAFIEDR LGQGDFAHHR IKSDALLVNA
SAWALGLSAR VVHAGDTPQS TLAALAKRIG APAVRAATRQ AMRLLGNHFV LGETIDDALA
RAQADAADGV RYSYDMLGEG ARTRADAERY FASYAQAIDA IGRAAGNARL PARPGISVKL
SALHPRFEAV SRDRVLRELT PRLIELARKA KDHDLGFTVD AEEADRLELS LEVFAACLAD
PSLKGWDGYG LAVQAYQKRA SAVIDYVDAL AQRHDRRLML RLVKGAYWDT EIKRAQERGL
ADYPVFSRKA MTDLNYLHCV QKLLGLRQRI FPQFASHNAL TVATVLDLAG DSDGYEFQRL
HGMGEALYAR LRAEHPELTC RIYAPVGGHR DLLAYLVRRL LENGANSSFV AQAGDDTVPV
EELLRRPEAI IGTPDNARHP HIPLPRDLYR PQRENSRGVE FGDRAALRRL LSEIDAARRP
LPRVTPVTPE DAAAAAVAAA RDGFAAWSRT PAETRAAALE RAADLLEQRR ANFVALLQDE
AGKTLDDCIS EVREAIDYCR YYAAEGRRLF GDGVALPGPT GERNSLRLRG RGVFVAISPW
NFPLAIFLGQ ITAALMAGNA VIAKPAEQTP VIGDAAVGLL HEAGVPRAAL QLVQGDGAIG
AALVAHRDVA GVVFTGSTDV ARAINRALAA KDGPIVPLIA ETGGINTMIV DATALPEQVA
DDVVTSAFRS AGQRCSALRL LFVQDDVAER MIEMIAGSAR ELKLGDPRDP ATHLGPVIDA
EAKVRLDAHI AAMTREARLH FAGAAPSTGN YVAPHIFELR DAAQLTEEVF GPILHVVRYK
AAHLDAVLAG IAASGYALTL GVQSRIDDTV ACIVDRLAIG NVYVNRNMIG AVVGTQPFGG
SGLSGTGPKA GGPHYLPRFT LEQTVSINTA AAGGNAALLA GDE