Gene RPC_4274 details


Gene Information

Locus tag: RPC_4274
Symbol:
ID: 3971697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4758981
End bp: 4762067
Gene Length: 3087 bp
Protein Length: 1028 aa
Translation table: 11
GC content: 69%
IMG OID: 637927378
Product: bifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_534117
Protein GI: 90425747
COG category: [C] Energy production and conversion
              [E] Amino acid transport and metabolism
COG ID: [COG0506] Proline dehydrogenase
        [COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase
TIGRFAM ID: [TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain)
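
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = gene_span(4758981, 4762067)
print(span)                           # 3087, matches "Gene Length"
print(expected_protein_length(span))  # 1028, matches "Protein Length"
```

Both results agree with the listed gene length (3087 bp) and protein length (1028 aa).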


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.752611
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGACA ATTCTGCTCT TCCCGTATTC AGCGCGCCCT ATGCCGCCGA CGACGGCGAT 
CTTGCCGCCG CGCTGTTGCC GACCGCGCGG CTGTCGCCCG AGCGCGAGGC CCGGATCGAC
GCCAGCGCAA CGCGGCTGAT CGCGGCGATC CGCGCCGACC ATGACCGGCT CGGCGGCGTC
GAGGAAATGT TGCGCGAATT CGCGTTGTCC ACCAAGGAGG GCGTCGCGCT GATGGTGCTG
GCGGAGGCGC TGTTGCGGGT GCCGGACGCG CTCACCGCCG ACCAGTTCAT CGAGGACAAG
CTCGGCCAGG GCGATTTCGA ACACCACGCC ACCCGGTCCG GCGCGCTGCT GGTCAACGCC
TCGGCCTGGG CGCTGGGGAT TTCCGCGCGG GTCGTTCACA CATCGGAAAC CCCGCAAGGC
ATACTCGCCA GTTTAAGCAA ACGCATCGGC GCGCCAGCGG TGCGCGCCGC CACCCGCGCG
GCGATGCGGC TGATGGGCAA TCACTTCGTG CTCGGCGAGA CCATCGCAGC GGCGCTGCAA
CGGGCGCAGG GCGGCGGCGC GCGTTATTCG TTCGACATGC TCGGCGAAGG CGCGCGCACG
CAAGCCGACG CCGACCGTTA CTTCGCCTCT TATGCGGCGG CGATCGACGC CATCGGGCGC
AGCGCCGGCA CTGCCCGACT GCCGGCACGT CCCGGCATTT CGGTGAAACT CTCGGCGCTG
CATCCGCGGT TCGAGGCGCT CAGCCGTGCC CGGGTGATGG ACGAACTGGT GCCGCGGGTG
ATCGAGCTGG CGCGGCAGGC CAAGTCGTAT GACCTCAACC TCACGCTCGA CGCCGAGGAA
GCCGACCGGC TCGAACTGTC GCTCGAGGTG TTCGCCGCGG TGTTGCGCGA TCCGTCGTTG
GCCGGCTGGG ACGGCTTTGG CCTCGCGGTG CAGGCCTATC AGAAGCGCGC GCTGGCGGTG
ATCGATCATG CGGCTGATCT CGCACAACGG CTCGACCGCC GGCTGATGCT GCGGCTGGTC
AAGGGCGCCT ATTGGGACAG CGAGATCAAG CGCGCGCAGG AGCGCGGGCT CGCCGACTAT
CCGGTGTTCA CCCGCAAGGC GATGACCGAC CTCAACTACC TCGCCTGCGT GCGAAGACTG
CTGGGGCTGC GGCCGCAGAT CTATCCGCAA TTCGCCACCC ACAACGCGCT CACCGTGGCA
AGCATTCTGG AGCTCGCCGG CGATAGCGAC GGCTTCGAAT TGCAGCGGCT GCACGGCATG
GGCGAAGCGC TCTATGCGCG ACTGCGGCAA GATCATCCGG CGCTGCCGTG CCGGATCTAC
GCCCCGGTCG GCGGACACCG CGATCTGTTG GCCTATCTGG TGCGGCGACT GTTGGAGAAC
GGCGCCAATT CCTCTTTCGT GGCGCTGGCC GGCGACGATC GCGTGCCGGT GGCGAATCTG
CTGCGGCGGC CCGCCGACAT CGTCGGCGAC GCCGCGCGGG CGCGGCACCC AAAGATTCCG
CTGCCGCGCG AGCTGGTTCG GCCGCGGCTC AATTCGGGTG GCTTCGAGTT CGGCGATCGC
GCCACGGTGC AGGCGCTGCT GGCCGAGATC GCCGCGCAAA CCCAGCCGGT CAGCGCCACG
CCGCTGATCG ACGGCGTCGC GAGTGGCGGA GTCGGGCGAC CGGTGATCAG CCCGATCGAT
TCCACCACCG TCGTCGGCGA GGTGTTCGAA GCCACGCCGG ATCAGGTCGC GCAGGCGATC
ACCGCGGCGC GGCGCGGATT CGTGACGTGG AGCCGCACGC CGGCGGACGC GCGCGCGGCG
ATTCTGCTAT GCGCGGCCAC ACTGCTCGAA CAGCGTCGCG CGCAATTCGT CGCGCTGCTG
CAGCGCGAGG CCGGCAAGAC GCTGGAGGAT TGTCTGTCCG AAGTGCGCGA GGCGGTGGAT
TTCTGCCGCT ATTACGCGAG CGAAGGGCGC CGGCTGTTCG GCGAAGGCGA GACGATGCCG
GGGCCGACCG GCGAGAGCAA TGTGCTGAGC CTGCGCGGCC GCGGCGTGTT CGTGGCGATC
TCGCCGTGGA ATTTTCCGCT GGCGATTTTC TTGGGCCAAA TCAGCGCCGC GCTGATGGCC
GGCAATACCG TGGTGGCCAA ACCCGCCGAA CAGACGCCGC TGATCGCGGC CCTGGCGGTG
CGGCTGCTGC ACGAGGCCGG CGTGCCAAAC TCCGCGCTGC AGCTTCTGCC AGGCGCCGGC
GCGATCGGCG CGTCGCTCGC GGCGCATTCC GACATCGATG GCGTGGTGTT CACCGGATCG
GGTGAGGTGG CGCGCGCGAT CAACCGGGCG CTCGCCGCCA AAGACGGCCC GATCGTGCCG
CTGATCGCCG AAACCGGCGG CATCAACGCG ATGATCGTCG ACGCCACCGC GCTGCCCGAG
CAGGTCGCCG ACGACGTCGT CACCTCGGCC TTCCGCTCCG CCGGACAGCG CTGCTCGGCG
CTGCGGCTGC TGTTCGTGCA AGACGACGTC GCCGACCGCA TGATCGAGAC CATCGCGGGC
AGCGCACGGG AGTTGAGCGT CGGCGATCCG CGCGATCCGG CGACGCAGCT CGGGCCGGTG
ATCGACGCCG ACGCCGCGGC CCGGCTCGAA ACCCACATCG CCCGGATGAA GCGCGAGGCG
CGGGTGCACT TCGCCGGCCA CGCGCCGGGC CCGGGTTACG TCGCGCCGCA TCTGATCGAA
TTGGCCGGCG CCGATCAGCT CCACGACGAA GTGTTCGGCC CAATCCTGCA TGTGGTGCGC
TACGCCGCAG ATCAGTTCGA CGCGGTGCTG CAATCGATTG CGCGCAGCGG CTACGGGCTG
ACGCTGGGAC TGCATTCGCG GGTGGATGCG ATGATCGCGC GCGCGATCGA GCGACTTCCG
ATCGGCAATG TCTACGTCAA CCGCAATATG ATCGGCGCGG TGGTGGGTGT GCAGCCGTTT
GGCGGATCGG GGCTGTCCGG CACCGGTCCG AAAGCCGGCG GGCCGCATTA CCTGCCCCGG
TTTGCGATCG AACAGACCGT CAGCATCAAC ACCGCGGCAG CCGGCGGCAA TGCGGCGTTG
CTGTCGGAGG GCGACGAGGA TGGCTGA
 
Protein sequence
MLDNSALPVF SAPYAADDGD LAAALLPTAR LSPEREARID ASATRLIAAI RADHDRLGGV 
EEMLREFALS TKEGVALMVL AEALLRVPDA LTADQFIEDK LGQGDFEHHA TRSGALLVNA
SAWALGISAR VVHTSETPQG ILASLSKRIG APAVRAATRA AMRLMGNHFV LGETIAAALQ
RAQGGGARYS FDMLGEGART QADADRYFAS YAAAIDAIGR SAGTARLPAR PGISVKLSAL
HPRFEALSRA RVMDELVPRV IELARQAKSY DLNLTLDAEE ADRLELSLEV FAAVLRDPSL
AGWDGFGLAV QAYQKRALAV IDHAADLAQR LDRRLMLRLV KGAYWDSEIK RAQERGLADY
PVFTRKAMTD LNYLACVRRL LGLRPQIYPQ FATHNALTVA SILELAGDSD GFELQRLHGM
GEALYARLRQ DHPALPCRIY APVGGHRDLL AYLVRRLLEN GANSSFVALA GDDRVPVANL
LRRPADIVGD AARARHPKIP LPRELVRPRL NSGGFEFGDR ATVQALLAEI AAQTQPVSAT
PLIDGVASGG VGRPVISPID STTVVGEVFE ATPDQVAQAI TAARRGFVTW SRTPADARAA
ILLCAATLLE QRRAQFVALL QREAGKTLED CLSEVREAVD FCRYYASEGR RLFGEGETMP
GPTGESNVLS LRGRGVFVAI SPWNFPLAIF LGQISAALMA GNTVVAKPAE QTPLIAALAV
RLLHEAGVPN SALQLLPGAG AIGASLAAHS DIDGVVFTGS GEVARAINRA LAAKDGPIVP
LIAETGGINA MIVDATALPE QVADDVVTSA FRSAGQRCSA LRLLFVQDDV ADRMIETIAG
SARELSVGDP RDPATQLGPV IDADAAARLE THIARMKREA RVHFAGHAPG PGYVAPHLIE
LAGADQLHDE VFGPILHVVR YAADQFDAVL QSIARSGYGL TLGLHSRVDA MIARAIERLP
IGNVYVNRNM IGAVVGVQPF GGSGLSGTGP KAGGPHYLPR FAIEQTVSIN TAAAGGNAAL
LSEGDEDG
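
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping all whitespace between blocks."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTTGGACA ATTCTGCTCT TCCCGTATTC AGCGCGCCCT ATGCCGCCGA CGACGGCGAT"
seq = join_blocks([first_line])
print(len(seq))                    # 60 bases: six blocks of 10
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Passing all lines of the gene sequence to `join_blocks` gives the full coding sequence, whose length and GC fraction can then be compared with the Gene Length and GC content fields.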