Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3946 |
Symbol | |
ID | 3911753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4503337 |
End bp | 4506348 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637885850 |
Product | bifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase |
Protein accession | YP_487550 |
Protein GI | 86751054 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase [COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase |
TIGRFAM ID | [TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCTG ATTCGCCGCA TCCGGAATTC GACGCTGCCT ACGCGCCCGA CGACGCCACG CTCGCGGCGG CGCTGATCGC GGTGATGCCG CGCGACCCCG CGCGCGAGGC GCCGATCGAC ACGACCGCCA CCGGACTGAT CGCAGCGATC CGTGCCGGCG ATCATCATCT CGGCAGCGTC GAGGCGATGC TGCGCGAATA CGCGCTATCC AGCAAAGAGG GGCTTGCGCT GATGGTGCTG GCCGAGGCGC TGCTGCGGGT GCCGGACGCG GCGACCGCCG ACGCCTTCAT CGAGGACCGG CTGGGCCAGG GCGATTTCGC GCATCACCGC ATCAAGTCCG ATGCGCTGCT GGTCAACGCC TCGGCCTGGG CGCTCGGCCT GTCGGCGCGC GTGGTGCATG CCGGCGACAC GCCGCAAAGC ACGCTGGCAG CGCTGGCGAA GCGGATCGGC GCGCCGGCGG TGCGCGCGGC GACGCGGCAG GCGATGCGGC TGCTCGGCAA TCATTTCGTG CTCGGCGAGA CCATCGACGA TGCGCTCGCC CGCGCGCAGG CGGATGCCGC CGATGGCGTG CGCTATTCCT ACGACATGCT TGGCGAAGGT GCGCGCACCC GGGCCGACGC CGAGCGCTAC TTCGCGTCCT ACGCGCAGGC GATCGACGCG ATCGGCCGCG CCGCCGGCAA CGCGCGACTG CCGGCGCGGC CGGGCATCTC GGTGAAACTC TCGGCGCTGC ATCCACGCTT CGAGGCGGTG AGCCGCGACC GCGTGCTGCG CGAGCTGACG CCGCGCCTGA TCGAGCTGGC GCGCAAAGCC AAGGACCACG ACCTCGGCTT CACCGTCGAC GCCGAGGAGG CCGACCGGCT CGAACTCTCG CTCGAGGTGT TCGCCGCCTG CCTCGCCGAT CCTTCGCTCA AAGGCTGGGA CGGCTACGGC CTCGCGGTGC AGGCCTATCA GAAACGCGCA TCCGCGGTGA TCGACTACGT CGATGCGCTG GCGCAGCGAC ACGATCGCCG GCTGATGCTG CGGCTGGTGA AGGGCGCCTA TTGGGACACC GAGATCAAGC GCGCGCAGGA GCGCGGCCTT GCCGACTATC CGGTGTTCAG CCGCAAGGCG ATGACCGACC TCAACTACCT GCATTGCGTG CAGAAACTAC TCGGGCTGCG GCAACGGATC TTCCCGCAAT TCGCCAGCCA CAACGCGCTG ACGGTCGCGA CCGTCCTCGA CCTCGCCGGC GACAGCGACG GCTACGAATT CCAGCGGCTG CACGGCATGG GCGAGGCGCT GTATGCGCGG CTGCGCGCCG AGCATCCCGA GCTCACCTGC CGGATCTACG CCCCGGTCGG CGGCCATCGC GATCTGCTCG CCTATCTGGT GCGGCGGCTG CTGGAGAACG GCGCCAATTC GTCGTTCGTC GCGCAGGCCG GCGACGATAC CGTGCCGGTA GAGGAGTTGC TGCGACGGCC GGAGGCGATC ATCGGGACGC CCGACAACGC CCGCCATCCG CACATCCCGC TGCCGCGAGA CTTGTATCGG CCGCAGCGCG AAAACTCGCG CGGCGTCGAA TTCGGCGACC GCGCCGCGCT GCGACGGCTG CTCAGTGAGA TCGACGCCGC GCGCCGGCCG CTGCCGCGCG TCACTCCCGT AACGCCGGAG GATGCGGCGG CGGCGGCGGT GGCCGCCGCG CGCGATGGCT TCGCCGCATG GAGCCGCACG CCTGCCGAGA CCCGCGCCGC CGCGCTGGAG CGCGCCGCTG ATCTGTTGGA GCAGCGCCGC GCGAACTTCG TCGCGCTGCT GCAGGACGAA GCCGGCAAGA CGCTCGACGA CTGCATCTCC GAGGTCCGCG AAGCGATCGA TTATTGCCGC TACTACGCGG CCGAAGGTCG CCGGCTATTC GGCGACGGCG TCGCGCTGCC CGGCCCGACC GGCGAGCGCA ACAGTCTGCG GCTGCGTGGC CGCGGCGTCT TCGTCGCGAT CTCGCCGTGG AATTTTCCGC TGGCGATTTT CCTCGGCCAG ATCACCGCGG CGCTGATGGC CGGCAACGCC GTGATCGCCA AGCCGGCGGA GCAGACGCCG GTGATCGGCG ACGCCGCCGT GGGGCTTTTG CACGAGGCCG GCGTGCCGCG CGCGGCGCTG CAACTCGTGC AAGGCGACGG CGCGATCGGC GCGGCGCTGG TGGCGCATCG CGACGTCGCC GGGGTCGTCT TCACCGGCTC GACAGACGTG GCGCGCGCGA TCAACCGCGC GCTCGCCGCG AAGGACGGCC CGATCGTGCC CCTGATCGCC GAGACCGGCG GCATCAATAC GATGATCGTC GACGCCACCG CGCTGCCCGA GCAGGTCGCC GACGACGTCG TGACCTCGGC GTTCCGCTCC GCCGGCCAGC GCTGCTCGGC GCTGCGGCTG TTGTTCGTGC AGGACGACGT CGCCGAGAGG ATGATCGAGA TGATCGCCGG CAGCGCCCGC GAATTGAAGC TCGGCGACCC GCGCGATCCG GCGACGCATC TCGGCCCGGT GATCGACGCC GAGGCCAAGG TCAGGCTCGA CGCGCATATT GCGGCGATGA CGCGCGAGGC GCGGCTGCAT TTCGCAGGCG CCGCGCCGTC CACCGGCAAC TACGTGGCAC CGCACATCTT CGAACTGCGC GACGCCGCGC AACTGACCGA GGAAGTGTTC GGCCCGATCC TGCACGTCGT GCGCTACAAG GCCGCCCACC TCGACGCCGT GCTCGCTGGC ATCGCGGCGA GCGGCTACGC GCTGACGCTC GGCGTGCAGT CGCGGATCGA CGACACGGTG GCGTGCATCG TCGACCGGCT CGCGATCGGC AACGTCTACG TCAACCGCAA CATGATCGGC GCCGTCGTCG GCACGCAGCC GTTCGGCGGC TCGGGCCTGT CCGGCACCGG GCCGAAGGCC GGCGGCCCGC ATTATCTGCC GCGCTTCACG CTGGAACAGA CGGTGTCGAT CAACACCGCG GCGGCCGGCG GCAACGCTGC GTTGCTGGCC GGCGACGAGT GA
|
Protein sequence | MPPDSPHPEF DAAYAPDDAT LAAALIAVMP RDPAREAPID TTATGLIAAI RAGDHHLGSV EAMLREYALS SKEGLALMVL AEALLRVPDA ATADAFIEDR LGQGDFAHHR IKSDALLVNA SAWALGLSAR VVHAGDTPQS TLAALAKRIG APAVRAATRQ AMRLLGNHFV LGETIDDALA RAQADAADGV RYSYDMLGEG ARTRADAERY FASYAQAIDA IGRAAGNARL PARPGISVKL SALHPRFEAV SRDRVLRELT PRLIELARKA KDHDLGFTVD AEEADRLELS LEVFAACLAD PSLKGWDGYG LAVQAYQKRA SAVIDYVDAL AQRHDRRLML RLVKGAYWDT EIKRAQERGL ADYPVFSRKA MTDLNYLHCV QKLLGLRQRI FPQFASHNAL TVATVLDLAG DSDGYEFQRL HGMGEALYAR LRAEHPELTC RIYAPVGGHR DLLAYLVRRL LENGANSSFV AQAGDDTVPV EELLRRPEAI IGTPDNARHP HIPLPRDLYR PQRENSRGVE FGDRAALRRL LSEIDAARRP LPRVTPVTPE DAAAAAVAAA RDGFAAWSRT PAETRAAALE RAADLLEQRR ANFVALLQDE AGKTLDDCIS EVREAIDYCR YYAAEGRRLF GDGVALPGPT GERNSLRLRG RGVFVAISPW NFPLAIFLGQ ITAALMAGNA VIAKPAEQTP VIGDAAVGLL HEAGVPRAAL QLVQGDGAIG AALVAHRDVA GVVFTGSTDV ARAINRALAA KDGPIVPLIA ETGGINTMIV DATALPEQVA DDVVTSAFRS AGQRCSALRL LFVQDDVAER MIEMIAGSAR ELKLGDPRDP ATHLGPVIDA EAKVRLDAHI AAMTREARLH FAGAAPSTGN YVAPHIFELR DAAQLTEEVF GPILHVVRYK AAHLDAVLAG IAASGYALTL GVQSRIDDTV ACIVDRLAIG NVYVNRNMIG AVVGTQPFGG SGLSGTGPKA GGPHYLPRFT LEQTVSINTA AAGGNAALLA GDE
|
| |