Gene RPD_1731 details


Gene Information

Locus tag: RPD_1731
Symbol:
ID: 4022211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1941809
End bp: 1944703
Gene Length: 2895 bp
Protein Length: 964 aa
Translation table: 11
GC content: 67%
IMG OID: 637961925
Product: glycine dehydrogenase
Protein accession: YP_568868
Protein GI: 91976209
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
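
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1941809, 1944703)
print(length, protein_length_aa(length))  # 2895 964
```

Under these assumptions the computed values agree with the listed Gene Length (2895 bp) and Protein Length (964 aa). The check does not depend on the strand, which is not given in the record.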


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0573168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAATC GCCGACCGAT CGACGCCGCC AACAACTTCG TGCGACGCCA TATCGGCCCG 
TCGCCGCAGG ACATCGCGCA GATGCTGAGG ACCGTGGGAG CAGGCAGCAT CGACCAGTTG
ATGGCCGAAA CGCTGCCTTA TGCGATCCGC ATCAAAGAGC CTCTGTCGCT CGGCGCGCCG
CTGTCGGAGA CCGAGGCGCT GGCGCACATG ACAGAACTCG CAGCGAAGAA CGCGGTGTTC
ACCTCGCTGA TCGGCCAAGG CTACTCCGGC ACGATCCTGC CGACCGTGAT CCAGCGCAAT
ATTCTGGAAA ATCCCGCCTG GTACACGGCC TATACGCCGT ATCAGCCCGA GATCAGCCAA
GGCCGGCTGG AAGCGCTGTT CAACTTCCAG ACCATGATCT GCGACCTGAC CGGGCTCGAC
CTCGCCAACG CCTCGCTGCT CGACGAGGCG ACCGCGGCGG CGGAAGCGAT GGCGCTGGCG
GAACGCGCCG CGCAAAAGAA GACCAAGGCG TTCTTCGTCG ATCGCGACAC CCATCCGCAG
ACGCTGGCGG TGCTGCGCAC CCGCGCCGAA CCACTCGGCT GGTCGATCAT CGTCGGCGAT
CCGGACACCG AGCTCGAAGC CGCGGACGTG TTCGGTGCTT TGCTGCAATA TCCCGGCTCG
TCCGGCGCTC TGCGCGACCC GCGCCCGGCG ATCGCGACGC TGCACAACAA GGGCGCGCTC
GCGGTGATCG CCGCGGATCT GCTGGCGCTG ACGCTGATCA CGTCGCCCGG CGAACTCGGC
GCCGATATCG CGATCGGCTC GGCGCAACGC TTCGGCGTGC CGATGGGCTA TGGCGGCCCG
CATGCCGGCT ACATGGCCGC GCGCGACAGC CTCAAGCGTT CGCTCCCCGG CCGCATCGTC
GGCCTGTCGA TCGATTCGCA CGGCCAGCCG GCCTATCGGC TGGCGATGCA GACCCGCGAG
CAGCACATCC GCCGCGAGAA GGCGACCTCG AACATCTGCA CCGCGCAGGT GCTGCTGGCG
GTGATTGCCG CGATGTATGC GGTGTATCAC GGCCCGGACG GCCTCGCCGC GATTGCCCGC
CGGGTGCATC GCCGCACCGC GACGCTGGCG AGCGGGCTGA AGCAGCTCGG CTTTGCGCCC
ATCAACGACG CCTATTTCGA TACGCTGACG GTCGAGGTCG GCGACAAGCG CGACGCCATC
GTCGCGCGCG CCGAGGCCGA ACAGATCAAT CTGCGGATCG GCGCGACCTC GCTCGGCATC
TCGCTCGATG AAACCACCAC CCCCGCGATC GTCGAGGCGC TATGGCGCGC GTTCGACGGA
TCGCTCGACT ACGCAAGCGT CGAGCGCGAC GCGACCGACA CGCTGCCGGC GGCGCTGACG
CGGACGAGCG ACTACCTGAC GCAACCGGCG TTCCAGGACT ATCGCTCGGA GACCGAATTG
CTCCGCTACA TGCGCAAGCT GTCGGACCGC GACCTCGCGC TCGACCGCGC GATGATTCCG
CTCGGCTCCT GCACCATGAA ACTCAACGCC ACTACCGAGA TGATGCCGCT GACCTGGCCT
CAATTCGGCA GCCTGCATCC GTTCGTGCCG CGGGCGCAGG CGGAGGGCTA TCATGCGATG
TTCGCGACGC TGGAAGCCTG GCTCGCCGAG ATCACCGGCT ACGACGCCGT GTCGCTGCAG
CCGAATTCCG GCGCGCAGGG CGAATATGCC GGCCTGCTGG CGATCCGCGG CTATCATCTG
TCGCGCGGCG AGCCGCACCG CAAGATCTGC CTGATCCCCT CCTCCGCGCA CGGCACCAAT
CCGGCCTCGG CCGCGATGGT CGGGATGGAT GTCGTCGTGG TCGCTTGCAA CAATCATGGC
GACGTCGACG TCGACGATCT GCGCGCCAAG GCGGAGAAGC ATTCGGCCGA ACTCGCCGCG
GTGATGATCA CCTATCCGTC GACCCACGGC GTGTTCGAGG AGCACATTCG CGAGATCTGC
GACATCGTCC ATGCCCATGG CGGCCAGATC TATCTCGACG GCGCCAACCT CAACGCGCAG
GTCGGCCTGG CGCGGCCCGG CGACTACGGC GCCGATGTCA GTCACCTCAA TCTGCACAAG
ACTTTCTGCA TTCCGCATGG CGGCGGCGGC CCGGGGATGG GGCCGATCGG CGTCAAGGCG
CATCTGGCGC CGTTCCTGCC CGGCCACCCG GCCGAGGGCG AGCCGTCGAG CGGCGTGCTG
CACGGCGGCG GCACGGTGTC GGCAGCGCCC TATGGCTCGG CGTCGATTCT CACGATCTCC
TATATCTACA TTCTGATGAT GGGCGGCGCC GGCCTGAAGC GGGCCACCGA GATCGCGATC
CTCAACGCCA ACTACATCGC GGCGCGACTG CAGCCGCATT TCCCGGTGCT GTATCGCAAC
CTGCGCGGCC GCGTCGCGCA TGAATGCATC GTCGATCCGC GACCGCTGAA GACGACGACC
GGGGTGACGG TCGACGACAT CGCCAAGCGG CTGATCGACT ACGGCTTCCA CGCCCCGACC
ATGAGCTTCC CGGTGCCGGG CACGCTGATG ATCGAGCCAA CCGAATCGGA GTCCAAGGCG
GAGATCGACC GGTTCTGCGA GGCGATGATC GCGATCCGGC GGGAGATCGC CCAGATCGAG
CAAGGCCGGT TCAAGGTCGA GGCGTCGCCG CTGCGCTTTG CGCCGCATAC GGTGCACGAC
GTCACCAGCG CGGAATGGAC GCGGCCCTAT CCGCGCACCG AGGGCTGCTT CCCGGCTCCG
AACTCGCGCA CCGACAAATA TTGGTGCCCG GTCGGCCGCG TCGACAACGT CTATGGCGAC
CGCAACCTCG TGTGCGCGTG CCCGCCGATC GAAGACTACG CACTGGCGGC GGACTACGCC
CGGGCGGCGG AGTAG
 
Protein sequence
MPNRRPIDAA NNFVRRHIGP SPQDIAQMLR TVGAGSIDQL MAETLPYAIR IKEPLSLGAP 
LSETEALAHM TELAAKNAVF TSLIGQGYSG TILPTVIQRN ILENPAWYTA YTPYQPEISQ
GRLEALFNFQ TMICDLTGLD LANASLLDEA TAAAEAMALA ERAAQKKTKA FFVDRDTHPQ
TLAVLRTRAE PLGWSIIVGD PDTELEAADV FGALLQYPGS SGALRDPRPA IATLHNKGAL
AVIAADLLAL TLITSPGELG ADIAIGSAQR FGVPMGYGGP HAGYMAARDS LKRSLPGRIV
GLSIDSHGQP AYRLAMQTRE QHIRREKATS NICTAQVLLA VIAAMYAVYH GPDGLAAIAR
RVHRRTATLA SGLKQLGFAP INDAYFDTLT VEVGDKRDAI VARAEAEQIN LRIGATSLGI
SLDETTTPAI VEALWRAFDG SLDYASVERD ATDTLPAALT RTSDYLTQPA FQDYRSETEL
LRYMRKLSDR DLALDRAMIP LGSCTMKLNA TTEMMPLTWP QFGSLHPFVP RAQAEGYHAM
FATLEAWLAE ITGYDAVSLQ PNSGAQGEYA GLLAIRGYHL SRGEPHRKIC LIPSSAHGTN
PASAAMVGMD VVVVACNNHG DVDVDDLRAK AEKHSAELAA VMITYPSTHG VFEEHIREIC
DIVHAHGGQI YLDGANLNAQ VGLARPGDYG ADVSHLNLHK TFCIPHGGGG PGMGPIGVKA
HLAPFLPGHP AEGEPSSGVL HGGGTVSAAP YGSASILTIS YIYILMMGGA GLKRATEIAI
LNANYIAARL QPHFPVLYRN LRGRVAHECI VDPRPLKTTT GVTVDDIAKR LIDYGFHAPT
MSFPVPGTLM IEPTESESKA EIDRFCEAMI AIRREIAQIE QGRFKVEASP LRFAPHTVHD
VTSAEWTRPY PRTEGCFPAP NSRTDKYWCP VGRVDNVYGD RNLVCACPPI EDYALAADYA
RAAE
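
The gene and protein sequences above are related by translation, and the GC content field is a property of the gene sequence. The following is a minimal sketch of both calculations using only the Python standard library; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It assumes the sequence is pasted in the 10-base blocks shown above and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGCCCAATC GCCGACCGAT CGACGCCGCC AACAACTTCG TGCGACGCCA TATCGGCCCG"
print(translate(first_line))  # MPNRRPIDAANNFVRRHIGP
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field; the sketch does not handle ambiguous bases such as N.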