Gene RPB_3740 details


Gene Information

Locus tag: RPB_3740
Symbol:
ID: 3911543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4278412
End bp: 4281306
Gene Length: 2895 bp
Protein Length: 964 aa
Translation table: 11
GC content: 68%
IMG OID: 637885641
Product: glycine dehydrogenase
Protein accession: YP_487345
Protein GI: 86750849
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
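
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4278412
end_bp = 4281306
gene_length_bp = 2895
protein_length_aa = 964

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted in the protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinate span matches gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print True: the span is 2895 bp, which is 965 codons, or 964 amino acids plus one stop codon.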


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.872567
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAATC GCCGACCGAT CGACGAAGCC CACGACTTCG TGCGACGCCA TATCGGCCCG 
TCGCCCCAGG ACATCGATGC GATGTTGGCA ACGGTGGGCG CGGACAGCCT CGACCAGCTA
ATGGCCGAGA CGCTGCCGGA TTCGATCCGC ATCAAACAGC CGCTGTCGCT CGGCACGCCG
CTGTCGGAAC CCGACGCGCT GACGCATATG ACCGAGCTGG CGGCGAAGAA CCAGGTGTTC
ACCTCGCTGA TCGGCCAAGG CTACTCCGGC ACGATCCTGC CGACCGTGAT CCAGCGCAAT
ATTCTGGAGA ATCCCGCCTG GTACACCGCC TACACGCCGT ATCAGCCGGA GATCAGCCAG
GGCCGGCTGG AGGCGCTGTT CAACTTCCAG ACCATGATCT GCGACCTCAC CGGGCTCGAC
GTCGCCAACG CCTCGCTGCT CGATGACGCA ACCGCGGCGG CCGAAGCGAT GGCGCTGGCG
GAGCGCGCCG TCGCGAAGAA AACCAAGGCG TTCTTCGTCG ACCGCGACAC CCATCCGCAG
ACGCTGGCGG TGCTGCGCAC CCGCGCCGAG CCGCTCGGCT GGTCGATCAT CGTCGGCGAT
CCGGACACCG AGCTCGAAGC CGCCGACGTA TTCGGCGCGC TGCTGCAATA TCCAGGCTCG
TCGGGCGCCT TGCGCGACCC GCGCGCGGTA ATCGCGACGC TACACAAGAA GGGCGCGCTG
GCGGTGATCG CCGCCGATCT GCTGGCGCTG ACGCTGATCG CCTCGCCGGG CGAACTCGGC
GCCGACATCG CGATCGGCTC GGCGCAGCGC TTCGGCGTGC CGATGGGCTA TGGCGGCCCG
CATGCCGGCT ACATGGCGGC GCGCGACAGC CTCAAGCGTT CGTTGCCCGG CCGCATCGTC
GGCCTGTCGA TCGACAGCCA CGGCCAGCCG GCCTACCGGC TGGCGCTGCA GACCCGCGAA
CAGCACATCC GCCGCGAGAA GGCGACCTCC AACATCTGCA CCGCGCAGGT GCTGCTCGCG
GTGATCGCCG CGATGTATGC GGTGTATCAC GGCCCCGATG GCCTCGCCGC GATCGCCCGC
CGGGTGCACC GGCGCACCGC GACGCTGGCG GCCGGGCTGA AGCAGCTCGG CTTCGCGCCG
ATCAACGAGA CCTACTTCGA TACGCTGACG ATCGAGGTCG GCAGCAAGCG CGACGCCATC
GTGGCGCGCG CCGAGGCCGA AAAGATCAAT CTGCGGATCG GCGCGTCGTC GCTCGGCATC
GCGCTCGATG AGACCACCAC GCCCGCAACC GTCGAGGCGC TGTGGCGCGC TTTCGGCGGG
GAGCTGAACT ACGCGGCAGT CGAGCGCGAC GCGACCGACA CGCTGCCCGC CTCGCTGACG
CGGACCGGCG ACTATCTCAC CCAGCCGGCG TTCCAGGACT ATCGCTCGGA GACCGAACTG
CTGCGCTACA TGCGCAAACT CAGCGACCGC GACCTGGCGC TCGACCGCGC GATGATCCCG
CTCGGCTCCT GCACCATGAA GCTCAACGCC ACCACCGAGA TGATGCCGCT GACCTGGCCG
GCCTTCGGCA GCCTGCATCC GTTCGTGCCG CGCGCGCAGG CGCAGGGCTA TCACGAGATG
TTCGCGCGGC TCGAGGCCTG GCTCGCCGAG ATCACCGGCT ACGACGCGGT GTCGCTGCAG
CCGAATTCCG GCGCGCAGGG CGAATATGCC GGGCTGCTGG CGATCCGCGG CTATCATCTA
TCGCGCGGCG AGCCGCATCG CAGGATCTGC CTGATCCCCT CCTCCGCGCA CGGCACCAAC
CCGGCGTCGG CGGCGATGAC CGGGATGGAC GTCGTGGTGG TCGCCTGCAA CAGCCATGGC
GACGTCGACG TCGACGATCT GCGCGCCAAG GCCGAGAAAC ATTCGGCCGA ACTCGCCGCG
GTGATGATCA CCTATCCGTC GACCCACGGC GTGTTCGAGG AGCATATCCG CGACATCTGC
GACATCGTCC ACGCCCATGG CGGCCAGGTC TACCTCGACG GCGCCAATCT CAACGCGCAG
GTCGGGCTGT CGAGGCCGGG CGACTACGGC GCCGACGTCA GCCACCTCAA TCTGCACAAG
ACGTTCTGCA TCCCGCATGG CGGCGGCGGC CCGGGCATGG GCCCGATCGG CGTCAAGGCG
CATCTGGCGC CGTTCCTGCC CGGCCATCCG GCCGAGGGCG AGGCTTTGAA CGGCGTGCTG
CACGGCGGCG GCACGGTGTC GGCGGCGCCC TACGGCTCGG CGTCGATCCT GACGATCTCC
TACATCTACA TCCTGATGAT GGGCGGCGCC GGCCTCAAGC GCGCCACCGA GATCGCGATC
CTCAACGCCA ACTACATCGC GGACCGGCTG CAGCCGCATT TCCCGGTGCT GTATCGCAAT
CTGCGCGGCC GCGTCGCGCA TGAGTGCATC GTCGATCCGC GGCCGCTGAA GACCACCACC
GGCGTCACCG TCGACGACAT CGCCAAGCGG CTGATCGACT ACGGCTTCCA CGCGCCGACG
ATGAGCTTCC CGGTGCCAGG CACGCTGATG ATCGAGCCGA CGGAGTCGGA ATCGAAGGCC
GAGATCGATC GGTTCTGCGA CGCCATGATC GCGATCCGGC AGGAGATCGC GCAGATCGAG
GACGGCCGCT TCAAGGTGGA GGCCTCGCCG CTGCGGTTCG CGCCGCACAC GGTGCACGAC
GTCACCTCGG CGGAATGGAC CAGGCCCTAT CCGCGCACCG AGGGCTGTTT CCCGGCGCCG
CATTCGCGCA CCGACAAATA TTGGTGCCCG GTCGGCCGCG TCGACAACGT CTATGGCGAC
CGCAATCTGG TGTGCTCGTG CCCGCCGATC GAAGACTACG CACTGGCCGC CGACTACGCG
CGGGCGGCGG AGTAA
 
Protein sequence
MPNRRPIDEA HDFVRRHIGP SPQDIDAMLA TVGADSLDQL MAETLPDSIR IKQPLSLGTP 
LSEPDALTHM TELAAKNQVF TSLIGQGYSG TILPTVIQRN ILENPAWYTA YTPYQPEISQ
GRLEALFNFQ TMICDLTGLD VANASLLDDA TAAAEAMALA ERAVAKKTKA FFVDRDTHPQ
TLAVLRTRAE PLGWSIIVGD PDTELEAADV FGALLQYPGS SGALRDPRAV IATLHKKGAL
AVIAADLLAL TLIASPGELG ADIAIGSAQR FGVPMGYGGP HAGYMAARDS LKRSLPGRIV
GLSIDSHGQP AYRLALQTRE QHIRREKATS NICTAQVLLA VIAAMYAVYH GPDGLAAIAR
RVHRRTATLA AGLKQLGFAP INETYFDTLT IEVGSKRDAI VARAEAEKIN LRIGASSLGI
ALDETTTPAT VEALWRAFGG ELNYAAVERD ATDTLPASLT RTGDYLTQPA FQDYRSETEL
LRYMRKLSDR DLALDRAMIP LGSCTMKLNA TTEMMPLTWP AFGSLHPFVP RAQAQGYHEM
FARLEAWLAE ITGYDAVSLQ PNSGAQGEYA GLLAIRGYHL SRGEPHRRIC LIPSSAHGTN
PASAAMTGMD VVVVACNSHG DVDVDDLRAK AEKHSAELAA VMITYPSTHG VFEEHIRDIC
DIVHAHGGQV YLDGANLNAQ VGLSRPGDYG ADVSHLNLHK TFCIPHGGGG PGMGPIGVKA
HLAPFLPGHP AEGEALNGVL HGGGTVSAAP YGSASILTIS YIYILMMGGA GLKRATEIAI
LNANYIADRL QPHFPVLYRN LRGRVAHECI VDPRPLKTTT GVTVDDIAKR LIDYGFHAPT
MSFPVPGTLM IEPTESESKA EIDRFCDAMI AIRQEIAQIE DGRFKVEASP LRFAPHTVHD
VTSAEWTRPY PRTEGCFPAP HSRTDKYWCP VGRVDNVYGD RNLVCSCPPI EDYALAADYA
RAAE
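
The sequences above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGCCGAATC GCCGACCGAT CGACGAAGCC CACGACTTCG TGCGACGCCA TATCGGCCCG"
last_line = "CGGGCGGCGG AGTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))   # bases on one full line of the listing
print(head[:3])    # first codon of the gene
print(tail[-3:])   # last codon of the gene
print(gc_fraction(head))  # GC fraction of the first line only, not the whole gene
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way would give a value to compare against the GC content field of the record.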