Gene Rsph17029_0870 details

Gene Information

Locus tag: Rsph17029_0870
Symbol:
ID: 4896994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 890078
End bp: 892948
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 69%
IMG OID: 640111455
Product: glycine dehydrogenase
Protein accession: YP_001042753
Protein GI: 126461639
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
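
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 890078
END_BP = 892948
GENE_LENGTH_BP = 2871
PROTEIN_LENGTH_AA = 956


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 892948 - 890078 + 1 gives 2871 bp, and 2871 / 3 - 1 gives 956 aa, which agrees with the reported values.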


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.597554
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.602316
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCA CCCCCACGGA CTATAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC 
CCGTCGCCCT CCGAAATGGA GGAGATGCTG CGCGTGGTGG GGGTGTCCTC GCTCGACCAG
CTGATCGAGG AGACGGTGCC CGCCTCGATC CGGCAGGAGA CGCCGCTCGA CTGGGCGCCG
CTCGCCGAGC ATGAGCTGCT GGCCCGGATG CGCGAGGTGG CCGGCAAGAA CCGCGTGATG
ACCTCGCTCA TCGGGCAGGG CTATTACGGC ACGGTGACGC CGCCCGCCAT CCAGCGCAAC
ATCCTCGAGA ATCCGGCCTG GTATACGGCC TATACGCCCT ACCAGCCCGA GATCGCGCAG
GGGCGGCTCG AGGCGCTCTT GAACTACCAG ACCATGGTCG CGGACCTCAC CGGCCTGCCG
GTCGCCAACG CCTCGCTTCT CGACGAGGCG ACGGCGGCGG CCGAGGCCAT GACCATGGCC
GAGCGCGCCT CGAAGTCGAA GGCGCGGGCC TTCTTCGTCG ATGCCGACTG CCATCCGCAG
ACGATCTCGG TCATCCGCAC CCGCGCCGAG CCGCTGGGCA TCGAGGTGAT CGTGGGCCAT
CCCGCGCAGC TCGTGCCCGA GGATGTGTTC GGCGCGCTGT TCCAGTATCC CGGCACCTAC
GGGCTCGTGC GCGACTTCAC CCGCGACATT GCCGCGCTGC ACGAGGCGAA GGCGCTCGCC
GTGGTGGCGA CCGACCTTCT GGCGCTCTGC CTCCTCAAGG AGCCGGGCGC GATGGGCGCC
GACATCGCCA TCGGCTCGAG CCAGCGGTTC GGCGTGCCGA TGGGCTACGG CGGCCCGCAT
GCGGCCTTCA TGTCCTGCAA GGACGATCTG AAGCGCTCGA TGCCCGGCCG GCTCGTCGGC
GTTTCGGTCG ATGCGCGCGG CAACAAGGCC TACCGCCTCG CCCTCCAGAC CCGCGAGCAG
CATATCCGCC GCGAGAAGGC CACCTCGAAC GTCTGCACCG CGCAGGCGCT TCTGGCGGTC
ATGGCGAGCT TCTATGCCGT CTTCCACGGC CCCCGCGGCC TCCGCGCCAT CGCCGAGCGG
GTGCATCTGA ACACCGTCCG CCTCGCGACC GCGCTGAAGG AGGCCGGGGC CCGCGTCAGC
CCCGAGGCCT TCTTCGACAC GATCACCGTC GAGGTGGGCG TGGGGCAGGC GGGGATCCTC
GCCGCCGCGC GCCACCGGGG CATCAACCTG CGCAAGGTGG GCCGCGACCG GGTGGGGATC
TCGCTCGACG AGACGACCGA TGCCGGCGTG ATCGCGCGCG TGCTCGATGC CTTCGGGATC
CACGAGCCCG CGCCCGCGAA AGTGGGCCTC GGCTTCCCCG AGCCGCTCCT GCGCGAGACC
GGCTATCTCT CACATCCGGT CTTCCAGATG AACCGGGCGG AATCCGAGAT GATGCGCTAC
ATGCGCCGCC TTTCGGACCG TGACCTCGCG CTGGACCGGG CGATGATCCC GCTCGGATCC
TGCACGATGA AGCTGAATGC CGCCGCCGAG ATGATGCCCA TCACCTGGCC CGAATTCGGC
ACGCTGCATC CGTTCGCGCC GGCCGATCAG GCCGCGGGCT ATCACGAGGC CATCGGCGAT
CTGGCCCAGC GGCTCTGCCG GATCACCGGC TACGACGCCA TGTCGATGCA GCCGAACTCG
GGCGCGCAGG GCGAATATGC GGGGCTTCTG ACCATCCTCG CCTATCACCG GGCGCGGGGC
GAAGCGGAGC GCACGATCTG CCTCATCCCG GTCTCGGCGC ATGGCACCAA CCCGGCCTCG
GCGCAGATGG CGGGGATGAA GGTGGTCGTG GTGAAGTCGG CGCCGAACGG CGACGTCGAT
CTCGAGGATT TTCGCGACAA GGCGGCGGCG GCGGGCGACC GGCTCGCGGC CTGCATGATC
ACCTATCCCT CGACCCACGG CGTCTTCGAG GAGACGGTGC GCGAGGTCTG CCGCATCACC
CACGAGCACG GCGGTCAGGT CTATATCGAC GGGGCCAACA TGAACGCGAT GGTGGGCCTC
GTGCAGCCGG GCGCCATCGG CGGCGACGTG AGCCACCTCA ACCTGCACAA GACCTTCGCC
ATTCCGCATG GCGGCGGCGG CCCCGGCATG GGGCCCATCG GGGTCAAGGC GCATCTGGCG
CCCTACCTGC CGGGCCATCC CGAGGTGACG GGGCCGCTGA CCGGCGGCCA TGACGAGGCG
GCCGACGAGG GGCCGGTCTC GGCCGCGCCC TATGGCTCGG CCTCGATCCT GCTCATCTCC
TGGGCCTATT GCCTGATGAT GGGCGGCGAG GGGCTCACGC AGGCGACGCG GGTCGCGATC
CTGAACGCAA ACTATATCGC GGCGCGGTTG CGCGGGGCCT ACAAGGTGCT GTTCATGGGC
AATCGCGGGC GCGTGGCCCA CGAATGCATC CTCGACACCC GGCCCTTCGC CGAGGCGGGC
GTGACGGTGG ACGATATCGC CAAGCGGCTG ATCGACAACG GCTTCCACGC GCCGACCATG
AGCTGGCCCG TGCCGGGGAC CCTGATGGTC GAGCCCACGG AATCCGAGAC CAAGGCCGAG
ATCGACCGGT TCGTGGCGGC GCTCCTTGCG ATCCGCGAGG AGATCCGCGC GGTCGAGGAG
GGCGAAATCG CCGCCGGGGA CAGCCCGCTG CGCCATGCGC CGCATACGGT CGAGGATCTG
GTGGCGGACT GGGACCGCAA ATATCCGCGC GAGCAAGGCT GCTTCCCGCC GGGCTCGTTC
CGGGTCGACA AATACTGGCC GCCCGTCGGC CGGGTCGACA ATGCGTGGGG CGACCGCAAC
CTCGTCTGCA CCTGTCCGCC GGTGGAAAGC TACAGCATCG CCGCGCAGTA G
 
Protein sequence
MSFTPTDYNA YDFANRRHIG PSPSEMEEML RVVGVSSLDQ LIEETVPASI RQETPLDWAP 
LAEHELLARM REVAGKNRVM TSLIGQGYYG TVTPPAIQRN ILENPAWYTA YTPYQPEIAQ
GRLEALLNYQ TMVADLTGLP VANASLLDEA TAAAEAMTMA ERASKSKARA FFVDADCHPQ
TISVIRTRAE PLGIEVIVGH PAQLVPEDVF GALFQYPGTY GLVRDFTRDI AALHEAKALA
VVATDLLALC LLKEPGAMGA DIAIGSSQRF GVPMGYGGPH AAFMSCKDDL KRSMPGRLVG
VSVDARGNKA YRLALQTREQ HIRREKATSN VCTAQALLAV MASFYAVFHG PRGLRAIAER
VHLNTVRLAT ALKEAGARVS PEAFFDTITV EVGVGQAGIL AAARHRGINL RKVGRDRVGI
SLDETTDAGV IARVLDAFGI HEPAPAKVGL GFPEPLLRET GYLSHPVFQM NRAESEMMRY
MRRLSDRDLA LDRAMIPLGS CTMKLNAAAE MMPITWPEFG TLHPFAPADQ AAGYHEAIGD
LAQRLCRITG YDAMSMQPNS GAQGEYAGLL TILAYHRARG EAERTICLIP VSAHGTNPAS
AQMAGMKVVV VKSAPNGDVD LEDFRDKAAA AGDRLAACMI TYPSTHGVFE ETVREVCRIT
HEHGGQVYID GANMNAMVGL VQPGAIGGDV SHLNLHKTFA IPHGGGGPGM GPIGVKAHLA
PYLPGHPEVT GPLTGGHDEA ADEGPVSAAP YGSASILLIS WAYCLMMGGE GLTQATRVAI
LNANYIAARL RGAYKVLFMG NRGRVAHECI LDTRPFAEAG VTVDDIAKRL IDNGFHAPTM
SWPVPGTLMV EPTESETKAE IDRFVAALLA IREEIRAVEE GEIAAGDSPL RHAPHTVEDL
VADWDRKYPR EQGCFPPGSF RVDKYWPPVG RVDNAWGDRN LVCTCPPVES YSIAAQ
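
The sequences above are printed in blocks of ten characters, so they need light cleanup before any analysis. The sketch below is an illustration only, not part of the original record. It joins the blocks, computes the G+C fraction, and applies a rough coding-sequence check. All function names are hypothetical. The check accepts only an ATG start, which is a simplification, since translation table 11 also allows other start codons. No GC value is asserted here for the full gene; the record reports 69%.

```python
# First line of the gene sequence, copied from the listing above.
FIRST_LINE = (
    "ATGAGCTTCA CCCCCACGGA CTATAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC"
)

# Stop codons of translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: ATG start (simplification), whole codons, terminal stop."""
    return (
        seq.startswith("ATG")
        and len(seq) % 3 == 0
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    fragment = clean_sequence(FIRST_LINE)
    print(len(fragment))         # six blocks of ten bases
    print(gc_content(fragment))  # GC fraction of this fragment only
```

To check the whole gene, paste the full gene sequence listing into `clean_sequence` and pass the result to `gc_content` and `looks_like_cds`.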