Gene Rsph17025_2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2297 
Symbol 
ID5083716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2335728 
End bp2338598 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content69% 
IMG OID640483860 
Productglycine dehydrogenase 
Protein accessionYP_001168491 
Protein GI146278332 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.81309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCA CCCCCACCGA CTACAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC 
CCGTCGCCCT CCGAAATGGA GGAGATGCTG CGCGTGGTGG GGGTGTCCTC GCTCGACCAG
CTGATCGAGG AGACGGTGCC CGCCTCGATC CGGCAGGATC AGCCGCTCGA CTGGGCGCCG
CTGGCCGAGC ATGAACTTTT GCAGAAGATG CGCGAGGTGG CGGCGAAGAA CCGCGTGATG
GTCTCGCTGA TCGGGCAGGG CTATTACGGC ACCGTGACGC CGCCCGCGAT CCAGCGCAAC
ATCCTGGAAA ACCCGGCCTG GTACACGGCC TACACGCCCT ACCAGCCCGA GATTGCGCAA
GGGCGGCTCG AGGCGCTTCT GAACTATCAG ACCATGGTGG CGGACCTGAC CGGCCTGCCG
GTGGCGAACG CCTCGCTTCT GGACGAGGCG ACGGCGGCGG CCGAGGCCAT GACCATGGCC
GAGCGCGCCT CGAAGTCGAA GGCGCGGGCC TTCTTCGTCG ATGCCGACTG CCATCCCCAG
ACGATCGCCG TGATCCGCAC CCGGGCAGAG CCCCTGGGGA TCGAGGTCAT CGTGGGACAC
CCGGCGCAAC TGGTGCCCGA GGACGTGTTC GGCGCGCTCT TCCAGTATCC CGGCACCTAC
GGCCTCGTGC GCGACTTCAC CCGCGAGATC GCGGCGCTGC ACGAGGCGAA GGCGCTGGCG
ATCGTGGCGA CCGACCTTCT GGCGCTCTGC CTGCTGAAGG AGCCGGGCGC CATGGGCGCC
GACATCGCCA TCGGCTCGAG CCAGCGGTTC GGCGTGCCGA TGGGCTATGG CGGCCCGCAC
GCGGCCTTCA TGTCCTGCCG GGACGAGCTG AAACGTTCGA TGCCGGGGCG GCTGGTCGGC
GTCTCGGTCG ATGCGCGCGG CAACAAGGCC TATCGCCTCG CGCTCCAGAC GCGCGAGCAG
CACATCCGGC GCGAGAAGGC GACCTCGAAC GTCTGCACCG CGCAGGCGCT TCTGGCGGTG
ATGGCGAGCT TCTACGCCGT CTTCCACGGC CCCAAGGGCC TGCGCCACAT CGCCGAGCGG
GTGCATCTGA ACACCGTGCG GCTGGCTCAG GCGCTGAAGG AGGCCGGCGC CCGCGTCAGC
CCCGAGGCCT TCTTCGACAC GATCACCGTC GAGGTGGGGG TGGGGCAGGC GGGCATCCTG
GCCGCCGCGC GCCACCGTGG CATCAACCTG CGCAAGGTGG GCCGCGACCG GGTGGGCATC
TCGCTCGACG AGACGACCGA CGCAGGCGTG ATCGCCCGCG TGCTGGACGC CTTCGGCATC
CATGATCCCG CACCCGCGAG CGTGGGCCTC GGCTTCCCCG AGGCGATGCT GCGCGAGTCC
GCCTATCTGA GCCACCCGGT CTTTGACATG AACCGGGCCG AGTCCGAGAT GATGCGCTAC
ATGCGCCGCC TGTCGGACCG GGATCTGGCG CTCGACCGCG CGATGATCCC GCTGGGAAGC
TGCACGATGA AGCTGAACGC CGCGGCCGAG ATGATGCCGA TCACCTGGCC CGAGTTCGCC
ACCCTTCATC CCTTCGCGCC ACCCGAACAG GCGGCGGGCT ACACCGAGGC GATCCGCGAC
CTCAGCGACC GGCTCTGCCG CATCACCGGC TATGACGCCA TGTCGATGCA GCCGAACTCG
GGCGCGCAGG GGGAATATGC GGGGCTGCTC ACGATCCTCG CCTATCACCG CGCGCGGGGC
GAGGGGCAGC GCACCATCTG CCTCATCCCG ATGTCGGCGC ACGGGACGAA CCCGGCCTCG
GCGCAGATGG CGGGGATGAA GGTGGTCGTG GTCAAGTCGG CGCCGAACGG CGACGTCGAT
CTGGACGACT TCCGCGACAA GGCGGCGGCG GCGGGCGACC GGCTCGCGGC CTGCATGATC
ACCTATCCCT CGACCCACGG CGTCTTCGAA GAGACGGTGC GCGACGTCTG CCGCATCACC
CACGAGCACG GCGGGCAGGT CTATATCGAC GGCGCCAACA TGAACGCGAT GGTGGGCCTC
GTGCAGCCGG GCGCCATCGG CGGCGACGTG AGCCACCTGA ACCTGCACAA GACCTTTGCC
ATTCCGCATG GCGGCGGCGG TCCGGGCATG GGGCCGATCG GAGTGAAGGC GCATCTCGCG
CCCTACCTGC CGGGCCACCC GGAGACGGGC GGCCCCCTGG CCGGAGGCCA GATCGTGGCT
ACGCACGAGG GGCCGGTCTC GGCCGCGCCC TATGGCTCGG CCTCGATCCT GCTGATCTCC
TGGGCCTATT GCCTGATGAT GGGCGGCGAG GGGCTGACGC AGGCGACGCG GGTGGCGATC
CTGAACGCCA ACTATGTCGC GGCGCGGCTG AGGGGGGCCT ATGACGTGCT GTTCATGGGC
AACCGCGGGC GCGTGGCGCA TGAGTGCATC CTCGACACCC GCCCCTTCGC CGAGGCCGGC
GTGACGGTGG ACGACATTGC CAAGCGCCTG ATCGACAACG GCTTCCACGC CCCCACCATG
AGCTGGCCCG TGCCCGGCAC GCTGATGGTG GAGCCGACCG AATCCGAGAC CAAGGCCGAG
ATCGACCGCT TCATCACGGC GCTTCTCGCG ATCCGCGAGG AGATCCGGGC GGTCGAGGCG
GGCGAGATCG CGGCGGCCGA CAGCCCGCTG CGCCACGCGC CCCACACGGT TGAGGATCTG
GTCGCCGACT GGGATCGCAA CTATCCGCGC GAGCAGGGTT GCTTCCCGCC GGGCGCCTTC
CGGGTGGACA AGTACTGGCC GCCCGTCGGC CGGGTGGACA ATGCCTGGGG CGATCGCAAC
CTCGTCTGCA TCTGCCCGCC GGTCGAAAGC TACAGCATCG CGGCGCAATA G
 
Protein sequence
MTFTPTDYNA YDFANRRHIG PSPSEMEEML RVVGVSSLDQ LIEETVPASI RQDQPLDWAP 
LAEHELLQKM REVAAKNRVM VSLIGQGYYG TVTPPAIQRN ILENPAWYTA YTPYQPEIAQ
GRLEALLNYQ TMVADLTGLP VANASLLDEA TAAAEAMTMA ERASKSKARA FFVDADCHPQ
TIAVIRTRAE PLGIEVIVGH PAQLVPEDVF GALFQYPGTY GLVRDFTREI AALHEAKALA
IVATDLLALC LLKEPGAMGA DIAIGSSQRF GVPMGYGGPH AAFMSCRDEL KRSMPGRLVG
VSVDARGNKA YRLALQTREQ HIRREKATSN VCTAQALLAV MASFYAVFHG PKGLRHIAER
VHLNTVRLAQ ALKEAGARVS PEAFFDTITV EVGVGQAGIL AAARHRGINL RKVGRDRVGI
SLDETTDAGV IARVLDAFGI HDPAPASVGL GFPEAMLRES AYLSHPVFDM NRAESEMMRY
MRRLSDRDLA LDRAMIPLGS CTMKLNAAAE MMPITWPEFA TLHPFAPPEQ AAGYTEAIRD
LSDRLCRITG YDAMSMQPNS GAQGEYAGLL TILAYHRARG EGQRTICLIP MSAHGTNPAS
AQMAGMKVVV VKSAPNGDVD LDDFRDKAAA AGDRLAACMI TYPSTHGVFE ETVRDVCRIT
HEHGGQVYID GANMNAMVGL VQPGAIGGDV SHLNLHKTFA IPHGGGGPGM GPIGVKAHLA
PYLPGHPETG GPLAGGQIVA THEGPVSAAP YGSASILLIS WAYCLMMGGE GLTQATRVAI
LNANYVAARL RGAYDVLFMG NRGRVAHECI LDTRPFAEAG VTVDDIAKRL IDNGFHAPTM
SWPVPGTLMV EPTESETKAE IDRFITALLA IREEIRAVEA GEIAAADSPL RHAPHTVEDL
VADWDRNYPR EQGCFPPGAF RVDKYWPPVG RVDNAWGDRN LVCICPPVES YSIAAQ