Gene RSP_2195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2195 
SymbolgcvP 
ID3719724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp806327 
End bp809197 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content69% 
IMG OID640070367 
Productglycine dehydrogenase 
Protein accessionYP_352251 
Protein GI77462747 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTCA CCCCCACGGA CTATAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC 
CCATCGCCCT CCGAAATGGA AGAGATGCTG CGCGTGGTGG GGGTGTCCTC GCTCGACCAG
CTGATCGAGG AGACGGTGCC CGCCTCGATC CGGCAGGAGA CGCCGCTCGA CTGGGCGCCG
CTCGCCGAGC ATGAGCTGCT CGCCCGGATG CGCGAGGTGG CTGCCAAGAA CCGCGTGATG
ACCTCGCTCA TCGGGCAGGG CTATTACGGC ACGGTCACGC CGCCTGCCAT CCAGCGCAAC
ATCCTCGAGA ATCCGGCCTG GTACACGGCC TATACGCCCT ACCAGCCCGA GATCGCGCAG
GGGCGGCTCG AGGCGCTCTT GAACTACCAG ACCATGGTCG CGGACCTCAC CGGCCTGCCG
GTCGCCAATG CCTCGCTTCT CGACGAGGCG ACGGCGGCGG CCGAGGCCAT GACCATGGCC
GAGCGGGCCT CGAAGTCGAA GGCGCGGGCC TTCTTCGTCG ATGCCGACTG CCATCCGCAG
ACGATCTCGG TCATCCGCAC CCGCGCCGAG CCGCTGGGCA TCGAGGTGAT CGTGGGCCAT
CCCGCGCAGC TCGTGCCCGA GGATGTGTTC GGCGCGCTGT TCCAGTATCC CGGCACTTAC
GGGCTCGTGC GCGACTTCAC CCGCGATATT GCGGCGCTGC ACGAGGCGAA GGCGCTGGCC
GTGGTGGCGA CCGACCTTCT GGCGCTCTGC CTCCTCAAGG AGCCGGGCGC GATGGGCGCC
GACATCGCCA TCGGCTCGAG CCAGCGGTTC GGCGTGCCGA TGGGCTACGG CGGTCCGCAT
GCGGCCTTCA TGTCCTGCAA GGACGATCTG AAACGGTCGA TGCCCGGCCG GCTCGTCGGC
GTTTCGGTCG ATGCGCGCGG CAACAAGGCC TATCGCCTCG CCCTCCAGAC CCGCGAGCAG
CATATCCGCC GCGAGAAGGC CACCTCGAAC GTCTGCACCG CGCAGGCGCT TCTGGCGGTC
ATGGCGAGCT TCTATGCCGT CTTCCACGGC CCCCGCGGCC TCCGCGCCAT CGCCGAGCGG
GTGCATCTGA ACACCGTCCG CCTCGCGACC GCGCTGAAGG AGGCGGGAGC CCGCGTCAGC
CCCGAGGCCT TCTTCGACAC GATCACCGTC GAGGTGGGCG TGGGGCAGGC GGGGATCCTC
GCCGCTGCGC GTCACCGGGG CATCAACCTG CGCAAGGTGG GCCGCGACCG GGTGGGGATC
TCGCTCGACG AGACGACCGA TGCCGGCGTG ATCGCGCGCG TGCTCGATGC CTTCGGGATC
CACGAGCCCG CCCCCGCGAA GGTGGGCTTG GGCTTCCCCG AGCCGCTCCT GCGCGAGACC
GGCTATCTCT CGCATCCGGT CTTCCAGATG AACCGGGCGG AATCCGAGAT GATGCGCTAC
ATGCGCCGCC TTTCGGACCG CGACCTCGCG CTGGACCGGG CGATGATCCC GCTTGGCTCC
TGCACGATGA AGCTGAATGC CGCCGCCGAG ATGATGCCCA TCACCTGGCC CGAATTCGGC
ACGCTGCATC CGTTCGCGCC GGCCGATCAG GCCGCGGGCT ATCACGAGGC CATCGGCGAT
CTGGCCCAGC GGCTCTGCCG GATCACCGGC TACGACGCCA TGTCGATGCA GCCGAACTCG
GGCGCGCAGG GCGAATATGC GGGGCTTCTC ACCATCCTCG CCTATCACCG GGCGCGGGGC
GACGCGGAGC GCACGATCTG CCTCATCCCG GTCTCGGCGC ATGGCACCAA CCCGGCCTCG
GCGCAGATGG CAGGGATGAA GGTGGTCGTG GTGAAGTCGG CGCCGAACGG CGACGTCGAT
CTCGAGGATT TTCGCGACAA GGCGGCGGCG GCGGGCGACC GGCTCGCGGC CTGCATGATC
ACCTATCCCT CGACCCACGG CGTCTTCGAG GAGACGGTGC GCGAGGTCTG CCGCATCACC
CACGAGCACG GCGGTCAGGT CTATATCGAC GGGGCCAACA TGAACGCGAT GGTGGGCCTC
GTGCAGCCGG GCGCCATCGG CGGCGATGTG AGCCACCTGA ACCTGCACAA GACCTTCGCC
ATTCCGCATG GCGGCGGCGG CCCCGGCATG GGGCCCATCG GGGTCAAGGC GCATCTCGCG
CCGTACCTGC CGGGCCATCC CGAGGTGACG GGGCCGCTGA CCGGAGGCCA TGACGAGGCG
GCCGACGAGG GGCCGGTCTC GGCCGCGCCC TATGGCTCGG CCTCGATCCT GCTCATCTCC
TGGGCCTATT GCCTGATGAT GGGCGGCGAG GGGCTCACGC AGGCGACGCG GGTCGCGATC
CTGAACGCAA ACTATATCGC GGCGCGGTTG CGCGGGGCCT ACAAGGTGCT GTTCATGGGC
AATCGCGGGC GCGTGGCCCA CGAGTGCATC CTCGACACCC GGCCCTTCGC CGAGGCGGGC
GTGACGGTGG ACGATATCGC CAAGCGGCTG ATCGACAACG GCTTCCACGC GCCGACCATG
AGCTGGCCCG TGCCGGGGAC GCTGATGGTC GAGCCCACGG AATCCGAGAC CAAGGCCGAG
ATCGACCGGT TCGTGGCGGC GCTCCTGGCG ATCCGCGAGG AGATCCGCGC GGTCGAGGAG
GGCGAGATCG CCGCGGCGGA CAGCCCGCTG CGCCATGCGC CGCATACGGT CGAGGATCTG
GTGGCGGACT GGGATCGCAA ATATCCGCGC GAGCAGGGCT GCTTCCCGCC GGGCTCGTTC
CGGGTCGACA AATACTGGCC GCCCGTCGGC CGCGTCGACA ATGCGTGGGG CGACCGCAAC
CTCGTCTGCA CCTGTCCGCC GGTGGAAAGC TACAGCATCG CCGCACAATA G
 
Protein sequence
MSFTPTDYNA YDFANRRHIG PSPSEMEEML RVVGVSSLDQ LIEETVPASI RQETPLDWAP 
LAEHELLARM REVAAKNRVM TSLIGQGYYG TVTPPAIQRN ILENPAWYTA YTPYQPEIAQ
GRLEALLNYQ TMVADLTGLP VANASLLDEA TAAAEAMTMA ERASKSKARA FFVDADCHPQ
TISVIRTRAE PLGIEVIVGH PAQLVPEDVF GALFQYPGTY GLVRDFTRDI AALHEAKALA
VVATDLLALC LLKEPGAMGA DIAIGSSQRF GVPMGYGGPH AAFMSCKDDL KRSMPGRLVG
VSVDARGNKA YRLALQTREQ HIRREKATSN VCTAQALLAV MASFYAVFHG PRGLRAIAER
VHLNTVRLAT ALKEAGARVS PEAFFDTITV EVGVGQAGIL AAARHRGINL RKVGRDRVGI
SLDETTDAGV IARVLDAFGI HEPAPAKVGL GFPEPLLRET GYLSHPVFQM NRAESEMMRY
MRRLSDRDLA LDRAMIPLGS CTMKLNAAAE MMPITWPEFG TLHPFAPADQ AAGYHEAIGD
LAQRLCRITG YDAMSMQPNS GAQGEYAGLL TILAYHRARG DAERTICLIP VSAHGTNPAS
AQMAGMKVVV VKSAPNGDVD LEDFRDKAAA AGDRLAACMI TYPSTHGVFE ETVREVCRIT
HEHGGQVYID GANMNAMVGL VQPGAIGGDV SHLNLHKTFA IPHGGGGPGM GPIGVKAHLA
PYLPGHPEVT GPLTGGHDEA ADEGPVSAAP YGSASILLIS WAYCLMMGGE GLTQATRVAI
LNANYIAARL RGAYKVLFMG NRGRVAHECI LDTRPFAEAG VTVDDIAKRL IDNGFHAPTM
SWPVPGTLMV EPTESETKAE IDRFVAALLA IREEIRAVEE GEIAAADSPL RHAPHTVEDL
VADWDRKYPR EQGCFPPGSF RVDKYWPPVG RVDNAWGDRN LVCTCPPVES YSIAAQ