Gene Information |
Locus tag | RSP_2195 |
Symbol | gcvP |
ID | 3719724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 806327 |
End bp | 809197 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640070367 |
Product | glycine dehydrogenase |
Protein accession | YP_352251 |
Protein GI | 77462747 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain; [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
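The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the terminal stop codon, which is consistent with the values in this record but is not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(806327, 809197)
print(length)                     # 2871
print(protein_length_aa(length))  # 956
```

Both results agree with the Gene Length (2871 bp) and Protein Length (956 aa) fields.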
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTCA CCCCCACGGA CTATAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC CCATCGCCCT CCGAAATGGA AGAGATGCTG CGCGTGGTGG GGGTGTCCTC GCTCGACCAG CTGATCGAGG AGACGGTGCC CGCCTCGATC CGGCAGGAGA CGCCGCTCGA CTGGGCGCCG CTCGCCGAGC ATGAGCTGCT CGCCCGGATG CGCGAGGTGG CTGCCAAGAA CCGCGTGATG ACCTCGCTCA TCGGGCAGGG CTATTACGGC ACGGTCACGC CGCCTGCCAT CCAGCGCAAC ATCCTCGAGA ATCCGGCCTG GTACACGGCC TATACGCCCT ACCAGCCCGA GATCGCGCAG GGGCGGCTCG AGGCGCTCTT GAACTACCAG ACCATGGTCG CGGACCTCAC CGGCCTGCCG GTCGCCAATG CCTCGCTTCT CGACGAGGCG ACGGCGGCGG CCGAGGCCAT GACCATGGCC GAGCGGGCCT CGAAGTCGAA GGCGCGGGCC TTCTTCGTCG ATGCCGACTG CCATCCGCAG ACGATCTCGG TCATCCGCAC CCGCGCCGAG CCGCTGGGCA TCGAGGTGAT CGTGGGCCAT CCCGCGCAGC TCGTGCCCGA GGATGTGTTC GGCGCGCTGT TCCAGTATCC CGGCACTTAC GGGCTCGTGC GCGACTTCAC CCGCGATATT GCGGCGCTGC ACGAGGCGAA GGCGCTGGCC GTGGTGGCGA CCGACCTTCT GGCGCTCTGC CTCCTCAAGG AGCCGGGCGC GATGGGCGCC GACATCGCCA TCGGCTCGAG CCAGCGGTTC GGCGTGCCGA TGGGCTACGG CGGTCCGCAT GCGGCCTTCA TGTCCTGCAA GGACGATCTG AAACGGTCGA TGCCCGGCCG GCTCGTCGGC GTTTCGGTCG ATGCGCGCGG CAACAAGGCC TATCGCCTCG CCCTCCAGAC CCGCGAGCAG CATATCCGCC GCGAGAAGGC CACCTCGAAC GTCTGCACCG CGCAGGCGCT TCTGGCGGTC ATGGCGAGCT TCTATGCCGT CTTCCACGGC CCCCGCGGCC TCCGCGCCAT CGCCGAGCGG GTGCATCTGA ACACCGTCCG CCTCGCGACC GCGCTGAAGG AGGCGGGAGC CCGCGTCAGC CCCGAGGCCT TCTTCGACAC GATCACCGTC GAGGTGGGCG TGGGGCAGGC GGGGATCCTC GCCGCTGCGC GTCACCGGGG CATCAACCTG CGCAAGGTGG GCCGCGACCG GGTGGGGATC TCGCTCGACG AGACGACCGA TGCCGGCGTG ATCGCGCGCG TGCTCGATGC CTTCGGGATC CACGAGCCCG CCCCCGCGAA GGTGGGCTTG GGCTTCCCCG AGCCGCTCCT GCGCGAGACC GGCTATCTCT CGCATCCGGT CTTCCAGATG AACCGGGCGG AATCCGAGAT GATGCGCTAC ATGCGCCGCC TTTCGGACCG CGACCTCGCG CTGGACCGGG CGATGATCCC GCTTGGCTCC TGCACGATGA AGCTGAATGC CGCCGCCGAG ATGATGCCCA TCACCTGGCC CGAATTCGGC ACGCTGCATC CGTTCGCGCC GGCCGATCAG GCCGCGGGCT ATCACGAGGC CATCGGCGAT CTGGCCCAGC GGCTCTGCCG GATCACCGGC TACGACGCCA TGTCGATGCA GCCGAACTCG GGCGCGCAGG GCGAATATGC GGGGCTTCTC ACCATCCTCG CCTATCACCG GGCGCGGGGC GACGCGGAGC GCACGATCTG CCTCATCCCG GTCTCGGCGC ATGGCACCAA CCCGGCCTCG 
GCGCAGATGG CAGGGATGAA GGTGGTCGTG GTGAAGTCGG CGCCGAACGG CGACGTCGAT CTCGAGGATT TTCGCGACAA GGCGGCGGCG GCGGGCGACC GGCTCGCGGC CTGCATGATC ACCTATCCCT CGACCCACGG CGTCTTCGAG GAGACGGTGC GCGAGGTCTG CCGCATCACC CACGAGCACG GCGGTCAGGT CTATATCGAC GGGGCCAACA TGAACGCGAT GGTGGGCCTC GTGCAGCCGG GCGCCATCGG CGGCGATGTG AGCCACCTGA ACCTGCACAA GACCTTCGCC ATTCCGCATG GCGGCGGCGG CCCCGGCATG GGGCCCATCG GGGTCAAGGC GCATCTCGCG CCGTACCTGC CGGGCCATCC CGAGGTGACG GGGCCGCTGA CCGGAGGCCA TGACGAGGCG GCCGACGAGG GGCCGGTCTC GGCCGCGCCC TATGGCTCGG CCTCGATCCT GCTCATCTCC TGGGCCTATT GCCTGATGAT GGGCGGCGAG GGGCTCACGC AGGCGACGCG GGTCGCGATC CTGAACGCAA ACTATATCGC GGCGCGGTTG CGCGGGGCCT ACAAGGTGCT GTTCATGGGC AATCGCGGGC GCGTGGCCCA CGAGTGCATC CTCGACACCC GGCCCTTCGC CGAGGCGGGC GTGACGGTGG ACGATATCGC CAAGCGGCTG ATCGACAACG GCTTCCACGC GCCGACCATG AGCTGGCCCG TGCCGGGGAC GCTGATGGTC GAGCCCACGG AATCCGAGAC CAAGGCCGAG ATCGACCGGT TCGTGGCGGC GCTCCTGGCG ATCCGCGAGG AGATCCGCGC GGTCGAGGAG GGCGAGATCG CCGCGGCGGA CAGCCCGCTG CGCCATGCGC CGCATACGGT CGAGGATCTG GTGGCGGACT GGGATCGCAA ATATCCGCGC GAGCAGGGCT GCTTCCCGCC GGGCTCGTTC CGGGTCGACA AATACTGGCC GCCCGTCGGC CGCGTCGACA ATGCGTGGGG CGACCGCAAC CTCGTCTGCA CCTGTCCGCC GGTGGAAAGC TACAGCATCG CCGCACAATA G
|
Protein sequence | MSFTPTDYNA YDFANRRHIG PSPSEMEEML RVVGVSSLDQ LIEETVPASI RQETPLDWAP LAEHELLARM REVAAKNRVM TSLIGQGYYG TVTPPAIQRN ILENPAWYTA YTPYQPEIAQ GRLEALLNYQ TMVADLTGLP VANASLLDEA TAAAEAMTMA ERASKSKARA FFVDADCHPQ TISVIRTRAE PLGIEVIVGH PAQLVPEDVF GALFQYPGTY GLVRDFTRDI AALHEAKALA VVATDLLALC LLKEPGAMGA DIAIGSSQRF GVPMGYGGPH AAFMSCKDDL KRSMPGRLVG VSVDARGNKA YRLALQTREQ HIRREKATSN VCTAQALLAV MASFYAVFHG PRGLRAIAER VHLNTVRLAT ALKEAGARVS PEAFFDTITV EVGVGQAGIL AAARHRGINL RKVGRDRVGI SLDETTDAGV IARVLDAFGI HEPAPAKVGL GFPEPLLRET GYLSHPVFQM NRAESEMMRY MRRLSDRDLA LDRAMIPLGS CTMKLNAAAE MMPITWPEFG TLHPFAPADQ AAGYHEAIGD LAQRLCRITG YDAMSMQPNS GAQGEYAGLL TILAYHRARG DAERTICLIP VSAHGTNPAS AQMAGMKVVV VKSAPNGDVD LEDFRDKAAA AGDRLAACMI TYPSTHGVFE ETVREVCRIT HEHGGQVYID GANMNAMVGL VQPGAIGGDV SHLNLHKTFA IPHGGGGPGM GPIGVKAHLA PYLPGHPEVT GPLTGGHDEA ADEGPVSAAP YGSASILLIS WAYCLMMGGE GLTQATRVAI LNANYIAARL RGAYKVLFMG NRGRVAHECI LDTRPFAEAG VTVDDIAKRL IDNGFHAPTM SWPVPGTLMV EPTESETKAE IDRFVAALLA IREEIRAVEE GEIAAADSPL RHAPHTVEDL VADWDRKYPR EQGCFPPGSF RVDKYWPPVG RVDNAWGDRN LVCTCPPVES YSIAAQ
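The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCTTCA CCCCCACGGA CTATAACGCC"))  # MSFTPTDYNA
# Last 12 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("GCCGCACAATAG"))  # AAQ
```

The two excerpts reproduce the first ten residues (MSFTPTDYNA) and the last three residues (AAQ) of the protein sequence listed in this record.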
|
| |