Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0870 |
Symbol | |
ID | 4896994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 890078 |
End bp | 892948 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640111455 |
Product | glycine dehydrogenase |
Protein accession | YP_001042753 |
Protein GI | 126461639 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.597554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.602316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCA CCCCCACGGA CTATAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC CCGTCGCCCT CCGAAATGGA GGAGATGCTG CGCGTGGTGG GGGTGTCCTC GCTCGACCAG CTGATCGAGG AGACGGTGCC CGCCTCGATC CGGCAGGAGA CGCCGCTCGA CTGGGCGCCG CTCGCCGAGC ATGAGCTGCT GGCCCGGATG CGCGAGGTGG CCGGCAAGAA CCGCGTGATG ACCTCGCTCA TCGGGCAGGG CTATTACGGC ACGGTGACGC CGCCCGCCAT CCAGCGCAAC ATCCTCGAGA ATCCGGCCTG GTATACGGCC TATACGCCCT ACCAGCCCGA GATCGCGCAG GGGCGGCTCG AGGCGCTCTT GAACTACCAG ACCATGGTCG CGGACCTCAC CGGCCTGCCG GTCGCCAACG CCTCGCTTCT CGACGAGGCG ACGGCGGCGG CCGAGGCCAT GACCATGGCC GAGCGCGCCT CGAAGTCGAA GGCGCGGGCC TTCTTCGTCG ATGCCGACTG CCATCCGCAG ACGATCTCGG TCATCCGCAC CCGCGCCGAG CCGCTGGGCA TCGAGGTGAT CGTGGGCCAT CCCGCGCAGC TCGTGCCCGA GGATGTGTTC GGCGCGCTGT TCCAGTATCC CGGCACCTAC GGGCTCGTGC GCGACTTCAC CCGCGACATT GCCGCGCTGC ACGAGGCGAA GGCGCTCGCC GTGGTGGCGA CCGACCTTCT GGCGCTCTGC CTCCTCAAGG AGCCGGGCGC GATGGGCGCC GACATCGCCA TCGGCTCGAG CCAGCGGTTC GGCGTGCCGA TGGGCTACGG CGGCCCGCAT GCGGCCTTCA TGTCCTGCAA GGACGATCTG AAGCGCTCGA TGCCCGGCCG GCTCGTCGGC GTTTCGGTCG ATGCGCGCGG CAACAAGGCC TACCGCCTCG CCCTCCAGAC CCGCGAGCAG CATATCCGCC GCGAGAAGGC CACCTCGAAC GTCTGCACCG CGCAGGCGCT TCTGGCGGTC ATGGCGAGCT TCTATGCCGT CTTCCACGGC CCCCGCGGCC TCCGCGCCAT CGCCGAGCGG GTGCATCTGA ACACCGTCCG CCTCGCGACC GCGCTGAAGG AGGCCGGGGC CCGCGTCAGC CCCGAGGCCT TCTTCGACAC GATCACCGTC GAGGTGGGCG TGGGGCAGGC GGGGATCCTC GCCGCCGCGC GCCACCGGGG CATCAACCTG CGCAAGGTGG GCCGCGACCG GGTGGGGATC TCGCTCGACG AGACGACCGA TGCCGGCGTG ATCGCGCGCG TGCTCGATGC CTTCGGGATC CACGAGCCCG CGCCCGCGAA AGTGGGCCTC GGCTTCCCCG AGCCGCTCCT GCGCGAGACC GGCTATCTCT CACATCCGGT CTTCCAGATG AACCGGGCGG AATCCGAGAT GATGCGCTAC ATGCGCCGCC TTTCGGACCG TGACCTCGCG CTGGACCGGG CGATGATCCC GCTCGGATCC TGCACGATGA AGCTGAATGC CGCCGCCGAG ATGATGCCCA TCACCTGGCC CGAATTCGGC ACGCTGCATC CGTTCGCGCC GGCCGATCAG GCCGCGGGCT ATCACGAGGC CATCGGCGAT CTGGCCCAGC GGCTCTGCCG GATCACCGGC TACGACGCCA TGTCGATGCA GCCGAACTCG GGCGCGCAGG GCGAATATGC GGGGCTTCTG ACCATCCTCG CCTATCACCG GGCGCGGGGC GAAGCGGAGC GCACGATCTG CCTCATCCCG GTCTCGGCGC ATGGCACCAA CCCGGCCTCG GCGCAGATGG CGGGGATGAA GGTGGTCGTG GTGAAGTCGG CGCCGAACGG CGACGTCGAT CTCGAGGATT TTCGCGACAA GGCGGCGGCG GCGGGCGACC GGCTCGCGGC CTGCATGATC ACCTATCCCT CGACCCACGG CGTCTTCGAG GAGACGGTGC GCGAGGTCTG CCGCATCACC CACGAGCACG GCGGTCAGGT CTATATCGAC GGGGCCAACA TGAACGCGAT GGTGGGCCTC GTGCAGCCGG GCGCCATCGG CGGCGACGTG AGCCACCTCA ACCTGCACAA GACCTTCGCC ATTCCGCATG GCGGCGGCGG CCCCGGCATG GGGCCCATCG GGGTCAAGGC GCATCTGGCG CCCTACCTGC CGGGCCATCC CGAGGTGACG GGGCCGCTGA CCGGCGGCCA TGACGAGGCG GCCGACGAGG GGCCGGTCTC GGCCGCGCCC TATGGCTCGG CCTCGATCCT GCTCATCTCC TGGGCCTATT GCCTGATGAT GGGCGGCGAG GGGCTCACGC AGGCGACGCG GGTCGCGATC CTGAACGCAA ACTATATCGC GGCGCGGTTG CGCGGGGCCT ACAAGGTGCT GTTCATGGGC AATCGCGGGC GCGTGGCCCA CGAATGCATC CTCGACACCC GGCCCTTCGC CGAGGCGGGC GTGACGGTGG ACGATATCGC CAAGCGGCTG ATCGACAACG GCTTCCACGC GCCGACCATG AGCTGGCCCG TGCCGGGGAC CCTGATGGTC GAGCCCACGG AATCCGAGAC CAAGGCCGAG ATCGACCGGT TCGTGGCGGC GCTCCTTGCG ATCCGCGAGG AGATCCGCGC GGTCGAGGAG GGCGAAATCG CCGCCGGGGA CAGCCCGCTG CGCCATGCGC CGCATACGGT CGAGGATCTG GTGGCGGACT GGGACCGCAA ATATCCGCGC GAGCAAGGCT GCTTCCCGCC GGGCTCGTTC CGGGTCGACA AATACTGGCC GCCCGTCGGC CGGGTCGACA ATGCGTGGGG CGACCGCAAC CTCGTCTGCA CCTGTCCGCC GGTGGAAAGC TACAGCATCG CCGCGCAGTA G
|
Protein sequence | MSFTPTDYNA YDFANRRHIG PSPSEMEEML RVVGVSSLDQ LIEETVPASI RQETPLDWAP LAEHELLARM REVAGKNRVM TSLIGQGYYG TVTPPAIQRN ILENPAWYTA YTPYQPEIAQ GRLEALLNYQ TMVADLTGLP VANASLLDEA TAAAEAMTMA ERASKSKARA FFVDADCHPQ TISVIRTRAE PLGIEVIVGH PAQLVPEDVF GALFQYPGTY GLVRDFTRDI AALHEAKALA VVATDLLALC LLKEPGAMGA DIAIGSSQRF GVPMGYGGPH AAFMSCKDDL KRSMPGRLVG VSVDARGNKA YRLALQTREQ HIRREKATSN VCTAQALLAV MASFYAVFHG PRGLRAIAER VHLNTVRLAT ALKEAGARVS PEAFFDTITV EVGVGQAGIL AAARHRGINL RKVGRDRVGI SLDETTDAGV IARVLDAFGI HEPAPAKVGL GFPEPLLRET GYLSHPVFQM NRAESEMMRY MRRLSDRDLA LDRAMIPLGS CTMKLNAAAE MMPITWPEFG TLHPFAPADQ AAGYHEAIGD LAQRLCRITG YDAMSMQPNS GAQGEYAGLL TILAYHRARG EAERTICLIP VSAHGTNPAS AQMAGMKVVV VKSAPNGDVD LEDFRDKAAA AGDRLAACMI TYPSTHGVFE ETVREVCRIT HEHGGQVYID GANMNAMVGL VQPGAIGGDV SHLNLHKTFA IPHGGGGPGM GPIGVKAHLA PYLPGHPEVT GPLTGGHDEA ADEGPVSAAP YGSASILLIS WAYCLMMGGE GLTQATRVAI LNANYIAARL RGAYKVLFMG NRGRVAHECI LDTRPFAEAG VTVDDIAKRL IDNGFHAPTM SWPVPGTLMV EPTESETKAE IDRFVAALLA IREEIRAVEE GEIAAGDSPL RHAPHTVEDL VADWDRKYPR EQGCFPPGSF RVDKYWPPVG RVDNAWGDRN LVCTCPPVES YSIAAQ
|
| |