Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1731 |
Symbol | |
ID | 4022211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1941809 |
End bp | 1944703 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637961925 |
Product | glycine dehydrogenase |
Protein accession | YP_568868 |
Protein GI | 91976209 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0573168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAATC GCCGACCGAT CGACGCCGCC AACAACTTCG TGCGACGCCA TATCGGCCCG TCGCCGCAGG ACATCGCGCA GATGCTGAGG ACCGTGGGAG CAGGCAGCAT CGACCAGTTG ATGGCCGAAA CGCTGCCTTA TGCGATCCGC ATCAAAGAGC CTCTGTCGCT CGGCGCGCCG CTGTCGGAGA CCGAGGCGCT GGCGCACATG ACAGAACTCG CAGCGAAGAA CGCGGTGTTC ACCTCGCTGA TCGGCCAAGG CTACTCCGGC ACGATCCTGC CGACCGTGAT CCAGCGCAAT ATTCTGGAAA ATCCCGCCTG GTACACGGCC TATACGCCGT ATCAGCCCGA GATCAGCCAA GGCCGGCTGG AAGCGCTGTT CAACTTCCAG ACCATGATCT GCGACCTGAC CGGGCTCGAC CTCGCCAACG CCTCGCTGCT CGACGAGGCG ACCGCGGCGG CGGAAGCGAT GGCGCTGGCG GAACGCGCCG CGCAAAAGAA GACCAAGGCG TTCTTCGTCG ATCGCGACAC CCATCCGCAG ACGCTGGCGG TGCTGCGCAC CCGCGCCGAA CCACTCGGCT GGTCGATCAT CGTCGGCGAT CCGGACACCG AGCTCGAAGC CGCGGACGTG TTCGGTGCTT TGCTGCAATA TCCCGGCTCG TCCGGCGCTC TGCGCGACCC GCGCCCGGCG ATCGCGACGC TGCACAACAA GGGCGCGCTC GCGGTGATCG CCGCGGATCT GCTGGCGCTG ACGCTGATCA CGTCGCCCGG CGAACTCGGC GCCGATATCG CGATCGGCTC GGCGCAACGC TTCGGCGTGC CGATGGGCTA TGGCGGCCCG CATGCCGGCT ACATGGCCGC GCGCGACAGC CTCAAGCGTT CGCTCCCCGG CCGCATCGTC GGCCTGTCGA TCGATTCGCA CGGCCAGCCG GCCTATCGGC TGGCGATGCA GACCCGCGAG CAGCACATCC GCCGCGAGAA GGCGACCTCG AACATCTGCA CCGCGCAGGT GCTGCTGGCG GTGATTGCCG CGATGTATGC GGTGTATCAC GGCCCGGACG GCCTCGCCGC GATTGCCCGC CGGGTGCATC GCCGCACCGC GACGCTGGCG AGCGGGCTGA AGCAGCTCGG CTTTGCGCCC ATCAACGACG CCTATTTCGA TACGCTGACG GTCGAGGTCG GCGACAAGCG CGACGCCATC GTCGCGCGCG CCGAGGCCGA ACAGATCAAT CTGCGGATCG GCGCGACCTC GCTCGGCATC TCGCTCGATG AAACCACCAC CCCCGCGATC GTCGAGGCGC TATGGCGCGC GTTCGACGGA TCGCTCGACT ACGCAAGCGT CGAGCGCGAC GCGACCGACA CGCTGCCGGC GGCGCTGACG CGGACGAGCG ACTACCTGAC GCAACCGGCG TTCCAGGACT ATCGCTCGGA GACCGAATTG CTCCGCTACA TGCGCAAGCT GTCGGACCGC GACCTCGCGC TCGACCGCGC GATGATTCCG CTCGGCTCCT GCACCATGAA ACTCAACGCC ACTACCGAGA TGATGCCGCT GACCTGGCCT CAATTCGGCA GCCTGCATCC GTTCGTGCCG CGGGCGCAGG CGGAGGGCTA TCATGCGATG TTCGCGACGC TGGAAGCCTG GCTCGCCGAG ATCACCGGCT ACGACGCCGT GTCGCTGCAG CCGAATTCCG GCGCGCAGGG CGAATATGCC GGCCTGCTGG CGATCCGCGG CTATCATCTG TCGCGCGGCG AGCCGCACCG CAAGATCTGC CTGATCCCCT CCTCCGCGCA CGGCACCAAT CCGGCCTCGG CCGCGATGGT CGGGATGGAT GTCGTCGTGG TCGCTTGCAA CAATCATGGC GACGTCGACG TCGACGATCT GCGCGCCAAG GCGGAGAAGC ATTCGGCCGA ACTCGCCGCG GTGATGATCA CCTATCCGTC GACCCACGGC GTGTTCGAGG AGCACATTCG CGAGATCTGC GACATCGTCC ATGCCCATGG CGGCCAGATC TATCTCGACG GCGCCAACCT CAACGCGCAG GTCGGCCTGG CGCGGCCCGG CGACTACGGC GCCGATGTCA GTCACCTCAA TCTGCACAAG ACTTTCTGCA TTCCGCATGG CGGCGGCGGC CCGGGGATGG GGCCGATCGG CGTCAAGGCG CATCTGGCGC CGTTCCTGCC CGGCCACCCG GCCGAGGGCG AGCCGTCGAG CGGCGTGCTG CACGGCGGCG GCACGGTGTC GGCAGCGCCC TATGGCTCGG CGTCGATTCT CACGATCTCC TATATCTACA TTCTGATGAT GGGCGGCGCC GGCCTGAAGC GGGCCACCGA GATCGCGATC CTCAACGCCA ACTACATCGC GGCGCGACTG CAGCCGCATT TCCCGGTGCT GTATCGCAAC CTGCGCGGCC GCGTCGCGCA TGAATGCATC GTCGATCCGC GACCGCTGAA GACGACGACC GGGGTGACGG TCGACGACAT CGCCAAGCGG CTGATCGACT ACGGCTTCCA CGCCCCGACC ATGAGCTTCC CGGTGCCGGG CACGCTGATG ATCGAGCCAA CCGAATCGGA GTCCAAGGCG GAGATCGACC GGTTCTGCGA GGCGATGATC GCGATCCGGC GGGAGATCGC CCAGATCGAG CAAGGCCGGT TCAAGGTCGA GGCGTCGCCG CTGCGCTTTG CGCCGCATAC GGTGCACGAC GTCACCAGCG CGGAATGGAC GCGGCCCTAT CCGCGCACCG AGGGCTGCTT CCCGGCTCCG AACTCGCGCA CCGACAAATA TTGGTGCCCG GTCGGCCGCG TCGACAACGT CTATGGCGAC CGCAACCTCG TGTGCGCGTG CCCGCCGATC GAAGACTACG CACTGGCGGC GGACTACGCC CGGGCGGCGG AGTAG
|
Protein sequence | MPNRRPIDAA NNFVRRHIGP SPQDIAQMLR TVGAGSIDQL MAETLPYAIR IKEPLSLGAP LSETEALAHM TELAAKNAVF TSLIGQGYSG TILPTVIQRN ILENPAWYTA YTPYQPEISQ GRLEALFNFQ TMICDLTGLD LANASLLDEA TAAAEAMALA ERAAQKKTKA FFVDRDTHPQ TLAVLRTRAE PLGWSIIVGD PDTELEAADV FGALLQYPGS SGALRDPRPA IATLHNKGAL AVIAADLLAL TLITSPGELG ADIAIGSAQR FGVPMGYGGP HAGYMAARDS LKRSLPGRIV GLSIDSHGQP AYRLAMQTRE QHIRREKATS NICTAQVLLA VIAAMYAVYH GPDGLAAIAR RVHRRTATLA SGLKQLGFAP INDAYFDTLT VEVGDKRDAI VARAEAEQIN LRIGATSLGI SLDETTTPAI VEALWRAFDG SLDYASVERD ATDTLPAALT RTSDYLTQPA FQDYRSETEL LRYMRKLSDR DLALDRAMIP LGSCTMKLNA TTEMMPLTWP QFGSLHPFVP RAQAEGYHAM FATLEAWLAE ITGYDAVSLQ PNSGAQGEYA GLLAIRGYHL SRGEPHRKIC LIPSSAHGTN PASAAMVGMD VVVVACNNHG DVDVDDLRAK AEKHSAELAA VMITYPSTHG VFEEHIREIC DIVHAHGGQI YLDGANLNAQ VGLARPGDYG ADVSHLNLHK TFCIPHGGGG PGMGPIGVKA HLAPFLPGHP AEGEPSSGVL HGGGTVSAAP YGSASILTIS YIYILMMGGA GLKRATEIAI LNANYIAARL QPHFPVLYRN LRGRVAHECI VDPRPLKTTT GVTVDDIAKR LIDYGFHAPT MSFPVPGTLM IEPTESESKA EIDRFCEAMI AIRREIAQIE QGRFKVEASP LRFAPHTVHD VTSAEWTRPY PRTEGCFPAP NSRTDKYWCP VGRVDNVYGD RNLVCACPPI EDYALAADYA RAAE
|
| |