Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3740 |
Symbol | |
ID | 3911543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4278412 |
End bp | 4281306 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885641 |
Product | glycine dehydrogenase |
Protein accession | YP_487345 |
Protein GI | 86750849 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.872567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAATC GCCGACCGAT CGACGAAGCC CACGACTTCG TGCGACGCCA TATCGGCCCG TCGCCCCAGG ACATCGATGC GATGTTGGCA ACGGTGGGCG CGGACAGCCT CGACCAGCTA ATGGCCGAGA CGCTGCCGGA TTCGATCCGC ATCAAACAGC CGCTGTCGCT CGGCACGCCG CTGTCGGAAC CCGACGCGCT GACGCATATG ACCGAGCTGG CGGCGAAGAA CCAGGTGTTC ACCTCGCTGA TCGGCCAAGG CTACTCCGGC ACGATCCTGC CGACCGTGAT CCAGCGCAAT ATTCTGGAGA ATCCCGCCTG GTACACCGCC TACACGCCGT ATCAGCCGGA GATCAGCCAG GGCCGGCTGG AGGCGCTGTT CAACTTCCAG ACCATGATCT GCGACCTCAC CGGGCTCGAC GTCGCCAACG CCTCGCTGCT CGATGACGCA ACCGCGGCGG CCGAAGCGAT GGCGCTGGCG GAGCGCGCCG TCGCGAAGAA AACCAAGGCG TTCTTCGTCG ACCGCGACAC CCATCCGCAG ACGCTGGCGG TGCTGCGCAC CCGCGCCGAG CCGCTCGGCT GGTCGATCAT CGTCGGCGAT CCGGACACCG AGCTCGAAGC CGCCGACGTA TTCGGCGCGC TGCTGCAATA TCCAGGCTCG TCGGGCGCCT TGCGCGACCC GCGCGCGGTA ATCGCGACGC TACACAAGAA GGGCGCGCTG GCGGTGATCG CCGCCGATCT GCTGGCGCTG ACGCTGATCG CCTCGCCGGG CGAACTCGGC GCCGACATCG CGATCGGCTC GGCGCAGCGC TTCGGCGTGC CGATGGGCTA TGGCGGCCCG CATGCCGGCT ACATGGCGGC GCGCGACAGC CTCAAGCGTT CGTTGCCCGG CCGCATCGTC GGCCTGTCGA TCGACAGCCA CGGCCAGCCG GCCTACCGGC TGGCGCTGCA GACCCGCGAA CAGCACATCC GCCGCGAGAA GGCGACCTCC AACATCTGCA CCGCGCAGGT GCTGCTCGCG GTGATCGCCG CGATGTATGC GGTGTATCAC GGCCCCGATG GCCTCGCCGC GATCGCCCGC CGGGTGCACC GGCGCACCGC GACGCTGGCG GCCGGGCTGA AGCAGCTCGG CTTCGCGCCG ATCAACGAGA CCTACTTCGA TACGCTGACG ATCGAGGTCG GCAGCAAGCG CGACGCCATC GTGGCGCGCG CCGAGGCCGA AAAGATCAAT CTGCGGATCG GCGCGTCGTC GCTCGGCATC GCGCTCGATG AGACCACCAC GCCCGCAACC GTCGAGGCGC TGTGGCGCGC TTTCGGCGGG GAGCTGAACT ACGCGGCAGT CGAGCGCGAC GCGACCGACA CGCTGCCCGC CTCGCTGACG CGGACCGGCG ACTATCTCAC CCAGCCGGCG TTCCAGGACT ATCGCTCGGA GACCGAACTG CTGCGCTACA TGCGCAAACT CAGCGACCGC GACCTGGCGC TCGACCGCGC GATGATCCCG CTCGGCTCCT GCACCATGAA GCTCAACGCC ACCACCGAGA TGATGCCGCT GACCTGGCCG GCCTTCGGCA GCCTGCATCC GTTCGTGCCG CGCGCGCAGG CGCAGGGCTA TCACGAGATG TTCGCGCGGC TCGAGGCCTG GCTCGCCGAG ATCACCGGCT ACGACGCGGT GTCGCTGCAG CCGAATTCCG GCGCGCAGGG CGAATATGCC GGGCTGCTGG CGATCCGCGG CTATCATCTA TCGCGCGGCG AGCCGCATCG CAGGATCTGC CTGATCCCCT CCTCCGCGCA CGGCACCAAC CCGGCGTCGG CGGCGATGAC CGGGATGGAC GTCGTGGTGG TCGCCTGCAA CAGCCATGGC GACGTCGACG TCGACGATCT GCGCGCCAAG GCCGAGAAAC ATTCGGCCGA ACTCGCCGCG GTGATGATCA CCTATCCGTC GACCCACGGC GTGTTCGAGG AGCATATCCG CGACATCTGC GACATCGTCC ACGCCCATGG CGGCCAGGTC TACCTCGACG GCGCCAATCT CAACGCGCAG GTCGGGCTGT CGAGGCCGGG CGACTACGGC GCCGACGTCA GCCACCTCAA TCTGCACAAG ACGTTCTGCA TCCCGCATGG CGGCGGCGGC CCGGGCATGG GCCCGATCGG CGTCAAGGCG CATCTGGCGC CGTTCCTGCC CGGCCATCCG GCCGAGGGCG AGGCTTTGAA CGGCGTGCTG CACGGCGGCG GCACGGTGTC GGCGGCGCCC TACGGCTCGG CGTCGATCCT GACGATCTCC TACATCTACA TCCTGATGAT GGGCGGCGCC GGCCTCAAGC GCGCCACCGA GATCGCGATC CTCAACGCCA ACTACATCGC GGACCGGCTG CAGCCGCATT TCCCGGTGCT GTATCGCAAT CTGCGCGGCC GCGTCGCGCA TGAGTGCATC GTCGATCCGC GGCCGCTGAA GACCACCACC GGCGTCACCG TCGACGACAT CGCCAAGCGG CTGATCGACT ACGGCTTCCA CGCGCCGACG ATGAGCTTCC CGGTGCCAGG CACGCTGATG ATCGAGCCGA CGGAGTCGGA ATCGAAGGCC GAGATCGATC GGTTCTGCGA CGCCATGATC GCGATCCGGC AGGAGATCGC GCAGATCGAG GACGGCCGCT TCAAGGTGGA GGCCTCGCCG CTGCGGTTCG CGCCGCACAC GGTGCACGAC GTCACCTCGG CGGAATGGAC CAGGCCCTAT CCGCGCACCG AGGGCTGTTT CCCGGCGCCG CATTCGCGCA CCGACAAATA TTGGTGCCCG GTCGGCCGCG TCGACAACGT CTATGGCGAC CGCAATCTGG TGTGCTCGTG CCCGCCGATC GAAGACTACG CACTGGCCGC CGACTACGCG CGGGCGGCGG AGTAA
|
Protein sequence | MPNRRPIDEA HDFVRRHIGP SPQDIDAMLA TVGADSLDQL MAETLPDSIR IKQPLSLGTP LSEPDALTHM TELAAKNQVF TSLIGQGYSG TILPTVIQRN ILENPAWYTA YTPYQPEISQ GRLEALFNFQ TMICDLTGLD VANASLLDDA TAAAEAMALA ERAVAKKTKA FFVDRDTHPQ TLAVLRTRAE PLGWSIIVGD PDTELEAADV FGALLQYPGS SGALRDPRAV IATLHKKGAL AVIAADLLAL TLIASPGELG ADIAIGSAQR FGVPMGYGGP HAGYMAARDS LKRSLPGRIV GLSIDSHGQP AYRLALQTRE QHIRREKATS NICTAQVLLA VIAAMYAVYH GPDGLAAIAR RVHRRTATLA AGLKQLGFAP INETYFDTLT IEVGSKRDAI VARAEAEKIN LRIGASSLGI ALDETTTPAT VEALWRAFGG ELNYAAVERD ATDTLPASLT RTGDYLTQPA FQDYRSETEL LRYMRKLSDR DLALDRAMIP LGSCTMKLNA TTEMMPLTWP AFGSLHPFVP RAQAQGYHEM FARLEAWLAE ITGYDAVSLQ PNSGAQGEYA GLLAIRGYHL SRGEPHRRIC LIPSSAHGTN PASAAMTGMD VVVVACNSHG DVDVDDLRAK AEKHSAELAA VMITYPSTHG VFEEHIRDIC DIVHAHGGQV YLDGANLNAQ VGLSRPGDYG ADVSHLNLHK TFCIPHGGGG PGMGPIGVKA HLAPFLPGHP AEGEALNGVL HGGGTVSAAP YGSASILTIS YIYILMMGGA GLKRATEIAI LNANYIADRL QPHFPVLYRN LRGRVAHECI VDPRPLKTTT GVTVDDIAKR LIDYGFHAPT MSFPVPGTLM IEPTESESKA EIDRFCDAMI AIRQEIAQIE DGRFKVEASP LRFAPHTVHD VTSAEWTRPY PRTEGCFPAP HSRTDKYWCP VGRVDNVYGD RNLVCSCPPI EDYALAADYA RAAE
|
| |