Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1568 |
Symbol | |
ID | 3973124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1699407 |
End bp | 1702280 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637924684 |
Product | glycine dehydrogenase |
Protein accession | YP_531449 |
Protein GI | 90423079 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.34808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000847802 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCGCCC ACCGCCAGTC CGAGGACATC GCCACCGGTT TCGCCCGCCG CCACATCGGG CCGTCGCCGC AGGACATCCG CGGCATGCTG CGCGTGGTGG GGGCCGAGAG TCTGGAGGCT TTGGTCGACC AGACGCTGCC GGCGGCGATC CGGCAGCGCG CGCCGCTCGA CCTCGGCCAG CCTCTGACCG AGACCGAGGC GCTGGCGCAT ATGGCCGAGT TGGCCTGCCG CAACGAGGTG TTCACCTCGC TGATCGGCCA AGGCTATTCC GGCACCATCC TGCCGGCGGT GATCCAGCGC AACATTCTGG AGAATCCGGC CTGGTACACC GCCTATACGC CGTATCAGCC GGAGATCAGC CAGGGCCGGC TGGAAGCCTT GTTCAACTAC CAGACCATGA TCTGCGATCT CACCGGGCTC GACGTCGCCA ACGCCTCGCT GCTCGACGAG GCCACTGCCG CTGCGGAAGC GATGGCCTTG GCGGAGCGCA GCGCGCACAA GAAAACCAAG GCGTTCTTCG TCGACCGCGA AGTGCATCCG CAGACGCTTG AAGTGCTACG CACCCGCGCC GAGCCGCTCG GCTGGACGCT GATCGTCGGC GACCCGGTGC ACGATCTGCA GAAGGCCGAC GTGTTCGGCG CGCTGATCCA ATATCCCGGC ACGTCCGGCG CGGTGCGCGA TCCGCGGCCG ATGATCGCCG CATTGCGCGC CAAGGGCGGG CTCGCCATCA TCGCCGCCGA CCTGTTGGCG CTGACGCTGT TGGCCTCGCC GGGCGATCTC GGCGCCGACA TCGCGATCGG TTCGGCGCAG CGCTTCGGCG TGCCGATGGG CTACGGCGGG CCGCACGCCG CCTTCATGGC GGTGCGCGAC AGCTTGAAAC GCGCGCTGCC CGGCCGCCTG GTCGGGCAGT CGATCGACGT GCACGGCCAG CCGGCCTATC GGCTGGCGCT GCAGACCCGC GAGCAGCACA TCCGCCGCGA GAAGGCCACC TCCAACATCT GCACCGCGCA GGTGCTGCTG GCGGTGATCG CCGCGATGTA TGCGGTGTAT CACGGCCCCG ACGGGCTCTC CGAAATCGCC CGCCGCGTGC ACCGGCGCGC CGCGGTGTTG GCCGCAGGCT TACGCAAGCT CGGCTTGCCG CCGCACAATG AATCGTTCTT CGACACGCTG ACCGTGGAGG TCGGCGCCCG GCAGAGCGAG ATCGTGGCGC GCGCGCTGAA TGAACGGATC AATCTGCGGA TCGGCGACGG CACGCTCGGC ATCGCGCTCG ACGAGACCAC CACGCCGGCC GTTGTGGAAG CGGTGTGGCG GGCGTTCGGC GGCACGCTGA ACTATCGCGA GGTCGAGCCG GAAATGCGCG ACACGCTGCA TCCGGCGTTG AAGCGCACTT CGAGTTTCAT GACCCAGGAC GTATTCCAGG CCTATCGCTC GGAGACCGAG CTGCTGCGCT ACATGCGCAA ATTGAGCGAC CGCGACCTCG CGCTCGACCG CGCCATGATC CCGCTCGGCT CCTGCACCAT GAAGCTGAAC GCCACCACCG AGATGATGCC GCTGACCTGG CCGGCGTTCG CCAGCCTGCA CCCGTTCGCG CCGCGCCAGC AGGCCGCGGG CTATCACGCG CTGTTCGCCA AGCTGGAAGC CTGGCTCGCC GACATCACCG GCTACGACGC GGTGTCGCTG CAGCCGAATT CCGGCGCGCA GGGCGAATAT GCCGGGATGC TGGCGATCCG GCACTATCAC GCCGCGCGCG GCGAATCGCA TCGCAAGGTC TGCCTGATCC CGTCCTCGGC GCACGGCACC AACCCGGCCT CCGCCAGCAT GGCGGGCATG GACGTGGTGG TGGTCGCCTG CAACAATCGC GGCGACGTCG ACGTCGAGGA TCTGCGCGCC AAGGCCGCCG AGCACAGCGC CGACCTCGCC GCGGTGATGA TCACCTATCC GTCGACCCAC GGCGTGTTCG AAGAGCACAT CCGCGAGATC TGCGACATCG TCCACGGCCA TGGCGGCCAG GTCTATCTCG ACGGCGCCAA TCTCAACGCC CAGGTCGGGC TGGCGCAGCC CGGCAAATAC GGCGCCGACG TCAGCCATCT CAACCTGCAC AAGACGTTCT GCATTCCGCA TGGCGGCGGC GGCCCCGGCA TGGGACCGAT CGGGGTGCGG GCGCATCTGG CGCCGTACCT GCCCGGGCAT CTGGTGCTCG ACGGCACCGA CGAGGCGCGC TCCGGCGGCG CGGTGGCGGC GGCGCCGTTC GGCTCGGCGT CGATCCTGAC CATCTCCTAC ATCTACATCC TGATGATGGG CGGCGAAGGA CTGCGCCGCG CCACCGAAGT CGCGATCCTC AACGCCAACT ACATCGCGGC AAAGCTCGAT CCGCATTTTC CGGTGCTGTA TCGCAACGAG CGCGGCCGCG TCGCGCATGA ATGCATCGTC GATCCGCGCG CGCTGAAGAA CTCCAGCGGC GTCACCGTCG ACGACATCGC CAAGCGGCTG ATCGACTACG GCTTCCACGC GCCGACCATG AGCTTCCCGG TGCCGGGCAC GCTGATGATC GAGCCGACCG AATCGGAATC CAAGGCCGAG ATCGACCGTT TCTGCGACGC CATGATCGCG ATCCGGCGCG AGATCGCCGA GATCGAGGCC GGGCGCTGGA GCGTGGAAAC CTCGCCGCTG CGCCACGCCC CGCACACCGT GCACGACATC GCCGAAGAGG TCTGGAAGCG GCCCTATACC AGGCACGAGG GCTGCTTCCC CGCCGGCACC ACCCGCACCG ACAAATACTG GTGCCCGGTC GGCCGCATCG ACAACGTCTA TGGCGACCGC AATCTGGTGT GCTCCTGCCC GCCGATCGAG GACTACGCGC TGGCGGCGGA GTGA
|
Protein sequence | MTAHRQSEDI ATGFARRHIG PSPQDIRGML RVVGAESLEA LVDQTLPAAI RQRAPLDLGQ PLTETEALAH MAELACRNEV FTSLIGQGYS GTILPAVIQR NILENPAWYT AYTPYQPEIS QGRLEALFNY QTMICDLTGL DVANASLLDE ATAAAEAMAL AERSAHKKTK AFFVDREVHP QTLEVLRTRA EPLGWTLIVG DPVHDLQKAD VFGALIQYPG TSGAVRDPRP MIAALRAKGG LAIIAADLLA LTLLASPGDL GADIAIGSAQ RFGVPMGYGG PHAAFMAVRD SLKRALPGRL VGQSIDVHGQ PAYRLALQTR EQHIRREKAT SNICTAQVLL AVIAAMYAVY HGPDGLSEIA RRVHRRAAVL AAGLRKLGLP PHNESFFDTL TVEVGARQSE IVARALNERI NLRIGDGTLG IALDETTTPA VVEAVWRAFG GTLNYREVEP EMRDTLHPAL KRTSSFMTQD VFQAYRSETE LLRYMRKLSD RDLALDRAMI PLGSCTMKLN ATTEMMPLTW PAFASLHPFA PRQQAAGYHA LFAKLEAWLA DITGYDAVSL QPNSGAQGEY AGMLAIRHYH AARGESHRKV CLIPSSAHGT NPASASMAGM DVVVVACNNR GDVDVEDLRA KAAEHSADLA AVMITYPSTH GVFEEHIREI CDIVHGHGGQ VYLDGANLNA QVGLAQPGKY GADVSHLNLH KTFCIPHGGG GPGMGPIGVR AHLAPYLPGH LVLDGTDEAR SGGAVAAAPF GSASILTISY IYILMMGGEG LRRATEVAIL NANYIAAKLD PHFPVLYRNE RGRVAHECIV DPRALKNSSG VTVDDIAKRL IDYGFHAPTM SFPVPGTLMI EPTESESKAE IDRFCDAMIA IRREIAEIEA GRWSVETSPL RHAPHTVHDI AEEVWKRPYT RHEGCFPAGT TRTDKYWCPV GRIDNVYGDR NLVCSCPPIE DYALAAE
|
| |