Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2297 |
Symbol | |
ID | 5083716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | - |
Start bp | 2335728 |
End bp | 2338598 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640483860 |
Product | glycine dehydrogenase |
Protein accession | YP_001168491 |
Protein GI | 146278332 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.81309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCA CCCCCACCGA CTACAACGCC TACGATTTCG CCAACCGCCG GCACATCGGC CCGTCGCCCT CCGAAATGGA GGAGATGCTG CGCGTGGTGG GGGTGTCCTC GCTCGACCAG CTGATCGAGG AGACGGTGCC CGCCTCGATC CGGCAGGATC AGCCGCTCGA CTGGGCGCCG CTGGCCGAGC ATGAACTTTT GCAGAAGATG CGCGAGGTGG CGGCGAAGAA CCGCGTGATG GTCTCGCTGA TCGGGCAGGG CTATTACGGC ACCGTGACGC CGCCCGCGAT CCAGCGCAAC ATCCTGGAAA ACCCGGCCTG GTACACGGCC TACACGCCCT ACCAGCCCGA GATTGCGCAA GGGCGGCTCG AGGCGCTTCT GAACTATCAG ACCATGGTGG CGGACCTGAC CGGCCTGCCG GTGGCGAACG CCTCGCTTCT GGACGAGGCG ACGGCGGCGG CCGAGGCCAT GACCATGGCC GAGCGCGCCT CGAAGTCGAA GGCGCGGGCC TTCTTCGTCG ATGCCGACTG CCATCCCCAG ACGATCGCCG TGATCCGCAC CCGGGCAGAG CCCCTGGGGA TCGAGGTCAT CGTGGGACAC CCGGCGCAAC TGGTGCCCGA GGACGTGTTC GGCGCGCTCT TCCAGTATCC CGGCACCTAC GGCCTCGTGC GCGACTTCAC CCGCGAGATC GCGGCGCTGC ACGAGGCGAA GGCGCTGGCG ATCGTGGCGA CCGACCTTCT GGCGCTCTGC CTGCTGAAGG AGCCGGGCGC CATGGGCGCC GACATCGCCA TCGGCTCGAG CCAGCGGTTC GGCGTGCCGA TGGGCTATGG CGGCCCGCAC GCGGCCTTCA TGTCCTGCCG GGACGAGCTG AAACGTTCGA TGCCGGGGCG GCTGGTCGGC GTCTCGGTCG ATGCGCGCGG CAACAAGGCC TATCGCCTCG CGCTCCAGAC GCGCGAGCAG CACATCCGGC GCGAGAAGGC GACCTCGAAC GTCTGCACCG CGCAGGCGCT TCTGGCGGTG ATGGCGAGCT TCTACGCCGT CTTCCACGGC CCCAAGGGCC TGCGCCACAT CGCCGAGCGG GTGCATCTGA ACACCGTGCG GCTGGCTCAG GCGCTGAAGG AGGCCGGCGC CCGCGTCAGC CCCGAGGCCT TCTTCGACAC GATCACCGTC GAGGTGGGGG TGGGGCAGGC GGGCATCCTG GCCGCCGCGC GCCACCGTGG CATCAACCTG CGCAAGGTGG GCCGCGACCG GGTGGGCATC TCGCTCGACG AGACGACCGA CGCAGGCGTG ATCGCCCGCG TGCTGGACGC CTTCGGCATC CATGATCCCG CACCCGCGAG CGTGGGCCTC GGCTTCCCCG AGGCGATGCT GCGCGAGTCC GCCTATCTGA GCCACCCGGT CTTTGACATG AACCGGGCCG AGTCCGAGAT GATGCGCTAC ATGCGCCGCC TGTCGGACCG GGATCTGGCG CTCGACCGCG CGATGATCCC GCTGGGAAGC TGCACGATGA AGCTGAACGC CGCGGCCGAG ATGATGCCGA TCACCTGGCC CGAGTTCGCC ACCCTTCATC CCTTCGCGCC ACCCGAACAG GCGGCGGGCT ACACCGAGGC GATCCGCGAC CTCAGCGACC GGCTCTGCCG CATCACCGGC TATGACGCCA TGTCGATGCA GCCGAACTCG GGCGCGCAGG GGGAATATGC GGGGCTGCTC ACGATCCTCG CCTATCACCG CGCGCGGGGC GAGGGGCAGC GCACCATCTG CCTCATCCCG ATGTCGGCGC ACGGGACGAA CCCGGCCTCG GCGCAGATGG CGGGGATGAA GGTGGTCGTG GTCAAGTCGG CGCCGAACGG CGACGTCGAT CTGGACGACT TCCGCGACAA GGCGGCGGCG GCGGGCGACC GGCTCGCGGC CTGCATGATC ACCTATCCCT CGACCCACGG CGTCTTCGAA GAGACGGTGC GCGACGTCTG CCGCATCACC CACGAGCACG GCGGGCAGGT CTATATCGAC GGCGCCAACA TGAACGCGAT GGTGGGCCTC GTGCAGCCGG GCGCCATCGG CGGCGACGTG AGCCACCTGA ACCTGCACAA GACCTTTGCC ATTCCGCATG GCGGCGGCGG TCCGGGCATG GGGCCGATCG GAGTGAAGGC GCATCTCGCG CCCTACCTGC CGGGCCACCC GGAGACGGGC GGCCCCCTGG CCGGAGGCCA GATCGTGGCT ACGCACGAGG GGCCGGTCTC GGCCGCGCCC TATGGCTCGG CCTCGATCCT GCTGATCTCC TGGGCCTATT GCCTGATGAT GGGCGGCGAG GGGCTGACGC AGGCGACGCG GGTGGCGATC CTGAACGCCA ACTATGTCGC GGCGCGGCTG AGGGGGGCCT ATGACGTGCT GTTCATGGGC AACCGCGGGC GCGTGGCGCA TGAGTGCATC CTCGACACCC GCCCCTTCGC CGAGGCCGGC GTGACGGTGG ACGACATTGC CAAGCGCCTG ATCGACAACG GCTTCCACGC CCCCACCATG AGCTGGCCCG TGCCCGGCAC GCTGATGGTG GAGCCGACCG AATCCGAGAC CAAGGCCGAG ATCGACCGCT TCATCACGGC GCTTCTCGCG ATCCGCGAGG AGATCCGGGC GGTCGAGGCG GGCGAGATCG CGGCGGCCGA CAGCCCGCTG CGCCACGCGC CCCACACGGT TGAGGATCTG GTCGCCGACT GGGATCGCAA CTATCCGCGC GAGCAGGGTT GCTTCCCGCC GGGCGCCTTC CGGGTGGACA AGTACTGGCC GCCCGTCGGC CGGGTGGACA ATGCCTGGGG CGATCGCAAC CTCGTCTGCA TCTGCCCGCC GGTCGAAAGC TACAGCATCG CGGCGCAATA G
|
Protein sequence | MTFTPTDYNA YDFANRRHIG PSPSEMEEML RVVGVSSLDQ LIEETVPASI RQDQPLDWAP LAEHELLQKM REVAAKNRVM VSLIGQGYYG TVTPPAIQRN ILENPAWYTA YTPYQPEIAQ GRLEALLNYQ TMVADLTGLP VANASLLDEA TAAAEAMTMA ERASKSKARA FFVDADCHPQ TIAVIRTRAE PLGIEVIVGH PAQLVPEDVF GALFQYPGTY GLVRDFTREI AALHEAKALA IVATDLLALC LLKEPGAMGA DIAIGSSQRF GVPMGYGGPH AAFMSCRDEL KRSMPGRLVG VSVDARGNKA YRLALQTREQ HIRREKATSN VCTAQALLAV MASFYAVFHG PKGLRHIAER VHLNTVRLAQ ALKEAGARVS PEAFFDTITV EVGVGQAGIL AAARHRGINL RKVGRDRVGI SLDETTDAGV IARVLDAFGI HDPAPASVGL GFPEAMLRES AYLSHPVFDM NRAESEMMRY MRRLSDRDLA LDRAMIPLGS CTMKLNAAAE MMPITWPEFA TLHPFAPPEQ AAGYTEAIRD LSDRLCRITG YDAMSMQPNS GAQGEYAGLL TILAYHRARG EGQRTICLIP MSAHGTNPAS AQMAGMKVVV VKSAPNGDVD LDDFRDKAAA AGDRLAACMI TYPSTHGVFE ETVRDVCRIT HEHGGQVYID GANMNAMVGL VQPGAIGGDV SHLNLHKTFA IPHGGGGPGM GPIGVKAHLA PYLPGHPETG GPLAGGQIVA THEGPVSAAP YGSASILLIS WAYCLMMGGE GLTQATRVAI LNANYVAARL RGAYDVLFMG NRGRVAHECI LDTRPFAEAG VTVDDIAKRL IDNGFHAPTM SWPVPGTLMV EPTESETKAE IDRFITALLA IREEIRAVEA GEIAAADSPL RHAPHTVEDL VADWDRNYPR EQGCFPPGAF RVDKYWPPVG RVDNAWGDRN LVCICPPVES YSIAAQ
|
| |