Gene Rsph17029_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0020 
Symbol 
ID4897010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp21507 
End bp23102 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content65% 
IMG OID640110596 
ProductD-3-phosphoglycerate dehydrogenase 
Protein accessionYP_001041912 
Protein GI126460798 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases 
TIGRFAM ID[TIGR01327] D-3-phosphoglycerate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCC GCGTTCTCGT TTCCGACGAA CTCTCCGAAA CCGCCGTCCA GATCTTCCGC 
GACCGTGGCG TCGAGGTCGA CTACATGCCG AAACTCGGCA AGGACAAGGA AAAGCTGGCC
GAGATCATCG GCCAGTACGA CGGCCTCGCG ATCCGTTCGG CCACCAAGGT GACCGAGAAG
CTGCTCGAGC AGGCCACCAA CCTCAAGGTC ATCGGCCGCG CCGGCATCGG GGTCGACAAC
GTCGACATCC CGGCCGCCTC GCGCAAAGGC GTGATCGTGA TGAACACGCC CTTCGGCAAC
TCGATCACCA CCGCCGAACA TGCCATCGCG ATGATGTTCG CCTGCGCCCG GCAGCTGCCC
GAGGCCAATG CCTCGACCCA TGCGGGCAAG TGGGAGAAGT CGCGCTTCAT GGGGGTCGAG
CTCTTCAACA AGACGCTCGG CGTGATCGGC GCGGGCAACA TCGGCGGCAT CGTCTGCGAC
CGGGCGCTGG GGCTCTCGAT GAAAGTCGTG GCCTACGATC CCTTCCTGTC GGAAGAGCGC
GCCAAGGCGC TCGGCGTGAC CAAGGTCGAG CTCGACGACC TGCTGGCGCG CGCCGATTTC
ATCACGCTCC ATGTGCCGCT GACCGACAAG ACCCGCAACA TCCTGTCGGC AGAAGCTATC
GCCAAGACCA AGAAGGGCGT GCGGATCATC AACTGCGCCC GCGGCGGTCT GGTGGACGAG
AAGGCGCTGG CCGAGGCGAT CAAGTCGGGC CATGTGGCCG GCGCTGCCTT CGACGTGTTC
GAGGTCGAGC CCGCCTCCGA AAGCCCGCTC TTCAACCTGC CGAACGTGGT CGTGACGCCG
CACCTCGGCG CCTCCACGAC GGAAGCGCAG GAGAATGTGG CGCTTCAGGT GGCCGAGCAG
ATGTCCGACT ATCTGCTGAC GGGCGCGGTG CAGAACGCGC TCAACATGCC GTCGGTCACG
GCGGAAGAGG CCGCGGTCAT GGGCCCGTGG GTCAAGCTCG CCGGCCATCT CGGCGCCTTC
GTGGGCCAGA TGACGGACGA GCCGATCAAG GCGATCAACG TGCTCTACGA CGGGGCCGTG
GGCGAGATGA ACCTCGCCGC GCTGAACTGC GCCACGATCG CGGGCATCAT GAAGGCCACG
AACCCGGACG TGAACCTCGT CTCGGCTCCG GTCGTGGCCA AAGAGCGCGG GATCCAGATC
TCGACCACCA CGCAGGCCAA GTCGGGCGCC TTCGACGCCT ATATCAAGCT GACGGTCGTG
ACCGACAAGC GCGAGCGGTC GGTGGCGGGC ACCTGCTTCT CGGACGGCAA GCCGCGCTTC
ATCCAGATCA AGGGCATCAA CATCGACGCC GAAGTCGGCC GCCACATGCT CTACACGACG
AACGAGGACG TGCCGGGCAT CATCGGCCTC CTCGGCATGA CCATGGGCAA GAACGGCGTC
AACATCGCGA ACTTCACCCT CGGCCGGACC TCGGTCGGGC AGGAGGCCAT CGCGATCCTC
TACCTCGATC AGGCGATCGA TCCGAAGGTG GTGGAGACGC TCGAATCGAC CGGCCTCTTC
CAGCAGGTGA AGCCGCTCGA ATTCGACGTG GCCTGA
 
Protein sequence
MAPRVLVSDE LSETAVQIFR DRGVEVDYMP KLGKDKEKLA EIIGQYDGLA IRSATKVTEK 
LLEQATNLKV IGRAGIGVDN VDIPAASRKG VIVMNTPFGN SITTAEHAIA MMFACARQLP
EANASTHAGK WEKSRFMGVE LFNKTLGVIG AGNIGGIVCD RALGLSMKVV AYDPFLSEER
AKALGVTKVE LDDLLARADF ITLHVPLTDK TRNILSAEAI AKTKKGVRII NCARGGLVDE
KALAEAIKSG HVAGAAFDVF EVEPASESPL FNLPNVVVTP HLGASTTEAQ ENVALQVAEQ
MSDYLLTGAV QNALNMPSVT AEEAAVMGPW VKLAGHLGAF VGQMTDEPIK AINVLYDGAV
GEMNLAALNC ATIAGIMKAT NPDVNLVSAP VVAKERGIQI STTTQAKSGA FDAYIKLTVV
TDKRERSVAG TCFSDGKPRF IQIKGINIDA EVGRHMLYTT NEDVPGIIGL LGMTMGKNGV
NIANFTLGRT SVGQEAIAIL YLDQAIDPKV VETLESTGLF QQVKPLEFDV A