Gene RSP_4044 details


Gene Information

Locus tag: RSP_4044
Symbol: pgk
ID: 3720104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 1127213
End bp: 1128406
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 69%
IMG OID: 640070668
Product: phosphoglycerate kinase
Protein accession: YP_352549
Protein GI: 77463045
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0126] 3-phosphoglycerate kinase
TIGRFAM ID:
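
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1127213
end_bp = 1128406
protein_length_aa = 397

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1194
print(expected_protein_length)  # 397
```

Under these assumptions the computed values agree with the listed Gene Length (1194 bp) and Protein Length (397 aa).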


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.91232
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCTGGA AGACACTCGA CGACATGGAT CTTGCCGGCA AGGTCGTGCT GGTGCGCGTG 
GATGTGAACG TGCCGATGGA AAATGGCGAA GTCACCGACG CCACCCGGAT CGAGAAGATC
GTCCCCACCG TCGAGGATAT CCTGAAGAAG GGCGGCAAGC CCGTCCTGCT CGCCCATTTC
GGCCGTCCGA AGGGCAAGGT CGTGGACGAG ATGAGCCTCC GCCTCGTGCT GCCCGCGCTG
CAGAACGCGC TGCCTGGCAC CAAGGTGAGC TTTGCCGCCG ACTGCGTGGG CCCCGAGCCC
GAGCAGGCGG TGGCCGCCAT GCTCGAGGGC GAGGTGCTCC TCCTCGAGAA CACCCGCTTC
CATGCCGGCG AGGAGAAGAA CGACCCCGAG CTGGCCGCCG CGATGGCGAA GCTGGGGCAG
GTCTATGTCA ACGATGCCTT CTCGGCCGCG CACCGCGCCC ATGCCTCGAC CGAGGGCCTC
GCCCGTCTTC TGCCCTCGGC CGCCGGCCGG CTGATGGAGG CCGAGCTGAA GGCGCTCGAA
GCCGCTCTCG GCCATCCCGA GCGCCCCGTT GTGGCCGTGG TGGGCGGGGC CAAGGTCTCG
ACCAAGCTCG ACCTTCTGGG CAATCTCGTG GGCCGGGTCG ATCATCTGGT GATCGGCGGC
GGCATGGCCA ACACCTTCCT CGTGGCGCAG GGGATCGAGG TCGGCAAGTC GCTGGCCGAG
CGCGACATGG CCGATACGGC GCGCGAGATC CTCTCCAAGG CGAAGGCCGC GGGCTGCACG
ATCCATCTTC CGCTCGATGT GGTGGTGGCG CGCGAGTTCA AGGCGGGGGC CGCGAACGAG
ACGGTCGAGA CGGCGGCCTG CCCGGCCGAC GCGATGATCC TCGATGCCGG TCCGAAGACC
GTGGCCGCCC TCTCCGAAGT GTTCGCCTCG GCTAAGACGC TGATCTGGAA CGGCCCGCTC
GGCGCCTTCG AGATCGAGCC CTTCGACGCC GCGACGAATG CGGCGGCGCT TCAGGTGGCG
CAGCTCACCA AGGCGGGCCA GCTCATTTCG GTCGCGGGCG GCGGCGATAC GGTGGCCGCC
CTCAACAAGG CGGGCGCGGC CGAAGGCTTC TCCTACATCT CGACGGCGGG CGGTGCCTTC
CTCGAATGGA TGGAGGGCAA GGAGCTGCCC GGAGTGGCCG CGCTCACGGT CTGA
 
Protein sequence
MGWKTLDDMD LAGKVVLVRV DVNVPMENGE VTDATRIEKI VPTVEDILKK GGKPVLLAHF 
GRPKGKVVDE MSLRLVLPAL QNALPGTKVS FAADCVGPEP EQAVAAMLEG EVLLLENTRF
HAGEEKNDPE LAAAMAKLGQ VYVNDAFSAA HRAHASTEGL ARLLPSAAGR LMEAELKALE
AALGHPERPV VAVVGGAKVS TKLDLLGNLV GRVDHLVIGG GMANTFLVAQ GIEVGKSLAE
RDMADTAREI LSKAKAAGCT IHLPLDVVVA REFKAGAANE TVETAACPAD AMILDAGPKT
VAALSEVFAS AKTLIWNGPL GAFEIEPFDA ATNAAALQVA QLTKAGQLIS VAGGGDTVAA
LNKAGAAEGF SYISTAGGAF LEWMEGKELP GVAALTV
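
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is one possible way to do this in plain Python, not the method used to produce this record. It assumes the sequence is pasted as a string in the space-grouped layout shown here; `clean` and `translate` are hypothetical helper names. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), so a single lookup table is used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first_line = "ATGGGCTGGA AGACACTCGA CGACATGGAT CTTGCCGGCA AGGTCGTGCT GGTGCGCGTG"
last_line = "CTCGAATGGA TGGAGGGCAA GGAGCTGCCC GGAGTGGCCG CGCTCACGGT CTGA"

print(translate(first_line))  # MGWKTLDDMDLAGKVVLVRV
print(translate(last_line))   # LEWMEGKELPGVAALTV
```

Each full display line holds 60 bases, so every line starts in frame and can be translated on its own. The two results match the beginning and end of the protein sequence listed above.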