Gene Rsph17029_0499 details


Gene Information

Locus tag: Rsph17029_0499
Symbol:
ID: 4896575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 520230
End bp: 522473
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 69%
IMG OID: 640111083
Product: phosphoenolpyruvate-protein phosphotransferase PtsP
Protein accession: YP_001042387
Protein GI: 126461273
COG category: [T] Signal transduction mechanisms
COG ID: [COG3605] Signal transduction protein containing GAF and PtsI domains
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
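
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes the start and end positions are 1-based and inclusive (the reported gene length matches that convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 520230
end_bp = 522473
gene_length_bp = 2244
protein_length_aa = 747

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 2244 748 747
```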


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.66379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.834311
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAC GCAGCGAAAG CGAAAGCCGC AAGCTTCTCA GGCGGTTGCG GGACAGTCTG 
GCCCGGTCCG GAAAGGGCCA GGACCGGCTG GACCGCATCA CGTCCCTGAT TGCGGACTCG
ATGCGGACCG AGGTCTGCTC GATCTATCTC TTCCGCGACC CCGACACGCT CGAGCTCTGC
GCGACCCAGG GTCTCAATCC CGAAGCCGTC CACCAGACGC GGCTGAAGAT GGGCGAGGGC
CTCGTCGGCC GCGTGGCGCG GATGGCGAGC CCGATCAACA CCGGCAACGC GCCCTCCGAG
CGCGGCTTCC GGTTCATGCC CGAGACCGGC GAGGAGATCT ATTCCGCCTT CCTTGGGGTG
CCGATCCAGC GCGTGGGCGA GAAGCTCGGC GTGCTGGTGG TGCAGTCGCG CGAGGCGCGG
GAATATTCCG AAGACGAGGT CTATGCCCTC GAGGTGGTGG CCATGGTGCT GGCGGAAATG
GCCGAACTGG GCGTCTTCGT GGGCGAGGGC GAGGCGCTGT CGGCCAAACA TACGCAGCCC
GTGCTGATGC GCGGGTCGAC CGGGCAGGAG GGTGCGGCCG AGGGCCATGT CTGGCTGCAC
GAGGCGCGCG TCGTGGTCAC GAACCCGGTG GGCGACGATC CGGTGCATGA GACCGAACGC
ATCCGCGCCG CCGTGGCGCA GCTGCGCGTC TCGGTCGACG ATCTTCTCTC GGCCTCGACG
CTCGACAAGG ATCAGCGGCA GGTGCTCGAG GCCTACAGGC TCTTCGCCCA TTCCCGCGGC
TGGCTGCGGC GCATGGAAGA GGACATCATG GCCGGCCTCT CGGCCGAGGC CGCGGTGCAG
AAGGAGCAGT CGGCCGCCCG CGCGCGGCTC GAGCAGGTGC CCGACGCCTA TCTGCGCGAC
CGGCTCCACG ACCTCGACGA CCTGTCGCAC CGGCTGCTCA GGATCCTGAC CGGGCAGGGG
CGCGACACCG GCGCCGAGAT GCCGGAGAAC CCGGTGCTCG TGGCGCGCAA CATCGGGCCC
GCCGAGCTTC TGGAATACGG CCGCAAGCTG CGCGGCATCG TGCTCGAGGA GGGTTCGGTC
GGCTCTCATG CCGCGGTCGT GGCGCGGGCG CTGGCCATTC CGCTGGTCAT CCACGCCGAG
CGGATCACCA CCGAGGCGCT GAACGGCGAT CATATCATGG TGGATGGCGA CAACGGTCTC
GTGCATCTGC GCCCCGAGGC CTCGATCGCC GCGGCCTTCC GCGACAAGAT GGCGATGCAG
GCCAAGGCGC AGGAGCGCTA CGCCTCGCTC CGCAACCTGC CCGCCCAGTC GAAGTGCGGC
ACGGTCACGG GCCTCATGAT GAATGCGGGC CTGATGGCCG ATCTGCCCTC GCTCGATTCC
TCGGGGGCCG AGGGCGTGGG CCTCTTCCGC ACCGAGCTTC AGTTCCTGAT CCGCAACCAG
ATGCCCAAGC GCGACGAGCT GGCGCGGCTC TATGCCCGGG TGATGGATGC CGCGCGCGGA
CACCGGGTGG TGTTCCGCAC GCTCGACATC GGCTCGGACA AGGTGCTGCC CTATCTCAAG
CCGCAGGATG AGCCGAACCC CGCGATGGGC TGGCGCGCGA TCCGCGTCGG GCTCGACAAG
CCCGGCGTGC TGAGGATGCA GCTTCAGGCG CTGATCCGGG CCTCGGCGGG GCGCGATCTG
TCGATCATGT TCCCCTTCGT GTCCGAACAC CATGAATTCA TGATGGCCCG CAGCCATCTG
CTGCGCGAGC TGCACCGCGA GCGCAGCCTC GGCCATCCGG TGCCGGTGAA TATCCGCGTG
GGCACCATGC TGGAGACGCC GAGCCTTGCC TATGCGCCGC GCGCCTTCTT CGAGACCACC
GATTTCATCT CGATCGGCGG CAACGACCTG CGCCAGTTCT TCTTCGCGGC CGACCGCGAG
AACGAGCGCG TGCGCCGGCG CTACGACGTG CTGAACGTGA GCTTCCTGAC CTTCCTCGAG
CATATCGTGA ACCGCTGCGC CGAGACGGCG ACGCCCCTGT CCTTCTGCGG CGAGGATGCC
GGCCGCCCGG TCGAGGCTTT GTGCTTCGCG GCCATGGGCA TCCAGACGCT CTCGATGCGC
CCCGCCTCGA TCGGCCCGGT GAAGGCGCTG CTGCGCCGCG TGGATCTGAT CGAGGCCCGC
AAGGTGATCG AGATGGCCCG CGCCTCGGGC GCCGAGACGG TGCGCCCCGC CATCCTCGAG
TGGCTGGCGA CGCAGGTCGA CTGA
 
Protein sequence
MPERSESESR KLLRRLRDSL ARSGKGQDRL DRITSLIADS MRTEVCSIYL FRDPDTLELC 
ATQGLNPEAV HQTRLKMGEG LVGRVARMAS PINTGNAPSE RGFRFMPETG EEIYSAFLGV
PIQRVGEKLG VLVVQSREAR EYSEDEVYAL EVVAMVLAEM AELGVFVGEG EALSAKHTQP
VLMRGSTGQE GAAEGHVWLH EARVVVTNPV GDDPVHETER IRAAVAQLRV SVDDLLSAST
LDKDQRQVLE AYRLFAHSRG WLRRMEEDIM AGLSAEAAVQ KEQSAARARL EQVPDAYLRD
RLHDLDDLSH RLLRILTGQG RDTGAEMPEN PVLVARNIGP AELLEYGRKL RGIVLEEGSV
GSHAAVVARA LAIPLVIHAE RITTEALNGD HIMVDGDNGL VHLRPEASIA AAFRDKMAMQ
AKAQERYASL RNLPAQSKCG TVTGLMMNAG LMADLPSLDS SGAEGVGLFR TELQFLIRNQ
MPKRDELARL YARVMDAARG HRVVFRTLDI GSDKVLPYLK PQDEPNPAMG WRAIRVGLDK
PGVLRMQLQA LIRASAGRDL SIMFPFVSEH HEFMMARSHL LRELHRERSL GHPVPVNIRV
GTMLETPSLA YAPRAFFETT DFISIGGNDL RQFFFAADRE NERVRRRYDV LNVSFLTFLE
HIVNRCAETA TPLSFCGEDA GRPVEALCFA AMGIQTLSMR PASIGPVKAL LRRVDLIEAR
KVIEMARASG AETVRPAILE WLATQVD
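
Both sequences above are printed in 10-character blocks separated by spaces, so the whitespace has to be removed before the text can be used for length or composition checks. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced here for illustration; only the first line of the gene sequence is used as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above, used as a short demonstration.
first_line = "ATGCCTGAAC GCAGCGAAAG CGAAAGCCGC AAGCTTCTCA GGCGGTTGCG GGACAGTCTG"
dna = clean_sequence(first_line)

print(len(dna))                    # six blocks of ten bases
print(round(gc_fraction(dna), 2))  # GC fraction of this line only
```

Applied to the complete gene sequence, the same two helpers would be expected to give a length equal to the listed gene length and a GC fraction close to the listed GC content, assuming the listing is complete.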