Gene Rsph17029_2954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2954 
Symbol 
ID4895786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp3109129 
End bp3111312 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content70% 
IMG OID640113557 
Producttransketolase, central region 
Protein accessionYP_001044828 
Protein GI126463714 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.998903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCT CGCTCATCGT CCATGAGAAT TTCCTCTCCC GCGTGAAGGC CCGCGACCTG 
CCGCAGGGCG CCCCGCCCAC CCCCGGCCTC GCCCCGCACG AGATGGTGGC ACTCTTCCGC
AGCCAGTGCC TGTCGCGCGC GCTCGACCGG ACCAGCCGTT CCATGCAGAA GGCGGGGCAG
GGCTTCTACA CGATCGGCTC CTCGGGGCAC GAGGGGATGG TCGCCGTGGC CCATGCGCTG
CGCCCCAGCG ACATGGCCTT CCTCCATTAC CGCGACGCGG CCTTCCAGAT CGCGCGCGCG
GCGCAGCTCG GCCAGAGCAT CGCCTGGGAC ATGCTTCTGT CCTTCGCCTC CTCCGCCGAG
GATCCGATCT CCGGCGGGCG GCACAAGGTG CTGGGCTCGA AGGCGCTGGC CATCCCGCCC
CAGACCTCGA CCATCGCGAG CCACCTGCCG AAGGCGGTGG GGGCGGCCTA TTCGCTGGGC
CTCGCGCGGC GCCGCCCGCC CGAGCACCGC GCCCTGTCCG AGGATGCGCT GGTGATGGCC
AGTTTCGGCG ACGCCTCGGC CAACCATTCC ACCGCGCAGG GCGCCTTCAA CACCGCGGGC
TGGACCGCCT TCCAGTCGGT GCCGCTGCCG CTCCTCTTCG TCTGCGAGGA CAATGGCATC
GGCATCTCGA CCAGAACCCC GCGCGGCTGG ATCGAGGCAA GCTTCCGCGC CCGCCCCGGC
CTGCGCTACT TCCGCGCCAA CGGGCTCGAC ATGTCAGAGA CTTACGCCGT GGCGGCCGAA
GCCGCAGCCT ATGTCCGCAA CCGCCGCAGG CCCGCCTTCC TGCATCTGGG AACCGTCCGC
CTCTATGGCC ATGCCGGGGC GGACCTGCCC ACCACCTACA TGAGCCGCGA GGAGGTCGAG
GCCGAGGAGG CCAACGATCC GCTCCTGCAC AGCGTCCGGC TGATGGAGGC CGCAGGCGCG
CTCGACCCCG ACGAGGCTCT CGCGATCTAC CTCGAGACGC AGGAGCGCGT GGACCGGGTC
GCGGCCGAGG CGGTCACCCG GCCGAGACTG AAGACGGCCT CCGACGTGAT GGCGAGCCTG
ATCCCCCCGG CCCGGCCCTG CGCCCCCACC AACGGCCCCT CGGCCGATTC CCGCGCCGCG
GCCTTCGGCT CCGACCTCAA GGCGATGGCC GAGCCGCAGC CGATGAGCCG CCTCATCAAC
TGGGCGCTCA CCGACCTCAT GCTCGCCCAC CCCGAGATCG TGCTGATGGG CGAGGATGTG
GGCCGCAAGG GCGGGGTCTA TGGCGTGACC CAGAAGCTCC AGACCCGCTT CGGCCCCGAC
CGGGTGATCG ACACGCTCCT CGACGAACAG TCGATCCTCG GCCTCGGGAT CGGCATGGCC
CACAACGGCT TCCTGCCCAT CCCCGAGATC CAGTTCCTCG CCTATCTCCA CAATGCCGAG
GACCAGATCC GCGGCGAGGC GGCCACCCTG CCCTTCTTCT CGAACGGACA ATATACCAAC
CCGATGGTGC TCCGGATCGC GGGGCTCGGC TATCAGAAGG GCTTCGGCGG CCATTTCCAC
AACGACAATT CCATCGCCGT CCTGCGCGAT ATCCCCGGGC TGATCCTCGC CTGTCCCTCG
GACGGGGCCG AGGCCGCGAT GATGCTGCGC GAATGCGTGC GGCTCGCGCG CGAAGAGCAG
CGGCTGGTGG TCTTCCTCGA ACCGATCGCG CTCTATCCGA TGCGCGACCT TGCGGAAGAG
AAGGACGGGG GCTGGATGCG GACCTATCCC GACCCGTCCG AGCGGCTCCG ATTCGGCGAG
ATTGGCTGCC ACGGCGAAGG CCGGGATCTG GCCATCGTGA CCTTCGGCAA CGGCATCTAC
CTGTCGCAAC AGGCGAATTT CACGCTTCGT GAAAATGGCG TGGCCGCGCG GATCCTCGAT
CTGCGCTGGC TCGCGCCCCT GCCGCTCGAG GCGATGCTCG AGGCCACGCG CGACTGCCGC
GCCGTCCTCG TGGTCGACGA ATGCCGCCGC TCGGCGGGCG GCCCGGCCGA GGCGCTGATG
ACGGCGCTGG CCGAGGCGGG CCGCACCCGC ATCGCCCGCA TCACCGCCGA GGACAGTTTC
ATCGCCACCG GCCCCGCCTA TGCCGCCACC CTGCCCTCGG CCGCCGGCAT CGCCGAGGCG
GCGCTCACGC TGGTGCGGGC ATGA
 
Protein sequence
MPRSLIVHEN FLSRVKARDL PQGAPPTPGL APHEMVALFR SQCLSRALDR TSRSMQKAGQ 
GFYTIGSSGH EGMVAVAHAL RPSDMAFLHY RDAAFQIARA AQLGQSIAWD MLLSFASSAE
DPISGGRHKV LGSKALAIPP QTSTIASHLP KAVGAAYSLG LARRRPPEHR ALSEDALVMA
SFGDASANHS TAQGAFNTAG WTAFQSVPLP LLFVCEDNGI GISTRTPRGW IEASFRARPG
LRYFRANGLD MSETYAVAAE AAAYVRNRRR PAFLHLGTVR LYGHAGADLP TTYMSREEVE
AEEANDPLLH SVRLMEAAGA LDPDEALAIY LETQERVDRV AAEAVTRPRL KTASDVMASL
IPPARPCAPT NGPSADSRAA AFGSDLKAMA EPQPMSRLIN WALTDLMLAH PEIVLMGEDV
GRKGGVYGVT QKLQTRFGPD RVIDTLLDEQ SILGLGIGMA HNGFLPIPEI QFLAYLHNAE
DQIRGEAATL PFFSNGQYTN PMVLRIAGLG YQKGFGGHFH NDNSIAVLRD IPGLILACPS
DGAEAAMMLR ECVRLAREEQ RLVVFLEPIA LYPMRDLAEE KDGGWMRTYP DPSERLRFGE
IGCHGEGRDL AIVTFGNGIY LSQQANFTLR ENGVAARILD LRWLAPLPLE AMLEATRDCR
AVLVVDECRR SAGGPAEALM TALAEAGRTR IARITAEDSF IATGPAYAAT LPSAAGIAEA
ALTLVRA