Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2954 |
Symbol | |
ID | 4895786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 3109129 |
End bp | 3111312 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113557 |
Product | transketolase, central region |
Protein accession | YP_001044828 |
Protein GI | 126463714 |
COG category | [C] Energy production and conversion |
COG ID | [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.998903 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCT CGCTCATCGT CCATGAGAAT TTCCTCTCCC GCGTGAAGGC CCGCGACCTG CCGCAGGGCG CCCCGCCCAC CCCCGGCCTC GCCCCGCACG AGATGGTGGC ACTCTTCCGC AGCCAGTGCC TGTCGCGCGC GCTCGACCGG ACCAGCCGTT CCATGCAGAA GGCGGGGCAG GGCTTCTACA CGATCGGCTC CTCGGGGCAC GAGGGGATGG TCGCCGTGGC CCATGCGCTG CGCCCCAGCG ACATGGCCTT CCTCCATTAC CGCGACGCGG CCTTCCAGAT CGCGCGCGCG GCGCAGCTCG GCCAGAGCAT CGCCTGGGAC ATGCTTCTGT CCTTCGCCTC CTCCGCCGAG GATCCGATCT CCGGCGGGCG GCACAAGGTG CTGGGCTCGA AGGCGCTGGC CATCCCGCCC CAGACCTCGA CCATCGCGAG CCACCTGCCG AAGGCGGTGG GGGCGGCCTA TTCGCTGGGC CTCGCGCGGC GCCGCCCGCC CGAGCACCGC GCCCTGTCCG AGGATGCGCT GGTGATGGCC AGTTTCGGCG ACGCCTCGGC CAACCATTCC ACCGCGCAGG GCGCCTTCAA CACCGCGGGC TGGACCGCCT TCCAGTCGGT GCCGCTGCCG CTCCTCTTCG TCTGCGAGGA CAATGGCATC GGCATCTCGA CCAGAACCCC GCGCGGCTGG ATCGAGGCAA GCTTCCGCGC CCGCCCCGGC CTGCGCTACT TCCGCGCCAA CGGGCTCGAC ATGTCAGAGA CTTACGCCGT GGCGGCCGAA GCCGCAGCCT ATGTCCGCAA CCGCCGCAGG CCCGCCTTCC TGCATCTGGG AACCGTCCGC CTCTATGGCC ATGCCGGGGC GGACCTGCCC ACCACCTACA TGAGCCGCGA GGAGGTCGAG GCCGAGGAGG CCAACGATCC GCTCCTGCAC AGCGTCCGGC TGATGGAGGC CGCAGGCGCG CTCGACCCCG ACGAGGCTCT CGCGATCTAC CTCGAGACGC AGGAGCGCGT GGACCGGGTC GCGGCCGAGG CGGTCACCCG GCCGAGACTG AAGACGGCCT CCGACGTGAT GGCGAGCCTG ATCCCCCCGG CCCGGCCCTG CGCCCCCACC AACGGCCCCT CGGCCGATTC CCGCGCCGCG GCCTTCGGCT CCGACCTCAA GGCGATGGCC GAGCCGCAGC CGATGAGCCG CCTCATCAAC TGGGCGCTCA CCGACCTCAT GCTCGCCCAC CCCGAGATCG TGCTGATGGG CGAGGATGTG GGCCGCAAGG GCGGGGTCTA TGGCGTGACC CAGAAGCTCC AGACCCGCTT CGGCCCCGAC CGGGTGATCG ACACGCTCCT CGACGAACAG TCGATCCTCG GCCTCGGGAT CGGCATGGCC CACAACGGCT TCCTGCCCAT CCCCGAGATC CAGTTCCTCG CCTATCTCCA CAATGCCGAG GACCAGATCC GCGGCGAGGC GGCCACCCTG CCCTTCTTCT CGAACGGACA ATATACCAAC CCGATGGTGC TCCGGATCGC GGGGCTCGGC TATCAGAAGG GCTTCGGCGG CCATTTCCAC AACGACAATT CCATCGCCGT CCTGCGCGAT ATCCCCGGGC TGATCCTCGC CTGTCCCTCG GACGGGGCCG AGGCCGCGAT GATGCTGCGC GAATGCGTGC GGCTCGCGCG CGAAGAGCAG CGGCTGGTGG TCTTCCTCGA ACCGATCGCG CTCTATCCGA TGCGCGACCT TGCGGAAGAG AAGGACGGGG GCTGGATGCG GACCTATCCC GACCCGTCCG AGCGGCTCCG ATTCGGCGAG ATTGGCTGCC ACGGCGAAGG CCGGGATCTG GCCATCGTGA CCTTCGGCAA CGGCATCTAC CTGTCGCAAC AGGCGAATTT CACGCTTCGT GAAAATGGCG TGGCCGCGCG GATCCTCGAT CTGCGCTGGC TCGCGCCCCT GCCGCTCGAG GCGATGCTCG AGGCCACGCG CGACTGCCGC GCCGTCCTCG TGGTCGACGA ATGCCGCCGC TCGGCGGGCG GCCCGGCCGA GGCGCTGATG ACGGCGCTGG CCGAGGCGGG CCGCACCCGC ATCGCCCGCA TCACCGCCGA GGACAGTTTC ATCGCCACCG GCCCCGCCTA TGCCGCCACC CTGCCCTCGG CCGCCGGCAT CGCCGAGGCG GCGCTCACGC TGGTGCGGGC ATGA
|
Protein sequence | MPRSLIVHEN FLSRVKARDL PQGAPPTPGL APHEMVALFR SQCLSRALDR TSRSMQKAGQ GFYTIGSSGH EGMVAVAHAL RPSDMAFLHY RDAAFQIARA AQLGQSIAWD MLLSFASSAE DPISGGRHKV LGSKALAIPP QTSTIASHLP KAVGAAYSLG LARRRPPEHR ALSEDALVMA SFGDASANHS TAQGAFNTAG WTAFQSVPLP LLFVCEDNGI GISTRTPRGW IEASFRARPG LRYFRANGLD MSETYAVAAE AAAYVRNRRR PAFLHLGTVR LYGHAGADLP TTYMSREEVE AEEANDPLLH SVRLMEAAGA LDPDEALAIY LETQERVDRV AAEAVTRPRL KTASDVMASL IPPARPCAPT NGPSADSRAA AFGSDLKAMA EPQPMSRLIN WALTDLMLAH PEIVLMGEDV GRKGGVYGVT QKLQTRFGPD RVIDTLLDEQ SILGLGIGMA HNGFLPIPEI QFLAYLHNAE DQIRGEAATL PFFSNGQYTN PMVLRIAGLG YQKGFGGHFH NDNSIAVLRD IPGLILACPS DGAEAAMMLR ECVRLAREEQ RLVVFLEPIA LYPMRDLAEE KDGGWMRTYP DPSERLRFGE IGCHGEGRDL AIVTFGNGIY LSQQANFTLR ENGVAARILD LRWLAPLPLE AMLEATRDCR AVLVVDECRR SAGGPAEALM TALAEAGRTR IARITAEDSF IATGPAYAAT LPSAAGIAEA ALTLVRA
|
| |