Gene Rsph17025_4017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4017 
Symbol 
ID5086191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp46414 
End bp48387 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content71% 
IMG OID640485575 
Producttransketolase 
Protein accessionYP_001170175 
Protein GI146280018 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.212996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGATA TCGAGGTCTC GCAGGAGACA CGGATGGCCC ATGCCATCCG GGCTCTGGCG 
ATGGATGCGG TCGAAAAGGC GAAGTCGGGC CATCCCGGCA TGCCGATGGG CATGGCCGAC
GTGGCCACCG TCCTCTTCAA CCGCTTCATG ACCATCGACC CGGCGGCGCC GAAATGGCCC
GACCGCGATC GCTTCGTGCT CTCGGCCGGG CATGGCTCGA TGCTGCTCTA TGCGATCCAC
CACCTGCTGG GTTACGCGGA CATGGACATG GACCAGATCC GGTCCTTCCG CCAGCTCGGC
GCGCGGACGG CGGGGCACCC GGAATATGGC CACGCCGACG GGATCGAGGT GACGACCGGC
CCGCTGGGTC AGGGGATCGC GACCGCCGTC GGCATGGCGC TGGCCGAACG CATGAAGGCC
GCCCGCTACG GCGAGGCGCT GGTCGATCAT TACACCTATG TCATCGCGGG CGACGGCTGC
CTGATGGAGG GCATCTCCCA CGAGGCGATC GACATGGCGG GGCATCTGGG CCTTGGCCGG
CTGATCGTGC TCTGGGACGA CAACCGCATC ACCATCGACG GCGACACGGC GATCTCGACC
TCGACCGACC AGATGGCGCG CTTTGCCGCC GCGGGCTGGC ATGTGCAGGC CTGCGACGGC
CACGCACCCG AGGAGATCGC GGCCGCGATC GAGGCCGCGC GGCGCGACCC GCGCCCCTCG
ATGATCGCCT GCCGGACGGT GATCGGCTTC GGCGCGCCGA ACAAGCAGGG CGGCCATGAC
GTCCACGGCG CGCCGCTCGG CGCCGAGGAG ATCGCCGCGG CCCGTGCCTT CCTTGGCTGG
GAGCACGCGC CCTTCGAGAT CCCCGCGGAC CTCTACGCCG CCTGGCACGG GATCGCCGAA
CGCGGCGCGG CCGCGCGGGC GGCCTGGGAA GCGCGTCTCG CCGCCAGCCC CGCGCGCGCG
GCCTTCGAGG CGGCCGAGGC GGGCGACACC TCCGCACTTC CGCCCGCCAT CGCCGCCTAC
AAGGCGAAGC TGTCGGCGGA CAAGCCCAAG GTCGCAACCC GCAAGGCCAG CGAGATGGCG
CTCGAGGTGG TGAACGCGGC GCTGCCCTTC TCGGTCGGCG GCTCGGCGGA TCTGACGGGC
TCCAACCTCA CCCGCTCGAA GGGGATGGTC TCGGTGACGC CGGGCGCCTT CGGCGGCAGC
TACATCCACT ACGGCATCCG CGAGCACGGC ATGGCGGCCG CGATGAACGG CATCGCGCTG
CACGGCGGCC TGCGCCCCTA CGGCGGGACC TTCATGGCCT TCGCCGACTA CTGCCGGCCC
TCGATCCGGC TGTCGGCGCT GATGGGCGTG CCGGTGACCT ATGTCATGAC GCATGACTCC
ATCGGCCTCG GCGAGGACGG ACCGACCCAC CAGCCGGTCG AGCATCTGGC GAGCCTGCGC
GCCATCCCGA ACCTGACGGT GATCCGGCCC GCCGATGCGG TCGAGACCGC CGAGGCCTGG
GAAATCGCCA TGACCGCGAC CGCGACGCCG ACGCTCCTGG TGCTGTCGCG CCAGAACCTG
CCCACGGTGC GCACCGAGCA CGGAGCGGAG AACCTGACCG CGCGTGGCGC CTACCTGCTG
CGCGACCCCG CCAACCGGCA GGTGACGCTG ATCGCCACCG GCTCGGAACT GGAACTGGCC
CTCGCCGCCG CCGACCGGCT GGCCGAGGAG GGGATCGCCG CCGCCGTGGT CTCGGCGCCC
GCGTTCGAGC TGTTCGCGGC CCAGCCGGCC GACTACCGGG CGAAGATCCT CGGCACCGCG
CCGCGCGTGG GCTGCGAGGC GGCGCTGCGG CAGGGCTGGG ATCTGTTCCT GGGGCCGCAG
GACGGCTTCG TGGGCATGAC GGGCTTTGGC GCCTCGGCGC CCGCGCCCGC GCTTTACCAG
CATTTCAACA TCACGGCCGA CGCGATCGTC GCCGAAGCCA AGAACCGGAT CTGA
 
Protein sequence
MKDIEVSQET RMAHAIRALA MDAVEKAKSG HPGMPMGMAD VATVLFNRFM TIDPAAPKWP 
DRDRFVLSAG HGSMLLYAIH HLLGYADMDM DQIRSFRQLG ARTAGHPEYG HADGIEVTTG
PLGQGIATAV GMALAERMKA ARYGEALVDH YTYVIAGDGC LMEGISHEAI DMAGHLGLGR
LIVLWDDNRI TIDGDTAIST STDQMARFAA AGWHVQACDG HAPEEIAAAI EAARRDPRPS
MIACRTVIGF GAPNKQGGHD VHGAPLGAEE IAAARAFLGW EHAPFEIPAD LYAAWHGIAE
RGAAARAAWE ARLAASPARA AFEAAEAGDT SALPPAIAAY KAKLSADKPK VATRKASEMA
LEVVNAALPF SVGGSADLTG SNLTRSKGMV SVTPGAFGGS YIHYGIREHG MAAAMNGIAL
HGGLRPYGGT FMAFADYCRP SIRLSALMGV PVTYVMTHDS IGLGEDGPTH QPVEHLASLR
AIPNLTVIRP ADAVETAEAW EIAMTATATP TLLVLSRQNL PTVRTEHGAE NLTARGAYLL
RDPANRQVTL IATGSELELA LAAADRLAEE GIAAAVVSAP AFELFAAQPA DYRAKILGTA
PRVGCEAALR QGWDLFLGPQ DGFVGMTGFG ASAPAPALYQ HFNITADAIV AEAKNRI