Gene RPD_1052 details

Gene Information

Locus tag: RPD_1052
Symbol:
ID: 4021528
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1205398
End bp: 1207413
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 68%
IMG OID: 637961244
Product: transketolase
Protein accession: YP_568191
Protein GI: 91975532
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0021] Transketolase
TIGRFAM ID: [TIGR00232] transketolase, bacterial and yeast
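
The start, end, gene length and protein length fields can be checked against each other with a little arithmetic. The sketch below is only an illustration: the helper name `cds_lengths` is hypothetical, and it assumes that the coordinates are 1-based with both endpoints included and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 1205398, End bp 1207413.
gene_length, protein_length = cds_lengths(1205398, 1207413)
print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (2016 bp) and Protein Length (671 aa).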


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.25792
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACATCT CTGCACGCGA TATCCACGAC GAAGTCAGCC ACGGCGAAAT GGCCAATGCG 
GTCCGCTTCC TCGCGATCGA CGCAGTCGAA AAGGCGAAGT CCGGCCACCC CGGAATGCCG
ATGGGCATGG CCGACGTCGC CACCGTGCTG TTCACCCGCT TCATGAAGTT CGATGCCGCC
GATCCGACCT GGCCCGACCG CGATCGTTTC GTGCTGTCGG CCGGCCATGG CTCGATGCTG
CTCTACGCGC TGCTGCATCT GACCGGCTAC AAAGAGGTCA CGATCGATGA GTTGAAGGCG
TTCCGGCAGT GGGGCTCGAA GACGCCCGGC CATCCGGAAT ACGGCCACAC CGAAGGCGTC
GAGACCACCA CCGGCCCGCT CGGGCAGGGG CTCGCGACCT CGGTCGGCAT GGCGCTCGCC
GAGCGGATGC TGAACGCGCG TTACGGCGAT GCGCTGGTCG ATCACTTCAC CTATGTGATC
GCCGGCGACG GCTGCCTGAT GGAAGGCGTC AGCCACGAGG CGATCTCGCT CGCCGGTCAT
CTCAAGCTCA ATCGCCTGAT CGTGCTGTGG GACGACAACC ACATCTCGAT CGACGGCGAA
ACCTCACTGT CGTGCTCGGA TGATCAGCTC GCGCGCTTCG CCGCGTCGGG CTGGGCCACC
ACCCGCGTCG ATGGCCATGA TCCGGAGGCG GTCGAGGCCG CGATCGAGCA GGCCAGGAAA
AGCGACCGGC CGTCCCTGAT CGCCTGCCGC ACCAAGATCG GCTTCGGCTC GCCCAAGGTC
GAGGGCACCG AGAAAGCCCA TGGCGCGCCG CTCGGCGCCG ACGAGGTCGA GAAGACCCGC
GCTGCGCTGA ACTGGCCGCA CCCTCCGTTC GAAGTGCCGG ACGCGGTTCT TGCGCGCTGG
CGCGAGGCGG GCAGCCGCGG CAAGGCCGCG CATCAGGCGT GGACCGAGCG GCTCGGCAAT
CTCGATGCGG CGACCCGCGC CGGCTTCGAG AACGTCCTCG CCGGCAAGCT GTCGGCCGAC
TACGAGCCCG CGCTGAAAGC GCTGATCGCG AATTTCGCAT CCGACCAGCC GTCGATCGCC
ACCCGGCAGG CTTCGCAGCT CACCATCAAC GCGCTGGTGC CGGCGTCGCC GAACCTGCTC
GGCGGCTCGG CTGACCTGAC CCATTCCAAC CTGACCCACG CCAAGGGTTC GGTTTCGGTG
AAGCCCGGCG CCTATGGCGG CAGCTATCTG CACTACGGCA TTCGCGAATT CGGCATGGCG
GCGGCGATGA ACGGCCTCGC CCTGCACGGC GGCTTCATTC CCTATGGCGG CACCTTCCTG
GTGTTCGCCG ACTACAGCCG TCCGGCAATA CGGCTCGCGG CGCTGATGGG GGTGCGGGTG
ATCCATGTGA TGACCCACGA CTCGATCGGC CTCGGCGAGG ACGGCCCGAC CCATCAGCCG
GTCGAGCATG TCTCGTCGCT ACGGGCGATC CCGAACCTGC TGGTGTTTCG TCCGGCGGAC
GCGATCGAGA CCGCGCAGGC CTGGGACTGC GCGCTGAAGC AGGCGAGCCG GCCCTCGGTG
CTGGCTCTGT CGCGGCAGGC GCTGCCGATG TTGCCGCGGC CGAACGGCGT GAACGACAAT
CCGGTCGGGC GCGGCGCCTA CCTGGTGATC GATCCGGGCA AGCGCGACGT CACTTTGATC
GCGACCGGCT CGGAAGTCTC GCTGGCGCTG GAAGCGGCGT GCAAGCTCGA GGGCGAAGGC
ATCAAGGCGG CGGTGGTCTC GGCGCCCTGT TTCGAATTGT TCGCCGAACA GGACGAGGCC
TACCGCGCGA CCGTGCTTGG GACTGCGCCG CGGATCGGCG TCGAAGCGGC GCGCGATATC
GATTGGCGGC GTTGGATCGG CGATGGCGGC GCCTTCGTCG GCATGACCGG GTTCGGCGCC
TCGGCGCCGG CGCCGGTGCT GTACCAGAAG TTCGGTATCA CCGCAGATGC GGTCACGGAC
GCGGCCAAGG CCGCGATCGC TCGCAGCAAA CATTGA
 
Protein sequence
MNISARDIHD EVSHGEMANA VRFLAIDAVE KAKSGHPGMP MGMADVATVL FTRFMKFDAA 
DPTWPDRDRF VLSAGHGSML LYALLHLTGY KEVTIDELKA FRQWGSKTPG HPEYGHTEGV
ETTTGPLGQG LATSVGMALA ERMLNARYGD ALVDHFTYVI AGDGCLMEGV SHEAISLAGH
LKLNRLIVLW DDNHISIDGE TSLSCSDDQL ARFAASGWAT TRVDGHDPEA VEAAIEQARK
SDRPSLIACR TKIGFGSPKV EGTEKAHGAP LGADEVEKTR AALNWPHPPF EVPDAVLARW
REAGSRGKAA HQAWTERLGN LDAATRAGFE NVLAGKLSAD YEPALKALIA NFASDQPSIA
TRQASQLTIN ALVPASPNLL GGSADLTHSN LTHAKGSVSV KPGAYGGSYL HYGIREFGMA
AAMNGLALHG GFIPYGGTFL VFADYSRPAI RLAALMGVRV IHVMTHDSIG LGEDGPTHQP
VEHVSSLRAI PNLLVFRPAD AIETAQAWDC ALKQASRPSV LALSRQALPM LPRPNGVNDN
PVGRGAYLVI DPGKRDVTLI ATGSEVSLAL EAACKLEGEG IKAAVVSAPC FELFAEQDEA
YRATVLGTAP RIGVEAARDI DWRRWIGDGG AFVGMTGFGA SAPAPVLYQK FGITADAVTD
AAKAAIARSK H
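
The protein sequence follows from the gene sequence through translation table 11, in which the initial TTG codon is read as methionine. A minimal sketch of that translation is given below. The function name `translate_cds` is hypothetical, the codon assignments and start-codon set are those of NCBI genetic code 11 as I understand them, and the code assumes an in-frame, unspliced sequence containing only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 allows as initiators; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAACATCT CTGCACGCGA TATCCACGAC"))  # MNISARDIHD
```

The printed residues match the first ten residues of the protein sequence listed above.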