Gene RPC_4766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4766 
Symbol 
ID3972801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5328136 
End bp5330196 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content67% 
IMG OID637927878 
Producttransketolase 
Protein accessionYP_534607 
Protein GI90426237 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATGGC AGCCGATCGG TACGGCGGGC CGCAACGTTC ATGATCCTCT CACTCAGGCG 
GTAGCTCTCA TGACGCAGGT CGATTCCTCC CGTATGGCCA ATGCAATCCG CGCGCTGGCG
ATGGACGCCG TGGAGAAGGC GAAGTCCGGC CACCCCGGAT TACCGATGGG CGCCGCCGAC
ATCGCCACCG TGTTGTTCAG CCAATTCCTG AAATTCGACG CCGAGGACCC GCGCTGGCCG
GATCGCGACC GCTTCGTGCT GTCCGCCGGG CACGGCTCGA TGCTGCTCTA CGCGCTGTTC
TATCTGACCG GCAATGAGGA CGTCACGCTC GACCAGATCA AGAATTTCCG CCAGCTCGAC
TCCAAGACTC CCGGCCATCC GGAGAACTTC ATCACCGCCG GCGTCGAGAC CACCACCGGC
CCGCTCGGCC AGGGCGTGGC GTCGTCGGTC GGGACTGCAT TGGCCGAGCG GCTGCTGGCG
GCGGAGTTCG GCGACGACAT CGTCGACCAC TACACCTACG TGCTGTGCTC CGACGGCGAC
CTGATGGAGG GCGTCAGCCA CGAGGCGATC GCGCTCGCCG GCCACCTCAA GCTATCCAAG
CTGATCTTCC TCTACGACGA CAACGGCATC TCGATCGACG GTCCGCTGAC GTTGTCGGAC
AGCGTCGACC AGGTCGCCCG CTTCAAGGCG CACGGCTGGA ACGCGGTGCA GATCGACGGC
CACGACCAGA AGGCGATCGC CGCAGCCATC ACCCAGGCGC AGAGCTCCGA TCGTCCGACG
ATGATCGCCT GCAAGACCAC GATCGGATTC GGCGCGCCGC ACAAGGCCGG CACCTCCAAG
GCCCACGGCG AGCCGCTCGG CGCTGAGGAA TTGGCCGGCG CCAAGAAGGC GCTGGGCTGG
AATTACGGCC CGTTCGAGAT CCCCGAGGAC GTGCTGCACG CCTGGCGCAA TGTCGGCCGC
GAAGGCGCCA GCGCCCATGC CGATTGGCGA CAGCGTTTCG AGGCGATGGA AGACACCAAG
CGCAGCGAAT TCCAGCGCCG GGTGGTCGAT CGCAAGCGCC CCGAGGTGCT CGGCGACACC
ATCCGGGCGC TGAAAGACAA GCTGGTCGCC GAGCCGCAGA CCATCGCCAC CCGCAAGGCC
AGCGAACTGG TGCTGGAGGC GATCGAGCCG ATCATGCCGG AATTGCTGCT CGGCTCGGCC
GATCTGACGC CGTCCAACAA CACCCGCGTC AAGTCCGCCA AGGACGTCAA GCCGGGGGAT
TTTTCCGGCC GCTACATCCA TTACGGCATC CGCGAAATGG GCATGGCGGC GGCGATGAAC
GGTATCTCCG CGCATGGCGG CTTCGCTCCG GCCGGCGGTA CTTTCATGTG TTTCACCGAC
TATGCGCGCC CTTCGATGCG GATCGCGGCG CTGTCGCACA CCCCGGTGGT CTACATCATG
ACCCACGATT CCATCGGGCT CGGCGAAGAC GGCCCGACCC ACCAGCCGGT GGAGCATCTG
GCGTCCTTGC GGGCGATGCC GAACCTGCGG CTGTTCCGCC CGGCCGATCC GGTCGAGACC
GCCGAATGCT GGCAGCTGGC GCTGGAATAT ACCACCGGCC CGACCGTGCT GGCGCTGTCG
CGGCAGAATC TGCAGCCGGT TCGCACCACC TCCCACCCGG ACAATCTCTG CGCGCAGGGC
GCCTACGAGC TGGTCGCCGC CGACGGCGCG GCCGAAGTGT CGCTGTTCGC CTCCGGCTCC
GAGGTGGAAA TCGCGGTGGC CGCGCAGAAG CTATTGGCGG AGCGCGGCAT CCCGACCCGG
GTGGTCTCGG TACCCTCGCT CGACCTGCTG CTGCTGCAGA AGGACGAGGT TCGCCATGGC
ATCATCGGCG AGGCGCGGGT CAAAGTCGCG GTCGAGGCCG CGGTGCGGTT CGGCTGGGAC
GCGGTGATCG GGCCGGACGG CGGCTTCGTC GGCATGAGTT CGTTCGGCGC CAGCGCCCCG
GCGAAGGACC TCTACAAGCA TTTCGGCATC ACCGCGGAGG CCGTGGTGAA GGCCGCCACC
GAACGGCTGC AGCGGAACTA G
 
Protein sequence
MRWQPIGTAG RNVHDPLTQA VALMTQVDSS RMANAIRALA MDAVEKAKSG HPGLPMGAAD 
IATVLFSQFL KFDAEDPRWP DRDRFVLSAG HGSMLLYALF YLTGNEDVTL DQIKNFRQLD
SKTPGHPENF ITAGVETTTG PLGQGVASSV GTALAERLLA AEFGDDIVDH YTYVLCSDGD
LMEGVSHEAI ALAGHLKLSK LIFLYDDNGI SIDGPLTLSD SVDQVARFKA HGWNAVQIDG
HDQKAIAAAI TQAQSSDRPT MIACKTTIGF GAPHKAGTSK AHGEPLGAEE LAGAKKALGW
NYGPFEIPED VLHAWRNVGR EGASAHADWR QRFEAMEDTK RSEFQRRVVD RKRPEVLGDT
IRALKDKLVA EPQTIATRKA SELVLEAIEP IMPELLLGSA DLTPSNNTRV KSAKDVKPGD
FSGRYIHYGI REMGMAAAMN GISAHGGFAP AGGTFMCFTD YARPSMRIAA LSHTPVVYIM
THDSIGLGED GPTHQPVEHL ASLRAMPNLR LFRPADPVET AECWQLALEY TTGPTVLALS
RQNLQPVRTT SHPDNLCAQG AYELVAADGA AEVSLFASGS EVEIAVAAQK LLAERGIPTR
VVSVPSLDLL LLQKDEVRHG IIGEARVKVA VEAAVRFGWD AVIGPDGGFV GMSSFGASAP
AKDLYKHFGI TAEAVVKAAT ERLQRN