Gene RPB_4466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4466 
Symbol 
ID3912282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5056019 
End bp5058007 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content68% 
IMG OID637886369 
Producttransketolase 
Protein accessionYP_488060 
Protein GI86751564 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.913477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGG TCGATCATTC CCGTATGGCG AACGCAATCC GGGCGCTGGC GATGGACGCG 
GTCGAGAAGG CCAAATCCGG CCACCCCGGC CTGCCGATGG GCGCCGCCGA CGTCGCCACA
GTGCTGTTCA CCGAGTTCCT GAAATTCGAC GCCGCCGATG CGCACTGGCC GGATCGCGAC
CGCTTCATCC TGTCCGCCGG CCACGGCTCG ATGCTGCTCT ATGCGCTGTT GTACCTCACC
GGTAATTCCG AGCTGACGCT CGACCAGATC AAGGCGTTCC GCCAGCTCGA CTCCAAGACG
CCCGGGCACC CGGAGAACTG CATCACAGAT TCGGTTGAAA CCACCACCGG CCCGCTCGGC
CAGGGCGTGG CGTCGTCGGT CGGCACCGCG CTGGCGGAGC GTCTGCTCGC CGCCGAATTC
GGCGAGATCG TCGATCACAC CACCTACGTG CTGTGCTCGG ACGGCGATCT GATGGAAGGC
GTGAGCCACG AGGCGATCGC GCTGGCCGGC CATCTCAGGC TGTCGAAGCT GATCTTCCTC
TACGACGACA ACGGCATCTC GATCGACGGC CCGCTGACGC TGACCGACAA TGTCGATCAG
GTCGCGCGCT TCCAGGCGCA TGGCTGGAAT GCGATGCGGA TCGACGGCCA CGATCACAAG
GCGATCGCCG AGGCGATCAA GGCCGCGAAA GCCTCCGACC GGCCGACCAT GATCGCCTGC
AAGACCACGA TCGGCTTCGG CGCCCCCACC CGGGCCGGGA CCTCCAAGGC GCATGGCGAG
CCGCTCGGCG CCGAAGAACT GGCCGGCGCC AAGAAGGCGC TGGGCTGGGA CTACGGCCCG
TTCGAAATTC CGGACGACGT GCTCTCCGCT TGGCGCGCGG TCGGCGCCAA GGGCGCCAAG
GCCCACGCGG AATGGCAGTC GAAGTTCGAC GCGATGGACA AGGAGCTGCG CGCCGAATTC
CAGCGCCGGG TGATCGACCG CAAGCGGCCG GCGGCGCTCG ACGGCGCGAT CCGCAAGCTC
AAGGACAGGC TCGTCGCGGA GCCGCAGACC ATCGCCACCC GCAAGGCCAG CGAGCTGGCG
CTGGAGGCCA TCGTCGAGGT GGTGCCGGAA ATGCTGCTCG GCTCGGCGGA CCTGACGCCG
TCCAACAACA CCCGCACCAA ACACGCGAAA GACGTCACCC CGGACGACTT CTCGGGTCGC
TACATCCATT ACGGCATCCG CGAAATGGGC ATGGCGGCGG CGATGAACGG TATCGCGATG
CATGGCGGTT TCGCGCCGGC CGGCGGCACC TTCATGTGCT TCGCCGACTA CGCCCGCCCG
TCGATGCGGA TCGCGGCGCT GTCGCATGTC CCGGTGGTCT ACATCATGAC CCATGATTCG
ATCGGGCTCG GCGAAGACGG CCCGACGCAT CAGCCGGTCG AGCACCTCGC TTCGCTGCGG
GCGATGCCCA ATATGCGGGT GTTCCGCCCG GCCGACCCGG TCGAGACCGC CGAATGCTGG
CAGCTCGCGC TGGAGAACAC CAAGGGCCCG ACGGTGCTGG CGCTGTCGCG GCAGAATCTG
ACGCCGGTGC GCACCAGCAA ATCCGACGAC AACCGCTGCG CCCGGGGCGC CTATGAGCTG
ATCGCCGCCG ACGGCAAGGC TCAGGTGACG ATCTTCGCCA CCGGCTCCGA GGTCGAGATC
GCGGTCGCGG CGCACAAGCT GCTCGCCGCC AAGGGCATCG CTGCGCGCGT GGTGTCGGTG
CCGTCGCTGG ATCTCTTGCT GCAGCAGGAC GACGCCACCC GCAAGGCGAT CATCGGCGAC
GCCCCGGTCA AGGTCGCGGT CGAGGCCGCG GTGCGGTTCG GCTGGGACGC GGTGATCGGC
CCGGAGGGCG GCTTCATCGG CATGTCGAGC TTCGGCGCCA GCGCGCCCGC GAAGGATCTG
TACAAGCATT TCGGAATTAC CGCCGAGGCG GTCGCAGAGG CTGCGGCAAG CCGTCTCGGC
GGCAAGTAA
 
Protein sequence
MAKVDHSRMA NAIRALAMDA VEKAKSGHPG LPMGAADVAT VLFTEFLKFD AADAHWPDRD 
RFILSAGHGS MLLYALLYLT GNSELTLDQI KAFRQLDSKT PGHPENCITD SVETTTGPLG
QGVASSVGTA LAERLLAAEF GEIVDHTTYV LCSDGDLMEG VSHEAIALAG HLRLSKLIFL
YDDNGISIDG PLTLTDNVDQ VARFQAHGWN AMRIDGHDHK AIAEAIKAAK ASDRPTMIAC
KTTIGFGAPT RAGTSKAHGE PLGAEELAGA KKALGWDYGP FEIPDDVLSA WRAVGAKGAK
AHAEWQSKFD AMDKELRAEF QRRVIDRKRP AALDGAIRKL KDRLVAEPQT IATRKASELA
LEAIVEVVPE MLLGSADLTP SNNTRTKHAK DVTPDDFSGR YIHYGIREMG MAAAMNGIAM
HGGFAPAGGT FMCFADYARP SMRIAALSHV PVVYIMTHDS IGLGEDGPTH QPVEHLASLR
AMPNMRVFRP ADPVETAECW QLALENTKGP TVLALSRQNL TPVRTSKSDD NRCARGAYEL
IAADGKAQVT IFATGSEVEI AVAAHKLLAA KGIAARVVSV PSLDLLLQQD DATRKAIIGD
APVKVAVEAA VRFGWDAVIG PEGGFIGMSS FGASAPAKDL YKHFGITAEA VAEAAASRLG
GK