Gene RPD_3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3474 
Symbol 
ID4023988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3853655 
End bp3856507 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content66% 
IMG OID637963678 
Productbifunctional transaldolase/phosoglucose isomerase 
Protein accessionYP_570598 
Protein GI91977939 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase
[COG0176] Transaldolase 
TIGRFAM ID[TIGR00876] transaldolase, mycobacterial type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.438566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCG TCAAAGCGCT CGAACAACAC GGCCAGGCTG TCTGGCTGGA TTTCCTGGCC 
CGCGGCTTTA TCGCCAAGGG TGACCTGAAG AAGCTGATCG ACGGCGACGG CGTGAAGGGT
GTCACCTCCA ATCCGTCGAT CTTCGAGAAG GCGATCGGCT CGTCGGACGA ATATGACGGT
GCGATCGGGG CCGCCCTGAA GCAGGGCGAC CGTTCGGTCG GCGAATTGTA CGAGGCGGTG
GCGGTCGAGG ACATTCAGCA CGCGGCCGAC GTGCTGCGCC CGACCTACGA CAAGCTCGAA
GGCCGCGACG GCTTCGTCAG CCTCGAAGTT TCACCCTATC TGGCGCTCGA CACCAAGGCG
ACCATCGTCG AGGCCGAGCG GCTGTGGGGC GCGGTCAAGC GCGAGAACCT GATGGTCAAG
GTGCCGGCGA CCCGGCAGGG CCTGCCCGCG ATCAAGCACC TGATCTCGAA GGGCATCAGC
GTCAACATCA CGCTGCTGTT TTCGCAGAAA GTCTATGTCG AGGTCGCGGA AGCCTATCTC
TCGGGGCTCG AAGCCCTGAT CGCGAGCGGG GGCGACCCGT CGCATGTCGC CAGCGTCGCC
AGCTTCTTCG TCAGCCGGAT CGACAGTGCG GTCGACAAAG AACTCGACGA CAAGATTGCC
AAGGCCAACG ACCCCGCCGA GAAGGCGCGA TTGGAAAAGC TGAAGGGCAA GATCGCGATC
GCCAACGCCA AGCTCGCTTA TCAGGACTAC AAGCGGCTGT TCTCCGGCGA CCGCTGGAAG
AAGCTCGAGA TTTGCGGCGC CAAGCCGCAG CGGCTGCTGT GGGCCTCGAC CGGCACCAAG
AACAAGGCCT ACAGCGACGT GCTGTATATC GAGGAGCTGA TCGGACCCGA CACGGTCAAC
ACCGTGCCGC CGTCGACGCT CGATGCGTTC CGCGATCACG GCAAGGCCCG CGCCAGCCTC
GAAGAAAACG TCGACGACGC CCGCGCGGCG TTGAAGGACC TCGACGGCGT CGGCATCTCG
CTCGACAAGA TCACCGACAG GCTGGTGACC GAGGGCGTGC AGCTCTTCGC CGACGCATTC
GACAAGCTGC TCGGCGCGGT CGCATCCAAG CGCGAGACCG TGCTCGCCGG CGGCGTCAAT
ACGCAGAAGC TCGCGCTCGC GGCCGACCTC GCCAACTCCG TCAAGGAGCA CGGCGAGGAG
TGGCGCAACA CCGGCAAGAT TCGCAAGCTG TGGGACCAGG ACAAATCGGT GTGGACCGGC
GCCGACGAGG ACAAATGGCT CGGCTGGCTG AATTCCGCCG CAGCGGAAAA GGCGAAGCTC
GCCGACTACG CGGAGTTCGC GAAATGGGTG AAGGCGCGCG GCTTCACCGA TGCCGTCGTG
CTCGGCATGG GCGGGTCGAG CCTCGGCCCG GAAGTGCTGG CCAAGACCTT CGCACAGCAG
CCGGGCTTCC CGAAGCTGCA CGTTCTGGAC TCCACCGATC CGGCGCAGGT GCGCTCGCTG
GAGAGCAGCG TCACATTGGC GACCACGCTG TTCATCGTGT CGTCCAAATC CGGCGGCACC
ACCGAGCCGA ACGCGATGAA GGACTACTTC GTCGCGCGCG TCGGCGAGAA CGTCGGCGTC
GACAAGGCCG GCCAACATTT CGTCGCGGTG ACCGATCCCG GCTCGTCGAT GGAGAAGGTC
GCGACCGCAG CGAAATTCGC CCGGATCTTC CACGGCAATC CGACGATCGG CGGTCGCTAT
TCGGTGCTGT CGCCGTTCGG CATGGCGCCG GCCGCCGCCG CCGGCCTCGA TCTCGGAAAG
TTCCTCGATC TGACACTCGC AATGGTGCGC TCCTGCGGGC CGGACGTGCC GCCGCAGGAA
AATCCCGGCG TGCAGCTCGG ACTTGCGATG GGCTGCGCTG GCCTGCAAGG CCGCGACAAG
GTGACGATCA CCTCCTCCAG GGCGATCGCC GATTTCGGCG CCTGGGCCGA GCAACTGATC
GCCGAGTCGA CCGGCAAGGA CGGCAAGGGA CTGATCCCGA TCGACGGCGA GCCGCTGGCC
GAGCCCTCGA CCTACGGCAA CGACCGGCTG TTCATCGATC TGCGTATCGA GAGCGAAAGC
GACGCCGCGC ATGACGGCAA GCTCGCGGCG CTGGAGCAAG CAGGACATCC GGTGGTGCGG
ATCGTGCTGA AATCGCCCGA CGCCATCGGC CAGGAGTTCT TCCGCTTCGA ATTCGCCACC
GCGGTCGCGG GCGCAATCCT CGGCATCAAT CCGTTCAATC AGCCGGATGT GGAATCCGCC
AAGATCAAGA CCCGCGAACT GACCGCGGCG TTCGAGACAT CCGGCGCACT GCCCGCGGAG
AAGCCGGCGC TGGCCACCGC GCAAGCCGAT CTCTACACCG ACGAGTCCAA CGCCGCCGCG
CTACGCAAGG CCGGCGCCGA CGGCACGCTC GGCTCGTGGA TCAAGGCGCA TCTGTCGCGC
TCGCAGGCCG GCGACTACGT CGCGCTGCTC GCCTATATCG AGCGGAATGC CGCGCATATC
GACGCGCTGC AGACGATGCG GCTCGCGGTG CGCGACGCCA GGCATCTGGC GACCTGCGCC
GAGTTCGGTC CGCGCTTCCT GCACTCGACC GGCCAGGCCT ACAAAGGCGG ACCGGACAGC
GGCGTGTTTC TGCAGATCAC CGCCGACGAC GCCGAGGATC TTCCCGTCCC CGGCCAGACC
GCCAGCTTCG GCGTGATCAA GGCGGCGCAG GCCCGCGGCG ATTTCGACGT GCTGACCGAA
CGCGGCCGCC GCGCGCTGCG GGTCCACATC AAAGGCGATC TCGGCGCCGG ACTGAAAGCG
CTCGACGCCG CGATCCGCGA CGCCTTGAAC TGA
 
Protein sequence
MNPVKALEQH GQAVWLDFLA RGFIAKGDLK KLIDGDGVKG VTSNPSIFEK AIGSSDEYDG 
AIGAALKQGD RSVGELYEAV AVEDIQHAAD VLRPTYDKLE GRDGFVSLEV SPYLALDTKA
TIVEAERLWG AVKRENLMVK VPATRQGLPA IKHLISKGIS VNITLLFSQK VYVEVAEAYL
SGLEALIASG GDPSHVASVA SFFVSRIDSA VDKELDDKIA KANDPAEKAR LEKLKGKIAI
ANAKLAYQDY KRLFSGDRWK KLEICGAKPQ RLLWASTGTK NKAYSDVLYI EELIGPDTVN
TVPPSTLDAF RDHGKARASL EENVDDARAA LKDLDGVGIS LDKITDRLVT EGVQLFADAF
DKLLGAVASK RETVLAGGVN TQKLALAADL ANSVKEHGEE WRNTGKIRKL WDQDKSVWTG
ADEDKWLGWL NSAAAEKAKL ADYAEFAKWV KARGFTDAVV LGMGGSSLGP EVLAKTFAQQ
PGFPKLHVLD STDPAQVRSL ESSVTLATTL FIVSSKSGGT TEPNAMKDYF VARVGENVGV
DKAGQHFVAV TDPGSSMEKV ATAAKFARIF HGNPTIGGRY SVLSPFGMAP AAAAGLDLGK
FLDLTLAMVR SCGPDVPPQE NPGVQLGLAM GCAGLQGRDK VTITSSRAIA DFGAWAEQLI
AESTGKDGKG LIPIDGEPLA EPSTYGNDRL FIDLRIESES DAAHDGKLAA LEQAGHPVVR
IVLKSPDAIG QEFFRFEFAT AVAGAILGIN PFNQPDVESA KIKTRELTAA FETSGALPAE
KPALATAQAD LYTDESNAAA LRKAGADGTL GSWIKAHLSR SQAGDYVALL AYIERNAAHI
DALQTMRLAV RDARHLATCA EFGPRFLHST GQAYKGGPDS GVFLQITADD AEDLPVPGQT
ASFGVIKAAQ ARGDFDVLTE RGRRALRVHI KGDLGAGLKA LDAAIRDALN