Gene RPB_1892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1892 
Symbol 
ID3907971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2165111 
End bp2167963 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content66% 
IMG OID637883786 
Productbifunctional transaldolase/phosoglucose isomerase 
Protein accessionYP_485511 
Protein GI86749015 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase
[COG0176] Transaldolase 
TIGRFAM ID[TIGR00876] transaldolase, mycobacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCG TCAAAGCGCT CGAACAACAC GGCCAGGCAA TCTGGCTGGA TTTCTTGGCC 
CGCGGCTTTA TCGCCAAGGG CGACCTGACG AAGCTGATCG ACGCCGACGG CGTGAAGGGT
GTCACCTCCA ACCCATCGAT CTTCGAAAAG GCGATCGGCT CCTCGGACGA ATATGACGGC
GCGATCGGCG CCGCTCTCGC GCAGGGCGAC CGTTCGGTCG GCGAATTGTA CGAGGCGGTC
GCGGTCGAGG ACATTCAGCA CGCAGCCGAT GTGCTGCGCC CGACCTACGA CAAGCTCGAA
GGCCGCGACG GCTTCGTCAG CCTCGAGGTC TCGCCCTATC TGGCGCTCGA CACCAAGGCG
ACGATCGTCG AAGCCGAGCG ACTGTGGGCC GCGGTGAAGC GCGAGAATTT GATGGTCAAG
GTCCCTGCCA CACGGCAGGG CCTGCCCGCG ATCAAGCAGC TGATTTCCAA GGGCATCAGC
GTCAACGTCA CGCTGCTGTT CTCGCAGAAG GTCTATGTCG AGGTCGCGGA GGCCTATCTC
TCCGGGCTCG AAGCGCTGAT CGCGAACGGC GGCGACCCGT CACACGTCGC CAGCGTCGCC
AGCTTCTTCG TCAGCCGGAT CGACAGCGCG GTCGACAAGG AACTCGACGA CAAGATCGCC
ACCGCCAACG ATCCGGCCGA GAAGGCCCGG CTGGAGAAGC TGAAGGGCAA GATCGCGATC
GCCAACGCCA AGATCGCTTA TCAGGACTAC AAGCGGCTGT TCTCCGGCGA CCGCTGGAAG
AAGCTGCAGG TTTGTGGCGC CAAGCCGCAG CGACTGCTGT GGGCATCGAC CGGCACCAAG
AACAAAGCCT ACAGCGACGT CGTCTATATC GAGGAGCTGA TCGGCCCCGA CACCGTCAAT
ACCGTGCCGC CGGCGACGCT CGACGCCTTC CGCGATCACG GCAAGCCGCG CGCCAGCCTG
GAAGAACACG TCGACGACGC CCGCGCGGCG CTGAAGGATC TCGCCAATGT CGGCGTCTCG
CTCGATGCGA TCACCGACAG GCTGGTCACC GAAGCAGTGC AGTTGTTCGC GGATTCGTTC
GACAAGCTGC TCGGCGCGGT GGCGTTCAAG CGCGAAACCG TGCTTGCCGG CGGCATCGAC
ACCCAGAAGC TCGCACTCGC CGACGATCTC GCCAAATCCG TCAAGGAGCA TGGCGAGGAC
TGGCGCAACA CCGGCAAGAT CCGTCGGCTG TGGGATCGTG ATAAATCGGT GTGGACCGGC
ACCGACGAAG ACAAATGGCT CGGCTGGCTG ACTTCGGCCG CGGCGGAGAA GGCGAAGCTC
GCCGACTACA CCGAATTCGC CAAATGGGTG AAGGCGCGCG GCTTCACCGA CGCCGTCGTG
CTCGGCATGG GCGGATCGAG CCTCGGTCCC GAAGTGCTCG CCCACACCTT CGCGCAGCAG
CCGGGCTTCC CGAAGCTGCA TGTGCTCGAC TCCACCGATC CGGCGCAGGT GCGCACGCTG
GAGCACAGCG TCACGCTCGG AACGACGCTG TTCATCGTGT CGTCGAAATC CGGCGGCACC
ACCGAGCCCA ACGTGATGAA GGATTACTTC TTCGCCCGCG TCGGCGAGAC CGTCGGCGCC
GACAAGGCCG GACAGCATTT CGTCGCCGTC ACCGATCCCG GCTCGTCGAT GGAAAAGGTC
GCGACCGAAG CGAAGTTCGC CCGCATCTTC CATGGCGATC CGACCATCGG CGGTCGCTAT
TCGGTGCTGT CACCGTTCGG CATGGTGCCG GCCGCCGCGG CCGGCATCGA TCTCGGCCAA
CTGCTCGACC TGACAATGGC GATGGTCCGC TCCTGCGGGC CTGACGTGCC GCCGCAGGAA
AACCCGGGCG TGCAGCTCGG CCTCGCGATG GGCTGCGCCG GGCTGGAAGG CCGCGACAAG
GTGACCATCA CGTCGTCGAA GGCGATCGCC GATTTCGGCG CCTGGGCCGA GCAACTGATC
GCGGAATCGA CCGGCAAGGA CGGCAAGGGC CTGATTCCGA TCGACGGCGA GCCGCTGGCC
GCGCCGTCGA CTTACGGCAA CGACCGGCTG TTCATCGATC TGCGCACCGA CGGCGAGAGC
GACGCCGCGC ACGACGCCAA GCTCGCAGCA CTGGAGGACG CCGGCCATCC GGTGGTGCGG
ATCGTGCTGA AATCAGCCGA CGCGATCGGC CAGGAGTTCT TCCGCTTCGA ACTCGCCACC
GCGGTGGCCG GTGCGATCCT CGGCATCAAT CCGTTCAACC AGCCGGACGT CGAATCCGCC
AAGATCAAGA CCCGCGAACT GACCGCGGCG TTCGAGACCT CCGGCGTCCT TCCGCCCGAG
AAGCCGGCTC TGACGACGGC GGACGCCGAT CTCTACACCG ACGAGTCCAA CGTCGCGGCG
CTACGCAAGG CCGGGGCCGA CGGCACGCTG GATTCGTGGA TCAAGGCGCA TCTCGCCCGC
ACGCAGAGCG GCGACTATGT GGCGCTGCTG GCCTATATCG AACGCAATGC GGCGCATATC
GACACGCTGC AGTCGATGCG CCTCGCGGTG CGCGACGCCA GGCATCTGGC GACCTGCGCC
GAATTCGGCC CGCGCTTCCT GCACTCCACC GGCCAGGCCT ACAAGGGCGG GCCGGACAGC
GGCGTGTTCC TGCAGATCAC CGCCGACGAT GCGAACGATC TCGCCGTCCC GGGCCAGAGC
GCGAGCTTCG GCGTGATCAA GGCTGCGCAG GCCCGCGGCG ATTTCGACGT GCTCACGGAA
CGTGGCCGTC GCGCGCTCCG CGTCCATCTC AAGGGCGACC TCGGGGCCGG ACTGAAGTCG
CTGGATAAGG CGATCCGCGA CGCACTGAAC TAA
 
Protein sequence
MNPVKALEQH GQAIWLDFLA RGFIAKGDLT KLIDADGVKG VTSNPSIFEK AIGSSDEYDG 
AIGAALAQGD RSVGELYEAV AVEDIQHAAD VLRPTYDKLE GRDGFVSLEV SPYLALDTKA
TIVEAERLWA AVKRENLMVK VPATRQGLPA IKQLISKGIS VNVTLLFSQK VYVEVAEAYL
SGLEALIANG GDPSHVASVA SFFVSRIDSA VDKELDDKIA TANDPAEKAR LEKLKGKIAI
ANAKIAYQDY KRLFSGDRWK KLQVCGAKPQ RLLWASTGTK NKAYSDVVYI EELIGPDTVN
TVPPATLDAF RDHGKPRASL EEHVDDARAA LKDLANVGVS LDAITDRLVT EAVQLFADSF
DKLLGAVAFK RETVLAGGID TQKLALADDL AKSVKEHGED WRNTGKIRRL WDRDKSVWTG
TDEDKWLGWL TSAAAEKAKL ADYTEFAKWV KARGFTDAVV LGMGGSSLGP EVLAHTFAQQ
PGFPKLHVLD STDPAQVRTL EHSVTLGTTL FIVSSKSGGT TEPNVMKDYF FARVGETVGA
DKAGQHFVAV TDPGSSMEKV ATEAKFARIF HGDPTIGGRY SVLSPFGMVP AAAAGIDLGQ
LLDLTMAMVR SCGPDVPPQE NPGVQLGLAM GCAGLEGRDK VTITSSKAIA DFGAWAEQLI
AESTGKDGKG LIPIDGEPLA APSTYGNDRL FIDLRTDGES DAAHDAKLAA LEDAGHPVVR
IVLKSADAIG QEFFRFELAT AVAGAILGIN PFNQPDVESA KIKTRELTAA FETSGVLPPE
KPALTTADAD LYTDESNVAA LRKAGADGTL DSWIKAHLAR TQSGDYVALL AYIERNAAHI
DTLQSMRLAV RDARHLATCA EFGPRFLHST GQAYKGGPDS GVFLQITADD ANDLAVPGQS
ASFGVIKAAQ ARGDFDVLTE RGRRALRVHL KGDLGAGLKS LDKAIRDALN