Gene Information |
Locus tag | RPB_1892 |
Symbol | |
ID | 3907971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2165111 |
End bp | 2167963 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883786 |
Product | bifunctional transaldolase/phosphoglucose isomerase |
Protein accession | YP_485511 |
Protein GI | 86749015 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase [COG0176] Transaldolase |
TIGRFAM ID | [TIGR00876] transaldolase, mycobacterial type |
|
|
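The length fields in this record follow from the coordinates by simple arithmetic. The sketch below shows that relationship. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. Both assumptions are consistent with the figures listed above, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2165111, 2167963)
print(length)                     # 2853
print(protein_length_aa(length))  # 950
```

Under these assumptions the computed values match the Gene Length (2853 bp) and Protein Length (950 aa) fields.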
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCCG TCAAAGCGCT CGAACAACAC GGCCAGGCAA TCTGGCTGGA TTTCTTGGCC CGCGGCTTTA TCGCCAAGGG CGACCTGACG AAGCTGATCG ACGCCGACGG CGTGAAGGGT GTCACCTCCA ACCCATCGAT CTTCGAAAAG GCGATCGGCT CCTCGGACGA ATATGACGGC GCGATCGGCG CCGCTCTCGC GCAGGGCGAC CGTTCGGTCG GCGAATTGTA CGAGGCGGTC GCGGTCGAGG ACATTCAGCA CGCAGCCGAT GTGCTGCGCC CGACCTACGA CAAGCTCGAA GGCCGCGACG GCTTCGTCAG CCTCGAGGTC TCGCCCTATC TGGCGCTCGA CACCAAGGCG ACGATCGTCG AAGCCGAGCG ACTGTGGGCC GCGGTGAAGC GCGAGAATTT GATGGTCAAG GTCCCTGCCA CACGGCAGGG CCTGCCCGCG ATCAAGCAGC TGATTTCCAA GGGCATCAGC GTCAACGTCA CGCTGCTGTT CTCGCAGAAG GTCTATGTCG AGGTCGCGGA GGCCTATCTC TCCGGGCTCG AAGCGCTGAT CGCGAACGGC GGCGACCCGT CACACGTCGC CAGCGTCGCC AGCTTCTTCG TCAGCCGGAT CGACAGCGCG GTCGACAAGG AACTCGACGA CAAGATCGCC ACCGCCAACG ATCCGGCCGA GAAGGCCCGG CTGGAGAAGC TGAAGGGCAA GATCGCGATC GCCAACGCCA AGATCGCTTA TCAGGACTAC AAGCGGCTGT TCTCCGGCGA CCGCTGGAAG AAGCTGCAGG TTTGTGGCGC CAAGCCGCAG CGACTGCTGT GGGCATCGAC CGGCACCAAG AACAAAGCCT ACAGCGACGT CGTCTATATC GAGGAGCTGA TCGGCCCCGA CACCGTCAAT ACCGTGCCGC CGGCGACGCT CGACGCCTTC CGCGATCACG GCAAGCCGCG CGCCAGCCTG GAAGAACACG TCGACGACGC CCGCGCGGCG CTGAAGGATC TCGCCAATGT CGGCGTCTCG CTCGATGCGA TCACCGACAG GCTGGTCACC GAAGCAGTGC AGTTGTTCGC GGATTCGTTC GACAAGCTGC TCGGCGCGGT GGCGTTCAAG CGCGAAACCG TGCTTGCCGG CGGCATCGAC ACCCAGAAGC TCGCACTCGC CGACGATCTC GCCAAATCCG TCAAGGAGCA TGGCGAGGAC TGGCGCAACA CCGGCAAGAT CCGTCGGCTG TGGGATCGTG ATAAATCGGT GTGGACCGGC ACCGACGAAG ACAAATGGCT CGGCTGGCTG ACTTCGGCCG CGGCGGAGAA GGCGAAGCTC GCCGACTACA CCGAATTCGC CAAATGGGTG AAGGCGCGCG GCTTCACCGA CGCCGTCGTG CTCGGCATGG GCGGATCGAG CCTCGGTCCC GAAGTGCTCG CCCACACCTT CGCGCAGCAG CCGGGCTTCC CGAAGCTGCA TGTGCTCGAC TCCACCGATC CGGCGCAGGT GCGCACGCTG GAGCACAGCG TCACGCTCGG AACGACGCTG TTCATCGTGT CGTCGAAATC CGGCGGCACC ACCGAGCCCA ACGTGATGAA GGATTACTTC TTCGCCCGCG TCGGCGAGAC CGTCGGCGCC GACAAGGCCG GACAGCATTT CGTCGCCGTC ACCGATCCCG GCTCGTCGAT GGAAAAGGTC GCGACCGAAG CGAAGTTCGC CCGCATCTTC CATGGCGATC CGACCATCGG CGGTCGCTAT TCGGTGCTGT CACCGTTCGG CATGGTGCCG GCCGCCGCGG CCGGCATCGA TCTCGGCCAA 
CTGCTCGACC TGACAATGGC GATGGTCCGC TCCTGCGGGC CTGACGTGCC GCCGCAGGAA AACCCGGGCG TGCAGCTCGG CCTCGCGATG GGCTGCGCCG GGCTGGAAGG CCGCGACAAG GTGACCATCA CGTCGTCGAA GGCGATCGCC GATTTCGGCG CCTGGGCCGA GCAACTGATC GCGGAATCGA CCGGCAAGGA CGGCAAGGGC CTGATTCCGA TCGACGGCGA GCCGCTGGCC GCGCCGTCGA CTTACGGCAA CGACCGGCTG TTCATCGATC TGCGCACCGA CGGCGAGAGC GACGCCGCGC ACGACGCCAA GCTCGCAGCA CTGGAGGACG CCGGCCATCC GGTGGTGCGG ATCGTGCTGA AATCAGCCGA CGCGATCGGC CAGGAGTTCT TCCGCTTCGA ACTCGCCACC GCGGTGGCCG GTGCGATCCT CGGCATCAAT CCGTTCAACC AGCCGGACGT CGAATCCGCC AAGATCAAGA CCCGCGAACT GACCGCGGCG TTCGAGACCT CCGGCGTCCT TCCGCCCGAG AAGCCGGCTC TGACGACGGC GGACGCCGAT CTCTACACCG ACGAGTCCAA CGTCGCGGCG CTACGCAAGG CCGGGGCCGA CGGCACGCTG GATTCGTGGA TCAAGGCGCA TCTCGCCCGC ACGCAGAGCG GCGACTATGT GGCGCTGCTG GCCTATATCG AACGCAATGC GGCGCATATC GACACGCTGC AGTCGATGCG CCTCGCGGTG CGCGACGCCA GGCATCTGGC GACCTGCGCC GAATTCGGCC CGCGCTTCCT GCACTCCACC GGCCAGGCCT ACAAGGGCGG GCCGGACAGC GGCGTGTTCC TGCAGATCAC CGCCGACGAT GCGAACGATC TCGCCGTCCC GGGCCAGAGC GCGAGCTTCG GCGTGATCAA GGCTGCGCAG GCCCGCGGCG ATTTCGACGT GCTCACGGAA CGTGGCCGTC GCGCGCTCCG CGTCCATCTC AAGGGCGACC TCGGGGCCGG ACTGAAGTCG CTGGATAAGG CGATCCGCGA CGCACTGAAC TAA
|
Protein sequence | MNPVKALEQH GQAIWLDFLA RGFIAKGDLT KLIDADGVKG VTSNPSIFEK AIGSSDEYDG AIGAALAQGD RSVGELYEAV AVEDIQHAAD VLRPTYDKLE GRDGFVSLEV SPYLALDTKA TIVEAERLWA AVKRENLMVK VPATRQGLPA IKQLISKGIS VNVTLLFSQK VYVEVAEAYL SGLEALIANG GDPSHVASVA SFFVSRIDSA VDKELDDKIA TANDPAEKAR LEKLKGKIAI ANAKIAYQDY KRLFSGDRWK KLQVCGAKPQ RLLWASTGTK NKAYSDVVYI EELIGPDTVN TVPPATLDAF RDHGKPRASL EEHVDDARAA LKDLANVGVS LDAITDRLVT EAVQLFADSF DKLLGAVAFK RETVLAGGID TQKLALADDL AKSVKEHGED WRNTGKIRRL WDRDKSVWTG TDEDKWLGWL TSAAAEKAKL ADYTEFAKWV KARGFTDAVV LGMGGSSLGP EVLAHTFAQQ PGFPKLHVLD STDPAQVRTL EHSVTLGTTL FIVSSKSGGT TEPNVMKDYF FARVGETVGA DKAGQHFVAV TDPGSSMEKV ATEAKFARIF HGDPTIGGRY SVLSPFGMVP AAAAGIDLGQ LLDLTMAMVR SCGPDVPPQE NPGVQLGLAM GCAGLEGRDK VTITSSKAIA DFGAWAEQLI AESTGKDGKG LIPIDGEPLA APSTYGNDRL FIDLRTDGES DAAHDAKLAA LEDAGHPVVR IVLKSADAIG QEFFRFELAT AVAGAILGIN PFNQPDVESA KIKTRELTAA FETSGVLPPE KPALTTADAD LYTDESNVAA LRKAGADGTL DSWIKAHLAR TQSGDYVALL AYIERNAAHI DTLQSMRLAV RDARHLATCA EFGPRFLHST GQAYKGGPDS GVFLQITADD ANDLAVPGQS ASFGVIKAAQ ARGDFDVLTE RGRRALRVHL KGDLGAGLKS LDKAIRDALN
|
| |
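The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the grouping, compute GC content, and check that a gene sequence looks like a complete coding sequence. The helper names are hypothetical, and the start-codon set is a commonly used subset for bacterial genes rather than the full list of initiators in translation table 11.

```python
# Assumption: a commonly used subset of bacterial start codons,
# not the complete initiator set of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGAACCCCG TCAAAGCGCT AA")
print(toy)
print(looks_like_complete_cds(toy))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_percent` should give a value comparable to the GC content field, assuming that field was computed over the gene rather than a wider region. Although the record lists the strand as "-", the gene sequence as printed begins with ATG and ends with TAA, so it appears to be given in coding orientation already.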