Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4360 |
Symbol | |
ID | 3912175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4943741 |
End bp | 4945552 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637886266 |
Product | thiol:disulfide interchange protein |
Protein accession | YP_487958 |
Protein GI | 86751462 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0322676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACC GACTTTTCCG CGCGCTCATC CTGCTCGTCG CGATCGCAGC CGGCGTCCAC GGCGCGCTCG CCGCCGGTCC GGCGCAGGCC GCGCCGTTCG AACTCAGCGT ACAGCCCGGC GCCGACGGCG TCGATCTGAC CTGGCGGATC GCCCCGGACG ATTATCTCTA TCGCGACAAG ATCGTCGTCA CCACCGCCGA CGGCGCGCGC GTCGCGCTGC GGACGCCGGC CGGCGAGATC AAGGACGATC CGAATTTCGG CATGACCGAA ATCTATCACC GCAGCCTGAC CGCGACGATC CCCGCCGACG CGGTGAACGG CGCGAGCCGG CTGACGGTGA GCTATCAGGG CTGCGCCGAG CGCGGCATCT GCTATCCGCC GGTGACGGCG AGCGTCGATC TCGGCACCTA TCAGGTGTCG ATCGCCGGCG GCGCGACGCC GAGTGCGAGC CCGGCCTGGC CCTCGGACAT GCCGGCGCTG CCGGACCTCG CCGCGCCCGC GGAGACGCCC GCACCCTTCG CGGCTTCGGT GCTGCCGTCG ATGACGCAAG GCTGGTTGCC GCTGCTGCTG GCGTTCGCCG GGTTCGGGCT GCTGCTGGCG TTCACGCCCT GCGTGCTGCC GATGATGCCG ATCGTCGCCG GCATGCTGAC GCGCTCCGGC CCCAACCTCT CGCCGGCGCG CGGCTTCGCA CTGGCCTCGA TCTACACGCT GGCGATGGCC GCGGCCTATG CGACGCTCGG CGTCGCGGCG GCGTGGTCCG GGCAGAATCT GCAGAGCGCG TTGCAGGCGC CGCTGGCGCT GGCGGTGATG GCGGCGATCT ATGTCGCGCT GGCGCTGTCG AGCTTCGGCC TGTTCGAGCT GCAACTGCCG GCGCGGTTCG GCGGCGATCT CGCCGGCCGC CTGCACGGCC GCGCCGGACC GTTGCTCGGC GCCGCCGCGC TCGGCTTCAC CTCGGCGCTG ATCGTCGGGC CGTGCGTGAC CCCACCGCTC GCCGCGGCGC TGCTCTACGT CGCACAGACC GGTGACATGC TGCGCGGCGC GGCGGCGCTG TTCGCGCTCG GCCTCGGCAT GGGCTTTCCG CTGATCCTGG TCGGGCTGTT CGGCGCCGGC GTGCTGCCGC GCTCGGGGCC GTGGCTGGTG ACGATCCGCC AATTGTTCGG CTTCGCCTTT CTCGGCCTCG CGGTGGCGTT GATTGCGCGG GTGCTGCCGG GATCGGTGGC GCTGCTGCTG TGGGCCGGCC TCGCCATCGG CCTCGCGGCG TTTCTCGGCG CGTTCGACCG GCTCGCGCCG CAGGGCGGCG CGACGCGGCG TTTCGGCAAG GCGGCGGGCG TCGCCGTGTT CGTCTATGGC GCGACGCTGA TCGTCGGCGC CGCCGGCGGC AGCGACGATC CGCTGCGGCC GCTCGCGGTG TTCGGCGCCG ACCAGCCAGC GGCGGCCGCG ATGTTCGCCG CGCCGGTGAC GTCGATCCGC GCGCTGGATC AGGCGATCAG CGACGGCCGC GCGCGCGGCA AGCCGATCAT GATCGACTTC TCCGCCGACT GGTGCACCTC GTGCAAGACC ATGGAGCGCG AGGTGTTCGG CGATCCCGCG ATCCGGCAGC GGCTGCAGGA TCTCACGCTG ATCCGCGCCG ACGTCACCCG GTCCGACGCC GAGACCGCGG CGCTGATGAA GCGCTTCGAC GTCGTCGGGC CACCGACGGT GGTGTTTCTC GATCGGCGCG ACGGCCGCGA AATCACCGCC GCCCGCACCG TCGGCGAAGT GACCGCCGAC AGCTTCGTCA AGACGCTGCA GCGCGTCGGC GCGGCGTCCT GA
|
Protein sequence | MTDRLFRALI LLVAIAAGVH GALAAGPAQA APFELSVQPG ADGVDLTWRI APDDYLYRDK IVVTTADGAR VALRTPAGEI KDDPNFGMTE IYHRSLTATI PADAVNGASR LTVSYQGCAE RGICYPPVTA SVDLGTYQVS IAGGATPSAS PAWPSDMPAL PDLAAPAETP APFAASVLPS MTQGWLPLLL AFAGFGLLLA FTPCVLPMMP IVAGMLTRSG PNLSPARGFA LASIYTLAMA AAYATLGVAA AWSGQNLQSA LQAPLALAVM AAIYVALALS SFGLFELQLP ARFGGDLAGR LHGRAGPLLG AAALGFTSAL IVGPCVTPPL AAALLYVAQT GDMLRGAAAL FALGLGMGFP LILVGLFGAG VLPRSGPWLV TIRQLFGFAF LGLAVALIAR VLPGSVALLL WAGLAIGLAA FLGAFDRLAP QGGATRRFGK AAGVAVFVYG ATLIVGAAGG SDDPLRPLAV FGADQPAAAA MFAAPVTSIR ALDQAISDGR ARGKPIMIDF SADWCTSCKT MEREVFGDPA IRQRLQDLTL IRADVTRSDA ETAALMKRFD VVGPPTVVFL DRRDGREITA ARTVGEVTAD SFVKTLQRVG AAS
|
| |