Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4146 |
Symbol | |
ID | 3911954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4721591 |
End bp | 4724644 |
Gene Length | 3054 bp |
Protein Length | 1017 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886050 |
Product | excinuclease ABC subunit B |
Protein accession | YP_487749 |
Protein GI | 86751253 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.919849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.885523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATGA AACCGCGGTC GCGAACCCTT TATTCGGCCC GTGCTCCGCC CCATATTGCC GGCATGGCGA AGACCCCCGA CCAATCCGCG AAGCCCACAT CGAAAGCGCC GACATCCAAG GCGCCGAAAT CCAAGCCGCC GAACTCCAAG GCGCACCGCC CCGACGTGCA ACCGATCGGG CCGGCGCTGG CCGAGTTGCT CAATCCCGCG ATCAATCGCG GAGATGCCGG CATGGGCTCG GGCACCGGCC TGCAGCCGCC GCCGGACAAT TCGCGCGACC GCCGCACCGG CGGCGAGGCC GCGGTGCATC GCGGCCGGGC CTCGACGGCG AAAACAGTCG GCGACGAAGC CGCGCCGCGG CCGACGCCAT TGCAACCCGC GCCGCAGCCG CCGGGCGCGC GTCGCGGCGG CTTCGACGAA GCCCCGCAAG CGACCTACGG CACCGCCGCC ACCATCCCGA CGCTGGATCC GGAACTGGCG CGGCAGCTCG GGCTGCCGAC CGAGGAGGAC GACGCGGCCG CGATGGCGCG GCCGCCGCGC AACAAGATGG AAGCACTCGG CGTGCAGGCC ACCGCCGAGG CGCTGGAGGC GCTGATCCGC GACGGCCGGC CGGAATTCAA GGGCGACGAC GGCAACGTCA AGCTGTGGGT GCCGCACCGG CCGCCGCGGC CGGAGAAATC CGAAGGCGGC GTGCGCTTCG TCATCAAGTC GGACTACGAG CCGAAGGGCG ACCAGCCGAC CGCCATCAAG GAACTGGTCG AAGGCATCGC GCGCAACGAT CGGACCCAGG TGCTGCTCGG CGTCACCGGC TCGGGCAAGA CCTACACCAT GGCCAAGGTG ATCGAGGCGA CGCAGCGCCC GGCGATCATC CTGGCGCCGA ACAAGACGCT GGCGGCGCAG CTCTACGGCG AGTTCAAGAG CTTCTTCCCC GACAACGCCG TCGAGTACTT CGTCTCGTAT TACGACTACT ACCAGCCGGA AGCCTACGTC CCGCGCACCG ACACCTATAT CGAGAAGGAC TCCTCGATCA ACGAGCAGAT CGACCGGATG CGGCATTCGG CGACCCGTGC GCTGCTGGAG CGCGACGACG TCATCATCGT GGCGTCGGTG TCGTGCATCT ACGGTATCGG CTCGGTCGAG ACCTACACGG CGATGACCTT CGCGCTGAAG AAGGGCGAGC GGATCGACCA GCGCGCGCTG ATCGCCGATC TGGTCGCGCT GCAATACAAG CGGACGCAGG CCGACTTCAC CCGCGGCACG TTTCGCGTGC GCGGTGATGT GATCGACATT TTCCCGGCGC ACTACGAGGA TCGCGCCTGG CGGGTGAAGA TGTTCGGCGA CGAGGTCGAA GCCATCGAGG AGTTCGACCC GCTCACCGGC CACAAGCAGG ACGAGCTGGA ATTCGTCAAG ATCTACGCCA ATTCGCACTA TGTGACGCCG CGGCCGACGC TGATCCAGGC GATCAAGTCG ATCAAGTCCG AACTGAAATG GCGGCTCGAC CAATTGCACG CGCAGGGACG CCTCTTGGAG GCGCAGCGGC TGGAGCAGCG CACCACCTTC GACATCGAGA TGATGGAAGC GACCGGCAGC TGCGCCGGCA TCGAGAACTA CTCACGCTAC CTCACCGGCC GCCGCCCCGG CGAGCCGCCG CCGACGCTGT TCGAATACGT GCCCGACAAC GCGCTGGTGT TCGCCGACGA GAGCCACGTC ACCGTGCCGC AGATCGGCGG CATGTTCAAA GGCGACTTCC GGCGCAAGGC GACGCTGGCC GAATACGGCT TCCGGCTGCC GTCCTGCATG GACAACCGGC CGCTGCGCTT CGAGGAATGG GACATGATGC GGCCGCAGAG CGTGGCCGTG TCGGCGACGC CGGCGGCGTG GGAGCTGAAC GAGAGCGGCG GCGTGTTCGT CGAACAAGTC ATCCGCCCGA CCGGGCTGAT CGACCCGCCG GTCGACATCC GCCCGGCGCG CACGCAAGTG GACGATCTCG TCGGCGAAGT CCGCGCCACC GCCAATGCCG GCTATCGCTC GCTGATCACC GTGCTGACCA AGCGGATGGC CGAGGACCTC ACCGAGTTCC TGCACGAGCA GGGCATTCGT GTGCGCTACA TGCATTCGGA CATCGACACC ATCGAGCGCA TCGAGATCAT CCGCGACCTG CGGCTCGGCG CGTTCGACGC GCTGGTCGGC ATCAATCTGC TGCGCGAAGG CCTCGACATT CCGGAATGCG CGCTGGTGGC GATCCTCGAC GCGGACAAAG AGGGATTCCT GCGCAGCGAG ACCTCGCTGA TCCAGACCAT CGGCCGCGCC GCGCGCAACG TCGACGGCAA GGTGATCCTC TACGCCGATC ACGTCACCGG TTCGATGCAG CGCGCGATGG ACGAGACCGG CCGCCGCCGC GAGAAGCAGA TCGAATACAA CACCGCGCAC GGCATCACGC CGGAGAGCAT CAAGAAGTCG ATCGGCGACA TCCTGGGCTC GGTGTACGAG CGCGACCACG TGCTGGTCGA GATCGGCGAT GGCAAGGGCT CGGGCTTCAC CGACGACGCG GCGGTGATCG GGCACAATTT CGGCGCGGTG CTGGCCGACC TCGAAACGCG GATGCGCGAG GCCGCGGCCG ACCTGAACTT CGAGGAAGCC GCGCGACTGC GCGACGAAGT CAAACGCCTG CGCGCGACCG AACTGGCGGT GAGCGACGAT CCCACGGTGA AGCAGCGCGG CGTCGCGGCG AAGGCCGGCA GCTACAAGGG CGACAAACAG TTCGGCGCTT CGGCCAATCT GCCGAAACTC TCGACCGAAC GCGGCGGCAA CAACACCCCG CGCAGCAAGG TGCACAAGCC CGACCTCGAC GAAATGGGCA TCGCCGGCTG GCACGAAGTC AAGAAAGTGC AACGCGCCAA GCCGCGCAAG CCGACGCTCG ACGAGATGGG CCCGGGGACG GAGAGCAAGA TCTTCCAGCC GAAGAATTCA CGCGAGTCCG GCCCGGAATT CGGCCCGGCG CCGCGGAGCA GTGGCGGCGC GCCGGGGCAT CGGGGCGGGT GGAAGAAGAG GTAG
|
Protein sequence | MRMKPRSRTL YSARAPPHIA GMAKTPDQSA KPTSKAPTSK APKSKPPNSK AHRPDVQPIG PALAELLNPA INRGDAGMGS GTGLQPPPDN SRDRRTGGEA AVHRGRASTA KTVGDEAAPR PTPLQPAPQP PGARRGGFDE APQATYGTAA TIPTLDPELA RQLGLPTEED DAAAMARPPR NKMEALGVQA TAEALEALIR DGRPEFKGDD GNVKLWVPHR PPRPEKSEGG VRFVIKSDYE PKGDQPTAIK ELVEGIARND RTQVLLGVTG SGKTYTMAKV IEATQRPAII LAPNKTLAAQ LYGEFKSFFP DNAVEYFVSY YDYYQPEAYV PRTDTYIEKD SSINEQIDRM RHSATRALLE RDDVIIVASV SCIYGIGSVE TYTAMTFALK KGERIDQRAL IADLVALQYK RTQADFTRGT FRVRGDVIDI FPAHYEDRAW RVKMFGDEVE AIEEFDPLTG HKQDELEFVK IYANSHYVTP RPTLIQAIKS IKSELKWRLD QLHAQGRLLE AQRLEQRTTF DIEMMEATGS CAGIENYSRY LTGRRPGEPP PTLFEYVPDN ALVFADESHV TVPQIGGMFK GDFRRKATLA EYGFRLPSCM DNRPLRFEEW DMMRPQSVAV SATPAAWELN ESGGVFVEQV IRPTGLIDPP VDIRPARTQV DDLVGEVRAT ANAGYRSLIT VLTKRMAEDL TEFLHEQGIR VRYMHSDIDT IERIEIIRDL RLGAFDALVG INLLREGLDI PECALVAILD ADKEGFLRSE TSLIQTIGRA ARNVDGKVIL YADHVTGSMQ RAMDETGRRR EKQIEYNTAH GITPESIKKS IGDILGSVYE RDHVLVEIGD GKGSGFTDDA AVIGHNFGAV LADLETRMRE AAADLNFEEA ARLRDEVKRL RATELAVSDD PTVKQRGVAA KAGSYKGDKQ FGASANLPKL STERGGNNTP RSKVHKPDLD EMGIAGWHEV KKVQRAKPRK PTLDEMGPGT ESKIFQPKNS RESGPEFGPA PRSSGGAPGH RGGWKKR
|
| |