Gene Information |
Locus tag | RPD_3987 |
Symbol | |
ID | 4024504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4435092 |
End bp | 4438235 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637964190 |
Product | excinuclease ABC subunit B |
Protein accession | YP_571107 |
Protein GI | 91978448 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
|
|
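The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(4435092, 4438235)
print(length_bp)                  # 3144
print(protein_length(length_bp))  # 1047
```

Both results match the listed Gene Length (3144 bp) and Protein Length (1047 aa).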
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.584236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGA CTCCCGACAA ACCCTCACAG CCGAAATCGA AAGCGCCGAA ATCCAAGGCG CCGAATTCCA AAGCCCACCG GCCCGACGTC AAACCGATCG GGCCGGCGCT GGCGGAACTG CTCAATCCCG CGATCAATCG CGGCGACGCC GGCATGGGCT CGGGCACCGG GCTGCAGCCG CCGCCGGACA ATTCGCGCGA CCGCCGCACC GGCGGCGAAG CCGCGATGCA TCGCGGCCGG GCCTCGACGC CCAAGGCTTT CGGCGACGAG GCCGCGCCGC GCGCGATGCC GCTGCGGCCG AATCCGCAGC CGGTCGGCGG GCGATCGTCC GCATCGCAAG TGCCCGCACC CAACCCTCCC CCGCAAGCGG GAGAGGGCTC CGCGCGGCAG GCGCCGGGGG AACACGTCAG CGATCATTTC GCGGAGGGGC AACCATCGCC GCCTCTCCCG CTTGCGGGGG ATGGTCGGGG TAGGGGCGCC GCGGGCAATG ACACGCGAGG CTTCGACGAA GCCCCGCAAG CCACCTACGG CACCGCCGCC ACCATCCCGA CGCTCGATCC CGAGCTGGCG CGGCAACTCG GGCTGCCGAC CGAGGAAGAC GACGAGGCCG CGATGGCGCG GCCGCCGCGC AACAAGATGG AGGCGCTCGG CGTGCAGGCC ACCGCCGAGG CGCTGGAGAA TCTGATCCGC GAGGGCCGGC CGGAATTCAA GGGCGACGAT GGCGGCGTCA AGCTGTGGGT GCCGCATCGC CCGCCGCGAC CGGAGAAATC CGAAGGCGGC GTCCGCTTCG TCATCAAGTC GGAATACGAG CCGAAGGGCG ACCAGCCGAC CGCGATCAAG GAACTGGTCG AAGGCATCGA CCGCAATGAC CGAACGCAGG TGCTGCTCGG CGTCACCGGC TCGGGCAAGA CCTACACCAT GGCCAAGGTG ATCGAGGCGA CGCAGCGGCC GGCGATCATC CTGGCGCCGA ACAAGACGCT GGCGGCGCAG CTCTACGGCG AGTTCAAGAG CTTCTTCCCG GACAACGCGG TCGAGTATTT CGTCTCGTAT TACGACTACT ATCAGCCGGA AGCCTACGTT CCGCGCACCG ACACCTATAT CGAGAAGGAC TCCTCGATCA ACGAGCAGAT CGACCGGATG CGGCATTCGG CGACGCGCGC CCTCTTGGAG CGCGACGACG TCATCATCGT TGCGTCAGTG TCGTGCATCT ACGGTATCGG CTCGGTCGAG ACCTATACGG CGATGACCTT CGCGCTGAAG AAGGGCGAGC GGATCGACCA GCGCCAGTTG ATCGCCGATC TGGTGGCGCT GCAATACAAG CGGACGCAGG CCGACTTCAC CCGCGGCACC TTCCGGGTGC GCGGCGACGT CATCGACATC TTCCCGGCGC ACTACGAGGA CCGCGCCTGG CGCGTCGGCC TGTTCGGCGA CACGGTCGAG ACCATCGAGG AATTCGACCC GCTCACCGGG CACAAGCAGG ACGAGCTGGA ATTCGTCAAG ATCTACGCCA ATTCGCATTA CGTGACGCCG CGGCCGACGC TGATCCAGGC GATCAAGTCG ATCAAATCCG AGCTGAAATG GCGGCTCGAT CAGTTGCACG CGCAGGGCCG CCTCTTGGAA GCGCAGCGGC TGGAGCAACG CACCACTTTC GACATCGAGA TGATGGAAGC GACCGGCTCT TGCGCCGGCA TCGAGAACTA CTCGCGCTAT CTCACCGGCC GCCGCCCCGG CGAGCCGCCG CCGACGCTGT TCGAATACGT GCCCGACAAC GCGCTGGTGT TCGCCGACGA AAGCCACGTC 
ACCGTGCCGC AGATCGGCGG CATGTTCAAA GGCGACTTCC GGCGCAAGGC GACGCTGGCC GAATACGGCT TCCGGCTACC GTCCTGCATG GACAATCGGC CGCTGCGCTT TGAAGAATGG GACATGATGC GCCCGCAATC GGTCGCGGTG TCGGCGACCC CGGCGGCGTG GGAGCTGAAC GAAAGCGGCG GCGTGTTCGT CGAGCAGGTG ATCCGCCCGA CCGGGCTGAT CGACCCGCCG GTCGACATCC GCCCGGCGCG CACCCAGGTC GACGATCTGG TCGGCGAAGT CCGCGCCACC GCGCAGGCCG GCTATCGCTC GCTGATCACC GTGCTGACCA AGCGGATGGC GGAGGACCTC ACGGAGTTTC TGCACGAGCA GGGAATCCGC GTACGCTACA TGCATTCCGA CATCGACACC ATCGAGCGCA TCGAGATCAT CCGCGATCTG CGGCTCGGCG CGTTCGACGC GCTGGTCGGC ATCAATCTGT TGCGCGAGGG CCTCGACATT CCGGAATGCG CGCTGGTGGC GATCCTCGAC GCCGACAAGG AAGGCTTTTT GCGCAGCGAG ACGTCACTGA TCCAAACGAT CGGCCGCGCC GCGCGAAACG TCGACGGCAA GGTGATCCTC TATGCCGATC ACGTCACCGG CTCGATGCAG CGGGCGATGG ACGAGACCGG TCGCCGTCGT GAGAAGCAGA TCGAATACAA CACCGCGCAC GGCATCACGC CGGAGAGCAT CAAGAAATCG ATCGGCGATA TTCTGGGCTC GGTTTACGAG CGCGACCATG TGCTGGTGGA GATCGGCGAC GGCAAGGGCT CGGGCTTCAC CGACGACGCC GCGGTGATCG GGCACAATTT CGGCGCGGTG CTGGCCGACC TCGAAACCAG GATGCGCGAG GCGGCGGCCG ATCTGAACTT CGAGGAAGCC GCAAGGCTGC GTGACGAAGT CAAACGCCTG CGCGCCACCG AACTCGCGGT GATCGACGAC CCCACCGTCA AGCAACGCGG CGTCGCGGCG AAAGCCGGGA GCTACAAGGG CGACAAACAA TTCGGCGCCT CGGCCAATCT GCCGAAGCTG TCGACCGAAC GCGGCGGCAA CAACACCCCG CGCAGCAAGG TGCACAAACC CGATCTCGAC GAAATGGGCA TCGCCGGCTG GCACGAGATC AAGAAGGTGC AACGGCCCAA GCCGCGCAAA CCGACGCTCG ACGAGATGGG CCCGGGTGCG GAGAGCAAGA TCTATCAGCC GACCAACAGC CGCGAGTCCG GGCCGGAATT CGGTCCCGCG CCGCGCAGCA GCGGCGGCGC GCCGGGGCAT CGGGGCGGGT GGAAGAAGAG GTAG
|
Protein sequence | MAKTPDKPSQ PKSKAPKSKA PNSKAHRPDV KPIGPALAEL LNPAINRGDA GMGSGTGLQP PPDNSRDRRT GGEAAMHRGR ASTPKAFGDE AAPRAMPLRP NPQPVGGRSS ASQVPAPNPP PQAGEGSARQ APGEHVSDHF AEGQPSPPLP LAGDGRGRGA AGNDTRGFDE APQATYGTAA TIPTLDPELA RQLGLPTEED DEAAMARPPR NKMEALGVQA TAEALENLIR EGRPEFKGDD GGVKLWVPHR PPRPEKSEGG VRFVIKSEYE PKGDQPTAIK ELVEGIDRND RTQVLLGVTG SGKTYTMAKV IEATQRPAII LAPNKTLAAQ LYGEFKSFFP DNAVEYFVSY YDYYQPEAYV PRTDTYIEKD SSINEQIDRM RHSATRALLE RDDVIIVASV SCIYGIGSVE TYTAMTFALK KGERIDQRQL IADLVALQYK RTQADFTRGT FRVRGDVIDI FPAHYEDRAW RVGLFGDTVE TIEEFDPLTG HKQDELEFVK IYANSHYVTP RPTLIQAIKS IKSELKWRLD QLHAQGRLLE AQRLEQRTTF DIEMMEATGS CAGIENYSRY LTGRRPGEPP PTLFEYVPDN ALVFADESHV TVPQIGGMFK GDFRRKATLA EYGFRLPSCM DNRPLRFEEW DMMRPQSVAV SATPAAWELN ESGGVFVEQV IRPTGLIDPP VDIRPARTQV DDLVGEVRAT AQAGYRSLIT VLTKRMAEDL TEFLHEQGIR VRYMHSDIDT IERIEIIRDL RLGAFDALVG INLLREGLDI PECALVAILD ADKEGFLRSE TSLIQTIGRA ARNVDGKVIL YADHVTGSMQ RAMDETGRRR EKQIEYNTAH GITPESIKKS IGDILGSVYE RDHVLVEIGD GKGSGFTDDA AVIGHNFGAV LADLETRMRE AAADLNFEEA ARLRDEVKRL RATELAVIDD PTVKQRGVAA KAGSYKGDKQ FGASANLPKL STERGGNNTP RSKVHKPDLD EMGIAGWHEI KKVQRPKPRK PTLDEMGPGA ESKIYQPTNS RESGPEFGPA PRSSGGAPGH RGGWKKR
|
| |
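The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be normalised, its GC content computed, and its translation checked against the listed protein. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code and does not handle alternative start codons or ambiguous bases. All helper names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
# Assumption: table 11 shares these sense-codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGGCGAAGA CTCCCGACAA ACCCTCACAG"
print(translate(first_blocks))  # MAKTPDKPSQ
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same kind of comparison against the listed GC content and protein sequence.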