Gene RPB_4146 details

Gene Information

Locus tag: RPB_4146
Symbol:
ID: 3911954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4721591
End bp: 4724644
Gene Length: 3054 bp
Protein Length: 1017 aa
Translation table: 11
GC content: 67%
IMG OID: 637886050
Product: excinuclease ABC subunit B
Protein accession: YP_487749
Protein GI: 86751253
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
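
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, unspliced, and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4721591
end_bp = 4724644
gene_length_bp = 3054
protein_length_aa = 1017


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks pass for this record: 4724644 - 4721591 + 1 = 3054 bp, and 3054 / 3 - 1 = 1017 aa.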


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.919849
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.885523
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGATGA AACCGCGGTC GCGAACCCTT TATTCGGCCC GTGCTCCGCC CCATATTGCC 
GGCATGGCGA AGACCCCCGA CCAATCCGCG AAGCCCACAT CGAAAGCGCC GACATCCAAG
GCGCCGAAAT CCAAGCCGCC GAACTCCAAG GCGCACCGCC CCGACGTGCA ACCGATCGGG
CCGGCGCTGG CCGAGTTGCT CAATCCCGCG ATCAATCGCG GAGATGCCGG CATGGGCTCG
GGCACCGGCC TGCAGCCGCC GCCGGACAAT TCGCGCGACC GCCGCACCGG CGGCGAGGCC
GCGGTGCATC GCGGCCGGGC CTCGACGGCG AAAACAGTCG GCGACGAAGC CGCGCCGCGG
CCGACGCCAT TGCAACCCGC GCCGCAGCCG CCGGGCGCGC GTCGCGGCGG CTTCGACGAA
GCCCCGCAAG CGACCTACGG CACCGCCGCC ACCATCCCGA CGCTGGATCC GGAACTGGCG
CGGCAGCTCG GGCTGCCGAC CGAGGAGGAC GACGCGGCCG CGATGGCGCG GCCGCCGCGC
AACAAGATGG AAGCACTCGG CGTGCAGGCC ACCGCCGAGG CGCTGGAGGC GCTGATCCGC
GACGGCCGGC CGGAATTCAA GGGCGACGAC GGCAACGTCA AGCTGTGGGT GCCGCACCGG
CCGCCGCGGC CGGAGAAATC CGAAGGCGGC GTGCGCTTCG TCATCAAGTC GGACTACGAG
CCGAAGGGCG ACCAGCCGAC CGCCATCAAG GAACTGGTCG AAGGCATCGC GCGCAACGAT
CGGACCCAGG TGCTGCTCGG CGTCACCGGC TCGGGCAAGA CCTACACCAT GGCCAAGGTG
ATCGAGGCGA CGCAGCGCCC GGCGATCATC CTGGCGCCGA ACAAGACGCT GGCGGCGCAG
CTCTACGGCG AGTTCAAGAG CTTCTTCCCC GACAACGCCG TCGAGTACTT CGTCTCGTAT
TACGACTACT ACCAGCCGGA AGCCTACGTC CCGCGCACCG ACACCTATAT CGAGAAGGAC
TCCTCGATCA ACGAGCAGAT CGACCGGATG CGGCATTCGG CGACCCGTGC GCTGCTGGAG
CGCGACGACG TCATCATCGT GGCGTCGGTG TCGTGCATCT ACGGTATCGG CTCGGTCGAG
ACCTACACGG CGATGACCTT CGCGCTGAAG AAGGGCGAGC GGATCGACCA GCGCGCGCTG
ATCGCCGATC TGGTCGCGCT GCAATACAAG CGGACGCAGG CCGACTTCAC CCGCGGCACG
TTTCGCGTGC GCGGTGATGT GATCGACATT TTCCCGGCGC ACTACGAGGA TCGCGCCTGG
CGGGTGAAGA TGTTCGGCGA CGAGGTCGAA GCCATCGAGG AGTTCGACCC GCTCACCGGC
CACAAGCAGG ACGAGCTGGA ATTCGTCAAG ATCTACGCCA ATTCGCACTA TGTGACGCCG
CGGCCGACGC TGATCCAGGC GATCAAGTCG ATCAAGTCCG AACTGAAATG GCGGCTCGAC
CAATTGCACG CGCAGGGACG CCTCTTGGAG GCGCAGCGGC TGGAGCAGCG CACCACCTTC
GACATCGAGA TGATGGAAGC GACCGGCAGC TGCGCCGGCA TCGAGAACTA CTCACGCTAC
CTCACCGGCC GCCGCCCCGG CGAGCCGCCG CCGACGCTGT TCGAATACGT GCCCGACAAC
GCGCTGGTGT TCGCCGACGA GAGCCACGTC ACCGTGCCGC AGATCGGCGG CATGTTCAAA
GGCGACTTCC GGCGCAAGGC GACGCTGGCC GAATACGGCT TCCGGCTGCC GTCCTGCATG
GACAACCGGC CGCTGCGCTT CGAGGAATGG GACATGATGC GGCCGCAGAG CGTGGCCGTG
TCGGCGACGC CGGCGGCGTG GGAGCTGAAC GAGAGCGGCG GCGTGTTCGT CGAACAAGTC
ATCCGCCCGA CCGGGCTGAT CGACCCGCCG GTCGACATCC GCCCGGCGCG CACGCAAGTG
GACGATCTCG TCGGCGAAGT CCGCGCCACC GCCAATGCCG GCTATCGCTC GCTGATCACC
GTGCTGACCA AGCGGATGGC CGAGGACCTC ACCGAGTTCC TGCACGAGCA GGGCATTCGT
GTGCGCTACA TGCATTCGGA CATCGACACC ATCGAGCGCA TCGAGATCAT CCGCGACCTG
CGGCTCGGCG CGTTCGACGC GCTGGTCGGC ATCAATCTGC TGCGCGAAGG CCTCGACATT
CCGGAATGCG CGCTGGTGGC GATCCTCGAC GCGGACAAAG AGGGATTCCT GCGCAGCGAG
ACCTCGCTGA TCCAGACCAT CGGCCGCGCC GCGCGCAACG TCGACGGCAA GGTGATCCTC
TACGCCGATC ACGTCACCGG TTCGATGCAG CGCGCGATGG ACGAGACCGG CCGCCGCCGC
GAGAAGCAGA TCGAATACAA CACCGCGCAC GGCATCACGC CGGAGAGCAT CAAGAAGTCG
ATCGGCGACA TCCTGGGCTC GGTGTACGAG CGCGACCACG TGCTGGTCGA GATCGGCGAT
GGCAAGGGCT CGGGCTTCAC CGACGACGCG GCGGTGATCG GGCACAATTT CGGCGCGGTG
CTGGCCGACC TCGAAACGCG GATGCGCGAG GCCGCGGCCG ACCTGAACTT CGAGGAAGCC
GCGCGACTGC GCGACGAAGT CAAACGCCTG CGCGCGACCG AACTGGCGGT GAGCGACGAT
CCCACGGTGA AGCAGCGCGG CGTCGCGGCG AAGGCCGGCA GCTACAAGGG CGACAAACAG
TTCGGCGCTT CGGCCAATCT GCCGAAACTC TCGACCGAAC GCGGCGGCAA CAACACCCCG
CGCAGCAAGG TGCACAAGCC CGACCTCGAC GAAATGGGCA TCGCCGGCTG GCACGAAGTC
AAGAAAGTGC AACGCGCCAA GCCGCGCAAG CCGACGCTCG ACGAGATGGG CCCGGGGACG
GAGAGCAAGA TCTTCCAGCC GAAGAATTCA CGCGAGTCCG GCCCGGAATT CGGCCCGGCG
CCGCGGAGCA GTGGCGGCGC GCCGGGGCAT CGGGGCGGGT GGAAGAAGAG GTAG
 
Protein sequence
MRMKPRSRTL YSARAPPHIA GMAKTPDQSA KPTSKAPTSK APKSKPPNSK AHRPDVQPIG 
PALAELLNPA INRGDAGMGS GTGLQPPPDN SRDRRTGGEA AVHRGRASTA KTVGDEAAPR
PTPLQPAPQP PGARRGGFDE APQATYGTAA TIPTLDPELA RQLGLPTEED DAAAMARPPR
NKMEALGVQA TAEALEALIR DGRPEFKGDD GNVKLWVPHR PPRPEKSEGG VRFVIKSDYE
PKGDQPTAIK ELVEGIARND RTQVLLGVTG SGKTYTMAKV IEATQRPAII LAPNKTLAAQ
LYGEFKSFFP DNAVEYFVSY YDYYQPEAYV PRTDTYIEKD SSINEQIDRM RHSATRALLE
RDDVIIVASV SCIYGIGSVE TYTAMTFALK KGERIDQRAL IADLVALQYK RTQADFTRGT
FRVRGDVIDI FPAHYEDRAW RVKMFGDEVE AIEEFDPLTG HKQDELEFVK IYANSHYVTP
RPTLIQAIKS IKSELKWRLD QLHAQGRLLE AQRLEQRTTF DIEMMEATGS CAGIENYSRY
LTGRRPGEPP PTLFEYVPDN ALVFADESHV TVPQIGGMFK GDFRRKATLA EYGFRLPSCM
DNRPLRFEEW DMMRPQSVAV SATPAAWELN ESGGVFVEQV IRPTGLIDPP VDIRPARTQV
DDLVGEVRAT ANAGYRSLIT VLTKRMAEDL TEFLHEQGIR VRYMHSDIDT IERIEIIRDL
RLGAFDALVG INLLREGLDI PECALVAILD ADKEGFLRSE TSLIQTIGRA ARNVDGKVIL
YADHVTGSMQ RAMDETGRRR EKQIEYNTAH GITPESIKKS IGDILGSVYE RDHVLVEIGD
GKGSGFTDDA AVIGHNFGAV LADLETRMRE AAADLNFEEA ARLRDEVKRL RATELAVSDD
PTVKQRGVAA KAGSYKGDKQ FGASANLPKL STERGGNNTP RSKVHKPDLD EMGIAGWHEV
KKVQRAKPRK PTLDEMGPGT ESKIFQPKNS RESGPEFGPA PRSSGGAPGH RGGWKKR
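
The sequences above are printed in space-separated blocks of 10 residues, so they need cleaning before any downstream use. The following minimal sketch shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces and newlines) and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(1 for base in dna if base in "GC") / len(dna)


# First line of the gene sequence, copied as printed in this record.
first_line = "ATGCGGATGA AACCGCGGTC GCGAACCCTT TATTCGGCCC GTGCTCCGCC CCATATTGCC"

dna = clean_sequence(first_line)
print(len(dna))           # 6 blocks of 10 bases
print(gc_fraction(dna))   # GC fraction of this first line only
```

Passing the full gene sequence block through `clean_sequence` should give a string whose length matches the Gene Length field, and `gc_fraction` on that string can be compared with the GC content field.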