Gene RPD_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3987 
Symbol 
ID4024504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4435092 
End bp4438235 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content66% 
IMG OID637964190 
Productexcinuclease ABC subunit B 
Protein accessionYP_571107 
Protein GI91978448 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.584236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA CTCCCGACAA ACCCTCACAG CCGAAATCGA AAGCGCCGAA ATCCAAGGCG 
CCGAATTCCA AAGCCCACCG GCCCGACGTC AAACCGATCG GGCCGGCGCT GGCGGAACTG
CTCAATCCCG CGATCAATCG CGGCGACGCC GGCATGGGCT CGGGCACCGG GCTGCAGCCG
CCGCCGGACA ATTCGCGCGA CCGCCGCACC GGCGGCGAAG CCGCGATGCA TCGCGGCCGG
GCCTCGACGC CCAAGGCTTT CGGCGACGAG GCCGCGCCGC GCGCGATGCC GCTGCGGCCG
AATCCGCAGC CGGTCGGCGG GCGATCGTCC GCATCGCAAG TGCCCGCACC CAACCCTCCC
CCGCAAGCGG GAGAGGGCTC CGCGCGGCAG GCGCCGGGGG AACACGTCAG CGATCATTTC
GCGGAGGGGC AACCATCGCC GCCTCTCCCG CTTGCGGGGG ATGGTCGGGG TAGGGGCGCC
GCGGGCAATG ACACGCGAGG CTTCGACGAA GCCCCGCAAG CCACCTACGG CACCGCCGCC
ACCATCCCGA CGCTCGATCC CGAGCTGGCG CGGCAACTCG GGCTGCCGAC CGAGGAAGAC
GACGAGGCCG CGATGGCGCG GCCGCCGCGC AACAAGATGG AGGCGCTCGG CGTGCAGGCC
ACCGCCGAGG CGCTGGAGAA TCTGATCCGC GAGGGCCGGC CGGAATTCAA GGGCGACGAT
GGCGGCGTCA AGCTGTGGGT GCCGCATCGC CCGCCGCGAC CGGAGAAATC CGAAGGCGGC
GTCCGCTTCG TCATCAAGTC GGAATACGAG CCGAAGGGCG ACCAGCCGAC CGCGATCAAG
GAACTGGTCG AAGGCATCGA CCGCAATGAC CGAACGCAGG TGCTGCTCGG CGTCACCGGC
TCGGGCAAGA CCTACACCAT GGCCAAGGTG ATCGAGGCGA CGCAGCGGCC GGCGATCATC
CTGGCGCCGA ACAAGACGCT GGCGGCGCAG CTCTACGGCG AGTTCAAGAG CTTCTTCCCG
GACAACGCGG TCGAGTATTT CGTCTCGTAT TACGACTACT ATCAGCCGGA AGCCTACGTT
CCGCGCACCG ACACCTATAT CGAGAAGGAC TCCTCGATCA ACGAGCAGAT CGACCGGATG
CGGCATTCGG CGACGCGCGC CCTCTTGGAG CGCGACGACG TCATCATCGT TGCGTCAGTG
TCGTGCATCT ACGGTATCGG CTCGGTCGAG ACCTATACGG CGATGACCTT CGCGCTGAAG
AAGGGCGAGC GGATCGACCA GCGCCAGTTG ATCGCCGATC TGGTGGCGCT GCAATACAAG
CGGACGCAGG CCGACTTCAC CCGCGGCACC TTCCGGGTGC GCGGCGACGT CATCGACATC
TTCCCGGCGC ACTACGAGGA CCGCGCCTGG CGCGTCGGCC TGTTCGGCGA CACGGTCGAG
ACCATCGAGG AATTCGACCC GCTCACCGGG CACAAGCAGG ACGAGCTGGA ATTCGTCAAG
ATCTACGCCA ATTCGCATTA CGTGACGCCG CGGCCGACGC TGATCCAGGC GATCAAGTCG
ATCAAATCCG AGCTGAAATG GCGGCTCGAT CAGTTGCACG CGCAGGGCCG CCTCTTGGAA
GCGCAGCGGC TGGAGCAACG CACCACTTTC GACATCGAGA TGATGGAAGC GACCGGCTCT
TGCGCCGGCA TCGAGAACTA CTCGCGCTAT CTCACCGGCC GCCGCCCCGG CGAGCCGCCG
CCGACGCTGT TCGAATACGT GCCCGACAAC GCGCTGGTGT TCGCCGACGA AAGCCACGTC
ACCGTGCCGC AGATCGGCGG CATGTTCAAA GGCGACTTCC GGCGCAAGGC GACGCTGGCC
GAATACGGCT TCCGGCTACC GTCCTGCATG GACAATCGGC CGCTGCGCTT TGAAGAATGG
GACATGATGC GCCCGCAATC GGTCGCGGTG TCGGCGACCC CGGCGGCGTG GGAGCTGAAC
GAAAGCGGCG GCGTGTTCGT CGAGCAGGTG ATCCGCCCGA CCGGGCTGAT CGACCCGCCG
GTCGACATCC GCCCGGCGCG CACCCAGGTC GACGATCTGG TCGGCGAAGT CCGCGCCACC
GCGCAGGCCG GCTATCGCTC GCTGATCACC GTGCTGACCA AGCGGATGGC GGAGGACCTC
ACGGAGTTTC TGCACGAGCA GGGAATCCGC GTACGCTACA TGCATTCCGA CATCGACACC
ATCGAGCGCA TCGAGATCAT CCGCGATCTG CGGCTCGGCG CGTTCGACGC GCTGGTCGGC
ATCAATCTGT TGCGCGAGGG CCTCGACATT CCGGAATGCG CGCTGGTGGC GATCCTCGAC
GCCGACAAGG AAGGCTTTTT GCGCAGCGAG ACGTCACTGA TCCAAACGAT CGGCCGCGCC
GCGCGAAACG TCGACGGCAA GGTGATCCTC TATGCCGATC ACGTCACCGG CTCGATGCAG
CGGGCGATGG ACGAGACCGG TCGCCGTCGT GAGAAGCAGA TCGAATACAA CACCGCGCAC
GGCATCACGC CGGAGAGCAT CAAGAAATCG ATCGGCGATA TTCTGGGCTC GGTTTACGAG
CGCGACCATG TGCTGGTGGA GATCGGCGAC GGCAAGGGCT CGGGCTTCAC CGACGACGCC
GCGGTGATCG GGCACAATTT CGGCGCGGTG CTGGCCGACC TCGAAACCAG GATGCGCGAG
GCGGCGGCCG ATCTGAACTT CGAGGAAGCC GCAAGGCTGC GTGACGAAGT CAAACGCCTG
CGCGCCACCG AACTCGCGGT GATCGACGAC CCCACCGTCA AGCAACGCGG CGTCGCGGCG
AAAGCCGGGA GCTACAAGGG CGACAAACAA TTCGGCGCCT CGGCCAATCT GCCGAAGCTG
TCGACCGAAC GCGGCGGCAA CAACACCCCG CGCAGCAAGG TGCACAAACC CGATCTCGAC
GAAATGGGCA TCGCCGGCTG GCACGAGATC AAGAAGGTGC AACGGCCCAA GCCGCGCAAA
CCGACGCTCG ACGAGATGGG CCCGGGTGCG GAGAGCAAGA TCTATCAGCC GACCAACAGC
CGCGAGTCCG GGCCGGAATT CGGTCCCGCG CCGCGCAGCA GCGGCGGCGC GCCGGGGCAT
CGGGGCGGGT GGAAGAAGAG GTAG
 
Protein sequence
MAKTPDKPSQ PKSKAPKSKA PNSKAHRPDV KPIGPALAEL LNPAINRGDA GMGSGTGLQP 
PPDNSRDRRT GGEAAMHRGR ASTPKAFGDE AAPRAMPLRP NPQPVGGRSS ASQVPAPNPP
PQAGEGSARQ APGEHVSDHF AEGQPSPPLP LAGDGRGRGA AGNDTRGFDE APQATYGTAA
TIPTLDPELA RQLGLPTEED DEAAMARPPR NKMEALGVQA TAEALENLIR EGRPEFKGDD
GGVKLWVPHR PPRPEKSEGG VRFVIKSEYE PKGDQPTAIK ELVEGIDRND RTQVLLGVTG
SGKTYTMAKV IEATQRPAII LAPNKTLAAQ LYGEFKSFFP DNAVEYFVSY YDYYQPEAYV
PRTDTYIEKD SSINEQIDRM RHSATRALLE RDDVIIVASV SCIYGIGSVE TYTAMTFALK
KGERIDQRQL IADLVALQYK RTQADFTRGT FRVRGDVIDI FPAHYEDRAW RVGLFGDTVE
TIEEFDPLTG HKQDELEFVK IYANSHYVTP RPTLIQAIKS IKSELKWRLD QLHAQGRLLE
AQRLEQRTTF DIEMMEATGS CAGIENYSRY LTGRRPGEPP PTLFEYVPDN ALVFADESHV
TVPQIGGMFK GDFRRKATLA EYGFRLPSCM DNRPLRFEEW DMMRPQSVAV SATPAAWELN
ESGGVFVEQV IRPTGLIDPP VDIRPARTQV DDLVGEVRAT AQAGYRSLIT VLTKRMAEDL
TEFLHEQGIR VRYMHSDIDT IERIEIIRDL RLGAFDALVG INLLREGLDI PECALVAILD
ADKEGFLRSE TSLIQTIGRA ARNVDGKVIL YADHVTGSMQ RAMDETGRRR EKQIEYNTAH
GITPESIKKS IGDILGSVYE RDHVLVEIGD GKGSGFTDDA AVIGHNFGAV LADLETRMRE
AAADLNFEEA ARLRDEVKRL RATELAVIDD PTVKQRGVAA KAGSYKGDKQ FGASANLPKL
STERGGNNTP RSKVHKPDLD EMGIAGWHEI KKVQRPKPRK PTLDEMGPGA ESKIYQPTNS
RESGPEFGPA PRSSGGAPGH RGGWKKR