Gene RPC_4137 details


Gene Information

Locus tag: RPC_4137
Symbol:
ID: 3970330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4597003
End bp: 4600164
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 66%
IMG OID: 637927241
Product: excinuclease ABC subunit B
Protein accession: YP_533982
Protein GI: 90425612
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
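
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4597003
end_bp = 4600164
gene_length_bp = 3162
protein_length_aa = 1053

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)
```

Under those assumptions the span equals the listed gene length, and one codon fewer than the total gives the listed protein length.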


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGA CTCCCGACAA ACCCAACAAG CCGATCAAGA CCCCGAAATC CAAGGCGCAT 
CGGCCCGACG TGAAGCCGAT CGGGCCGGCG CTGGCGGAAC TGCTCAATCC CGCGATCAAT
CGCGGCGACG CCGGCATGGG TTCCGGCACC GGGCTACAGC CGCCGCCGGA CAATTCCCGC
GACCGCCGCA CCGGCGGCGA GGCCGCGATC CATCGCGCGC GAGCGTCGAC GACACAAGTG
CCCCCACCCA ACCCTCCCCC GCAAGCGGGA GAGGGCTCCG CGCGGCGAGC GCCGCACACC
GTCGCTCGCG TTGAATCCGT GGAGGGGCAG CCATCGCCCC CTCTCCCGCT TGCGGGGGAG
GGCCGGGGTG GGGGCGCCAC GCGCACGACC GCGCCCGGCT TCGCCGAAGC CCCGCAATCC
GACTTCGCCA CGCCGAATTA CGGCACCACA GCCACCATCC CGACGCTCGA TCCGGAACTC
GCCAAGCAGC TCGGCTTCAC CACCGAGGAA GAGGATGAAG CCGCCCTCGC CCGCCCGCCG
CGCAACAAGA TGGAAGCGCT CGGCGTGCAG GCCACCGCCG ACGCGCTGGA GAACCTGATC
CGCGACGGCC GCCCGGAATT CAAGGGCGAG GACGGCAACG TCAAGCTATG GACGCCGCAT
CGCCCGCCGC GCCCGGAGAA GACCGAAGGC GGCGTGCGCT TCGTCATCAA ATCCGAATAT
GAACCGAAGG GCGACCAGCC CACCGCGATC AAGGAACTGG TCGAAGGCAT TTCGCGCAAC
GATCGCACCC AGGTGCTGCT CGGCGTCACC GGCTCCGGCA AGACCTACAC CATGGCCAAG
GTGATCGAGG CGACGCAGCG CCCGGCGATC ATTCTGGCGC CGAACAAGAC GCTGGCGGCG
CAGCTCTATG GCGAGTTCAA GAGTTTTTTC CCCGACAACG CCGTCGAGTA CTTTGTCAGC
TATTACGATT ACTACCAGCC CGAGGCCTAC GTCCCCCGCA CCGACACCTA CATCGAAAAA
GATTCATCGA TCAACGAACA GATCGACCGG ATGCGCCACG CCGCCACCCG CGCGCTGCTG
GAGCGCGACG ACGTCATCAT CGTCGCCTCG GTGAGCTGCA TCTACGGTAT CGGCTCGGTC
GAGACCTATA CCGCGATGAC CTTTGCGCTG AAGAAGGGCG AGCGGATCGA CCAGCGCGCG
CTGATCGCCG ATCTGGTGGC GCTGCAATAC AAGCGCACCC AGGCCGATTT TACTCGCGGC
ACCTTCCGGG TGCGCGGTGA CGTCATCGAC ATCTTCCCGG CGCACTATGA AGACCGCGCC
TGGCGCGTCA ATCTGTTCGG CGACACCGTC GAGACCATCG AGGAATTCGA CCCGCTCACC
GGCCACAAGC AGGACGAGCT GGAATTCATC AAGATCTACG CCAATTCGCA TTACGTCACG
CCGCGGCCGA CGCTGTTGCA GGCGATCAAA TCGATCAAGG CCGAGTTGAA ATGGCGGCTC
GATCAGCTCA ACGATCAGGG CCGATTGCTG GAGGCGCAGC GGCTCGAGCA GCGCACCACC
TTCGACATCG AGATGATGGA GGCCACCGGA AGCTGCGCCG GCATCGAGAA CTACTCGCGC
TATCTCACCG GCCGCCGCCC CGGCGAGCCG CCGCCGACGC TGTTCGAATA CGTCCCCGAC
AACGCGCTGG TGTTCGCCGA CGAGAGCCAC GTCACCGTGC CGCAGATCGG CGGCATGTTC
AAAGGCGATT TTCGACGCAA GGCGACGCTG GCCGAGTATG GATTCCGCTT GCCGTCCTGC
ATGGACAACC GGCCGCTGCG GTTCGAGGAA TGGGACATGA TGCGCCCGCA ATCGGTCGCG
GTGTCGGCGA CGCCGTCGGC GTGGGAGCTG AACGAGAGCG GCGGCGTGTT CGTCGAACAG
GTCATCCGCC CGACCGGGCT GATCGACCCG CCGGTCAACA TCCGTCCGGC CCGCACCCAG
GTCGACGACC TGGTCGGCGA GGTCCGCGCC ACCGCGCAGG CCGGCTATCG CTCGCTGATC
ACCGTCTTGA CCAAGCGGAT GGCCGAAGAC CTCACCGAAT ATCTGCACGA ACAGGGCATT
CGGGTGCGCT ATATGCACAG CGACATCGAC ACCATCGAAC GCATCGAGAT CATCCGCGAT
TTACGCTTGG GGGCTTTTGA CGCCCTGGTC GGCATCAACC TGTTGCGCGA AGGACTCGAC
ATTCCGGAAT GCGCGCTGGT GGCGATCCTC GACGCCGACA AGGAAGGCTT TCTGCGCAGC
GAGACCTCCC TGATCCAGAC CATCGGCCGC GCCGCGCGCA ACGTCGACGG CAAGGTGATC
CTCTACGCCG ACCGCATCAC CGGATCGATG GAGCGCGCCA TCGCCGAGAC CGACCGCCGC
CGCGAGAAGC AGGTCGAGTA CAACACCGAA CACAACATCA CCCCGGAGAG CATCAAGAAA
TCGATCGGCG ACATTCTCAA CAGCGTCTAT GAGCGCGACC ACGTGCTGGT CGAAATCGGC
GGCGGCGGCC AAGGCGGCAG CTGGTCCGAC GACGTCGGCG CGATCGGGCA TAATTTCGAG
ACGGTGCTGG CCGATCTCGA AACCAGGATG CGCGAGGCCG CGGCCGACCT GAACTTCGAA
GAGGCCGCGC GGCTGCGCGA CGAGGTCAAG CGGCTGCGCG CCACCGAATT GGCGGTGGTC
GACGATCCCA CCGCCAAGCA ACGCACCGTG CAGGGCAAGG CGGGGTCCTA CGCCGGCGCC
AAGAAATACG GCGCTGCTGC CAACCTGCCG CAGCAATCCA AGGAACGCGG CGGCAACAAC
ACGCCGAAGG TGCGGGGGGC GACGGGCGCC GCCACGCAAC GCGACGGTTC TTCTCCCTCC
CCCCTTGCGG GGGAGGGTCG GGGTGGGGGG TCTAAAGCGG CAGCCTCTCG CATCCACAAG
CCCGATCTCG ACGAGATGGG TATCGCCAGC TGGCACGAGG TCATGCCGGA TCGCAAAGGC
CGCGCCAAGC CGCGCAAACC GACGCTCGAC GAGATGGGCC CCGGCACCGA GAGCAGGATT
TTTCAGCCGA AGACGTCGCG CGAATCCGGC CCGGAATTCG GCCCCTCGCC GCGCTCATCC
GGCGGCGCGC CGGGGCACAG GGGAGGATGG AAGAAGAGGT AG
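
The gene sequence is displayed in blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, the GC percentage reported in the gene information could be recomputed as follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may round or compute its value differently.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used here as a short example input.
first_line = "ATGGCGAAGA CTCCCGACAA ACCCAACAAG CCGATCAAGA CCCCGAAATC CAAGGCGCAT"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` gives a value to compare against the listed GC content.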
 
Protein sequence
MAKTPDKPNK PIKTPKSKAH RPDVKPIGPA LAELLNPAIN RGDAGMGSGT GLQPPPDNSR 
DRRTGGEAAI HRARASTTQV PPPNPPPQAG EGSARRAPHT VARVESVEGQ PSPPLPLAGE
GRGGGATRTT APGFAEAPQS DFATPNYGTT ATIPTLDPEL AKQLGFTTEE EDEAALARPP
RNKMEALGVQ ATADALENLI RDGRPEFKGE DGNVKLWTPH RPPRPEKTEG GVRFVIKSEY
EPKGDQPTAI KELVEGISRN DRTQVLLGVT GSGKTYTMAK VIEATQRPAI ILAPNKTLAA
QLYGEFKSFF PDNAVEYFVS YYDYYQPEAY VPRTDTYIEK DSSINEQIDR MRHAATRALL
ERDDVIIVAS VSCIYGIGSV ETYTAMTFAL KKGERIDQRA LIADLVALQY KRTQADFTRG
TFRVRGDVID IFPAHYEDRA WRVNLFGDTV ETIEEFDPLT GHKQDELEFI KIYANSHYVT
PRPTLLQAIK SIKAELKWRL DQLNDQGRLL EAQRLEQRTT FDIEMMEATG SCAGIENYSR
YLTGRRPGEP PPTLFEYVPD NALVFADESH VTVPQIGGMF KGDFRRKATL AEYGFRLPSC
MDNRPLRFEE WDMMRPQSVA VSATPSAWEL NESGGVFVEQ VIRPTGLIDP PVNIRPARTQ
VDDLVGEVRA TAQAGYRSLI TVLTKRMAED LTEYLHEQGI RVRYMHSDID TIERIEIIRD
LRLGAFDALV GINLLREGLD IPECALVAIL DADKEGFLRS ETSLIQTIGR AARNVDGKVI
LYADRITGSM ERAIAETDRR REKQVEYNTE HNITPESIKK SIGDILNSVY ERDHVLVEIG
GGGQGGSWSD DVGAIGHNFE TVLADLETRM REAAADLNFE EAARLRDEVK RLRATELAVV
DDPTAKQRTV QGKAGSYAGA KKYGAAANLP QQSKERGGNN TPKVRGATGA ATQRDGSSPS
PLAGEGRGGG SKAAASRIHK PDLDEMGIAS WHEVMPDRKG RAKPRKPTLD EMGPGTESRI
FQPKTSRESG PEFGPSPRSS GGAPGHRGGW KKR
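
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal, dependency-free illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. Since this gene begins with ATG, no special start handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the usual TCAG ordering of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGCGAAGA CTCCCGACAA ACCCAACAAG"))  # prints MAKTPDKPNK
```

The ten residues printed match the first block of the protein sequence. The final twelve bases of the gene (`AAGAAGAGGTAG`) translate to `KKR` followed by a stop codon, which matches the end of the protein sequence.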