Gene RPD_2758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2758 
SymboluvrA 
ID4023256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3077117 
End bp3080143 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content65% 
IMG OID637962956 
Productexcinuclease ABC subunit A 
Protein accessionYP_569887 
Protein GI91977228 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAAGG CGAAGCGCCA GCAGTCCCAA CAACAGGCCG TGGCCTCGAG CCGGACGGCC 
ATCACCATTC GCGGCGCTCG CGAGCACAAT CTGAAGAACG TCGACGTCGT CATCCCGCGC
GACAAGCTGG CGGTGTTCAC CGGCCTGTCC GGCTCCGGCA AGTCGTCATT GGCGTTCGAC
ACCATCTACG CCGAGGGCCA GCGCCGCTAT GTCGAATCGC TGTCGGCCTA TGCCCGGCAG
TTCCTCGAGA TGATGCAGAA GCCGGACGTC GACCAGATCG ACGGGCTGTC GCCGGCGATC
TCGATCGAGC AGAAGACCAC CTCGAAGAAC CCGCGCTCGA CCGTCGGCAC CGTCACCGAG
ATCTACGACT ACATGCGGCT GTTGTGGGCG CGCGTCGGCG TGCCCTATTC GCCGGCGACG
GGCCTGCCGA TCGAGAGCCA GACCGTCTCG CAGATGGTCG ACCGCGTGCT GGCGTTGCCG
GAAGGCACGC GGCTGTATCT GCTGGCGCCG GTGGTGCGCG GCCGCAAGGG CGAGTACCGC
AAGGAGCTCG CCGAGTGGCT GAAGAAGGGG TTTCAGCGCG TCAAGATCGA CGGCGCATTC
CATGAGCTGG CCGAGGCGCC GACGCTCGAC AAGAAATTCC CGCACGACAT CGACGTGGTG
GTTGACCGCA TCGTCGTCCG CCCCGACATC GGCCAACGCC TGGCGGAAAG CTTCGAGACC
GCGCTGAAGC TCGCCGAGGG GCTGGCGGTG ATCGAATATG CCGATGCGCC GGCCGCCGCC
GCCTCGGCGC CAGCCGAACC CGGTCAGGAC GAGCCCGGCG ACAAGAAGAA AAAGGCCGAC
AAGAAGGTCG CCAAGATTCA CGACAAGACC GGCGCCGAAC GCATCCTGTT CTCGGAGAAA
TTCGCCTGCC CGGTGTCCGG CTTCACCATT CCGGAGATCG AGCCGCGACT GTTCTCGTTC
AACAACCCCT ATGGCGCCTG CCCGGCCTGC GGCGGGCTCG GCATCGAGCA GCATATCGAC
GCCGATCTGG TGGTTCCCGA CAAGGAGCTG ACGCTGCGCA AAGGCGCGAT CGCGCCGTGG
GCGAAGTCGT CGTCGCCTTA TTATCTGCAG ACCCTGACGG CGCTGGCGAA GTACTACAAG
TTCACGCTCG ACACCAAATG GAAGGACCTG ACCAAGAAGG TCCAGACCGC CCTGCTCTAT
GGCTCGGGCG ACGACGAGAT CAAGTTTTCC TACGAGGACG GCGTCCGCTC CTACGACACC
AAGAAGCCGT TCGAGGGCGT GGTCACCAAT ATCGAGCGGC GCTTCCGCGA GACCGAGAGC
GAATGGGCGC GCGAGGAGCT CGGCAAGTAC TTCTCCGACG TGCCGTGCGA CGCCTGCCAT
GGGCACCGCC TCAAGCCCGA GGCGCTGTGC GTCAAGATCG GCGGCAAGCA CATCGGCGAC
ATCAGCGAAT TGTCGGTGAG GCGCGCCGGC GAATGGTTCG AAGCCGTGCC GGCCGCGCTC
AACAAGCAGC AGAACGAGAT CGCCACCCGG ATCCTGAAAG AGATCCGCGA CCGGCTGTCG
TTCCTGCTCG ACGTCGGCCT GAACTATCTG ACGCTGGCGC GCGCCGCCGG CACGCTCAGC
GGCGGCGAAA GCCAGCGCAT TCGGCTGGCC TCGCAGATCG GCTCGGGCCT CACCGGCGTG
CTCTACGTAC TGGACGAGCC GTCGATCGGC CTGCACCAGC GCGACAACGC CCGGCTGCTC
GACACGCTGA AGCGGCTGCG CGACCTCGGC AACACCGTGA TCGTGGTCGA GCACGACGAG
GACGCCATTC GGCTGGCCGA TTTCGTGCTC GATATCGGCC CCGGCGCCGG CGTCCATGGC
GGCCACATCG TCGCGCAGGG CACGCCCGCC GAGGTGATGG CCAATCCGAA ATCGCTGACC
GGCAAATATC TCACCGGCGA ACTCTCGGTG CCGATCCCGG AGCGCAGGCC ACCGAACCAT
CGCCGCACCC TCAAGCTGGT CAACGCCCGC GGCAACAACC TCAAGAACGT CACCGCCGAA
ATTCCGCTCG GCCTGTTCAC CTGCGTCACC GGCGTGTCCG GCGGCGGCAA GTCGACGCTG
CTGATCGACA CCTTCTACAA GGCGATCGCC CGCAAGCTGA ACAACGCCAG CGAGCCGCCG
GCGCCGCACG ACCGCATCGA GGGCCTCGAG CACATCGACA AGATCATCGA CATCGACCAG
TCGCCGATCG GCCGCACCCC GCGCTCCAAC CCCGCCACCT ACACCGGAGC GTTCACCCCG
ATCCGCGAGT GGTTCGCCGG GCTACCGGAA TCCAAGGCGC GCGGCTACGA GCCCGGCCGG
TTCTCGTTCA ACGTCAAGGG CGGCCGCTGC GAAGCCTGTC AGGGCGACGG CGTCATCAAG
ATCGAGATGC ACTTTTTGCC CGACGTCTAC GTCACCTGCG ATGTGTGCAA AGGAAAACGC
TACAACCGCG AAACCCTCGA AGTGCTGTTC AAGGGCAAGT CGATCGCCGA CGTGCTCGAC
ATGACCGTCG AGGAAGCCGC CGACTTCTTC AAGGCGGTAC CGCGCGTCCG CGAGACCTTC
AAGACGCTGC ACCGCGTCGG CCTCGACTAC ATCCATGTCG GCCAGCAGGC CACCACGCTG
TCCGGCGGCG AAGCCCAGCG CGTCAAGCTC GCCAAGGAAC TGAGCAAACG CGCCACCGGC
CGCACGCTCT ACATCCTCGA CGAGCCGACC ACGGGACTAC ATTTTCACGA CGTCGCCAAA
CTGCTCGAAG TGCTGCACGA ACTGGTCGCG CAGGGCAACA CCGTGGTGGT GATCGAGCAC
AATCTCGAAG TGATCAAGAC CGCCGACTGG GTGATCGACC TCGGACCCGA AGGTGGCGAC
GGCGGCGGCG AAATCGTCGC CTGGGGCCCG CCGGAAGACA TCGTCAAGGC GCCGCGGAGC
TACACGGGGA AATTCCTGAA GCCGGTGCTG GAGAAGAAGG GACCGGCGGC GGTGCGGAAA
AAGAAGGCGG ATGAGGCGGC GGAGTGA
 
Protein sequence
MIKAKRQQSQ QQAVASSRTA ITIRGAREHN LKNVDVVIPR DKLAVFTGLS GSGKSSLAFD 
TIYAEGQRRY VESLSAYARQ FLEMMQKPDV DQIDGLSPAI SIEQKTTSKN PRSTVGTVTE
IYDYMRLLWA RVGVPYSPAT GLPIESQTVS QMVDRVLALP EGTRLYLLAP VVRGRKGEYR
KELAEWLKKG FQRVKIDGAF HELAEAPTLD KKFPHDIDVV VDRIVVRPDI GQRLAESFET
ALKLAEGLAV IEYADAPAAA ASAPAEPGQD EPGDKKKKAD KKVAKIHDKT GAERILFSEK
FACPVSGFTI PEIEPRLFSF NNPYGACPAC GGLGIEQHID ADLVVPDKEL TLRKGAIAPW
AKSSSPYYLQ TLTALAKYYK FTLDTKWKDL TKKVQTALLY GSGDDEIKFS YEDGVRSYDT
KKPFEGVVTN IERRFRETES EWAREELGKY FSDVPCDACH GHRLKPEALC VKIGGKHIGD
ISELSVRRAG EWFEAVPAAL NKQQNEIATR ILKEIRDRLS FLLDVGLNYL TLARAAGTLS
GGESQRIRLA SQIGSGLTGV LYVLDEPSIG LHQRDNARLL DTLKRLRDLG NTVIVVEHDE
DAIRLADFVL DIGPGAGVHG GHIVAQGTPA EVMANPKSLT GKYLTGELSV PIPERRPPNH
RRTLKLVNAR GNNLKNVTAE IPLGLFTCVT GVSGGGKSTL LIDTFYKAIA RKLNNASEPP
APHDRIEGLE HIDKIIDIDQ SPIGRTPRSN PATYTGAFTP IREWFAGLPE SKARGYEPGR
FSFNVKGGRC EACQGDGVIK IEMHFLPDVY VTCDVCKGKR YNRETLEVLF KGKSIADVLD
MTVEEAADFF KAVPRVRETF KTLHRVGLDY IHVGQQATTL SGGEAQRVKL AKELSKRATG
RTLYILDEPT TGLHFHDVAK LLEVLHELVA QGNTVVVIEH NLEVIKTADW VIDLGPEGGD
GGGEIVAWGP PEDIVKAPRS YTGKFLKPVL EKKGPAAVRK KKADEAAE