Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2757 |
Symbol | uvrA |
ID | 3970174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2995093 |
End bp | 2998098 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637925867 |
Product | excinuclease ABC subunit A |
Protein accession | YP_532624 |
Protein GI | 90424254 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.30708 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.666672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAAG TGATCAAGGC CAAGCGCCAG CCGGCGGGCT CCCAGCAGCG CTCAATTACC ATTCGCGGCG CCCGCGAACA TAACCTCAAA AACGTCGACC TCGAGATCCC GCGCGACCAG CTCGTGGTGT TCACCGGGCT GTCCGGCTCC GGCAAATCCT CGCTGGCGTT CGACACCATC TATGCCGAGG GCCAGCGCCG TTACGTCGAA TCGCTGTCGG CCTATGCCCG GCAGTTTCTG GAGATGATGC AGAAGCCCGA CGTCGACCAG ATCGACGGGC TGTCGCCGGC GATCTCGATC GAGCAGAAGA CCACCTCGAA GAACCCGCGC TCGACGGTCG GCACGGTGAC CGAGATCTAC GACTACATGC GGCTGTTGTG GGCGCGGGCC GGCATTCCTT ATTCGCCGGC CACCGGGCTG CCGATCGAAA GCCAGATCGT CTCGCAGATG GTCGACAAGG TTTTGGCGCT GCCGGAAGGC ACCAGGCTGT ACCTGCTGGC GCCGGTGGTG CGCGGCCGCA AGGGCGAATA CCGCAAGGAA TTGGCGGACT ATCTCAAGAA GGGCTTTCAG CGCGTCAAGA TCGACGGCGC ATTCCACGAA CTCGCCGAAG CGCCGACGCT GGACAAGAAA TTCCCGCACG ACATCGACGT GGTGGTCGAC CGCATCGTGG TGCGCCCGGA CATCGGCCAG CGGCTCGCCG AAAGCTTTGA GACCGCGCTG AAGCTCGCCG ATGGCCTCGC GGTGATCGAA TATGCCGACG CGCCGGCCAC GTCGCCGGCG GCGACCGCAG CCGCGGAGCC CGAAAAGAAA AAGGCCGACA AGAAGGTCGC GAAAATTCAC GACAAGACCG GCGCCGAACG CATCATGTTT TCGGAAAAAT TCGCCTGCCC GGTGTCTGGC TTCACCATTC CGGAGATCGA GCCGCGGCTG TTCTCGTTCA ACAACCCCTA TGGCGCCTGC CCGGCCTGCG GCGGGCTCGG CATCGAGCAG CACATCGACG CCGATCTGGT GATCCCCGAC AAGGAGCTGA GCTTGCGCAA GGGCGCGATC GCGCCGTGGG CGAAGTCGTC GTCGCCGTAC TACATCCAGA CCCTCACCGC GCTGGGAAAA TTCTACAAGT TCACGCTCGA CACCAAGTGG AAGGATCTGC CGAAGAAGAC CCAGAACGCG CTGCTGCACG GCAGCGGCGA CGACGAGATC AAGTTCTCCT ACGAAGACGG CGTGCGCTCT TACGACACCA AGAAGCCGTT CGAGGGCGTC GTCACCAACA TCCAGCGCCG CTTCCGCGAG ACCGAAAGCG AATGGGCGCG CGAGGAACTC GGCAAGTATT TCTCCGACGT GCCCTGCGCC GCCTGCCACG GCTTCCGGCT GAAGCCCGAG GCGCTGTGCG TCAAGATCGG CGGCAAGCAT ATCGGCGAGG TCTCTGAGCT GTCGGTGCGC CGCGCCGGCG AATGGTTCGA GACCGTGCCG AAACTGCTCA ACAAGCAGCA GAACGAGATC GCGGTCCGGA TCCTGAAGGA GATCCGTGAG CGGCTCTCCT TCCTGCTCGA CGTCGGCCTG AATTATCTGA CGCTGGCGCG CGCCTCCGGC ACGCTGTCCG GCGGCGAGAG CCAGCGCATC CGCCTCGCCT CGCAGATCGG AAGCGGCCTC ACCGGCGTGC TCTATGTGCT GGACGAGCCG TCGATCGGGC TGCACCAGCG CGACAACGCG CGCCTCTTGG ATACGCTGCG GCGGCTGCGC GACCTCGGCA ACACCGTGAT CGTGGTCGAG CACGACGAGG ACGCGGTGCT CGCCGCCGAC TACGTCGTCG ATGTCGGCCC CGGCGCCGGC ACCCATGGCG GCCATATCGT CGCGCAGGGC ACGCCCGCCG AGGTGATGAA GAATCCGAAA TCGCTGACCG GCAAATATCT CACCGGCGAA TTGTTCGTGC CGGTGCCGGA GCGGCGGCCG CCGAACCATC GCCGCACCCT GAAAGTGATC AACGCCCGCG GCAACAATCT GAAGAATGTT TCCGCGGAAA TTCCGCTCGG GCTGTTCACC TGCGTCACCG GCGTCTCCGG CGGCGGCAAG TCGACGCTGC TGATCGACAC TTTGTACAAA GCCATCGCGC GAAAACTCAA CAACGCCTCT GAAGGCGCCG CACCGCATGA CCGCATCGAG GGGCTGGAGC ACATCGACAA GATCATCGAC ATCGATCAGT CGCCGATCGG CCGCACCCCG CGCTCCAACC CCGCGACCTA CACCGGCGCC TTCACGCCGA TCCGCGAATG GTTCGCCGGC CTGCCGGAAT CCAAGGCGCG CGGCTACGAG CCGGGCCGCT TCTCGTTCAA CGTCAAGGGC GGCCGCTGCG AGGCCTGCCA GGGCGACGGC GTAATCAAGA TCGAGATGCA CTTCCTGCCC GACGTCTACG TCACCTGCGA CGTCTGCAAG GGCAAGCGCT ACAACCGCGA GACGCTCGAG GTCTTGTTCA AGGGCAAGTC GATCGCCGAC GTGCTCGACA TGACGGTGGA AGAAGCCGCC GAGTTCTTCA AGGCGGTGCC GCGGGTGCGC GAGACCTTCA AGACGCTGAA GCGCGTCGGG CTCGACTACA TCCATGTCGG CCAGCAGGCC ACCACGCTGT CCGGCGGCGA GGCGCAGCGC GTCAAGCTGG CGAAGGAGCT GAGCAAGCGC GCCACCGGCC GCACGCTGTA CATCCTCGAC GAGCCGACCA CCGGTCTGCA CTTCCACGAC GTCGCCAAGC TCTTGGAAGT GCTGCACGAG CTGGTGTCGC AGGGCAACAG CGTGGTGGTG ATCGAGCACA ATCTCGAAGT GATCAAGACC GCCGACTGGG TGATCGACCT CGGCCCCGAA GGCGGCGACG GCGGCGGCGA AATCGTCGCC TGGGGCCCGC CGGAGGATAT CGTCAAGGCG CCGCGCAGCT ACACCGGAAA ATTCCTGAAG CCGGTGCTGG AAAAGGCGGC AGGCATCGCG AAGAAGAAGC GCAAGACGGG CGAAGCGGCG GAGTAA
|
Protein sequence | MDEVIKAKRQ PAGSQQRSIT IRGAREHNLK NVDLEIPRDQ LVVFTGLSGS GKSSLAFDTI YAEGQRRYVE SLSAYARQFL EMMQKPDVDQ IDGLSPAISI EQKTTSKNPR STVGTVTEIY DYMRLLWARA GIPYSPATGL PIESQIVSQM VDKVLALPEG TRLYLLAPVV RGRKGEYRKE LADYLKKGFQ RVKIDGAFHE LAEAPTLDKK FPHDIDVVVD RIVVRPDIGQ RLAESFETAL KLADGLAVIE YADAPATSPA ATAAAEPEKK KADKKVAKIH DKTGAERIMF SEKFACPVSG FTIPEIEPRL FSFNNPYGAC PACGGLGIEQ HIDADLVIPD KELSLRKGAI APWAKSSSPY YIQTLTALGK FYKFTLDTKW KDLPKKTQNA LLHGSGDDEI KFSYEDGVRS YDTKKPFEGV VTNIQRRFRE TESEWAREEL GKYFSDVPCA ACHGFRLKPE ALCVKIGGKH IGEVSELSVR RAGEWFETVP KLLNKQQNEI AVRILKEIRE RLSFLLDVGL NYLTLARASG TLSGGESQRI RLASQIGSGL TGVLYVLDEP SIGLHQRDNA RLLDTLRRLR DLGNTVIVVE HDEDAVLAAD YVVDVGPGAG THGGHIVAQG TPAEVMKNPK SLTGKYLTGE LFVPVPERRP PNHRRTLKVI NARGNNLKNV SAEIPLGLFT CVTGVSGGGK STLLIDTLYK AIARKLNNAS EGAAPHDRIE GLEHIDKIID IDQSPIGRTP RSNPATYTGA FTPIREWFAG LPESKARGYE PGRFSFNVKG GRCEACQGDG VIKIEMHFLP DVYVTCDVCK GKRYNRETLE VLFKGKSIAD VLDMTVEEAA EFFKAVPRVR ETFKTLKRVG LDYIHVGQQA TTLSGGEAQR VKLAKELSKR ATGRTLYILD EPTTGLHFHD VAKLLEVLHE LVSQGNSVVV IEHNLEVIKT ADWVIDLGPE GGDGGGEIVA WGPPEDIVKA PRSYTGKFLK PVLEKAAGIA KKKRKTGEAA E
|
| |