Gene Information |
Locus tag | Rsph17029_1612 |
Symbol | uvrA |
ID | 4897209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 1694610 |
End bp | 1697468 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640112203 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001043494 |
Protein GI | 126462380 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
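The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1694610, 1697468)
print(length)                     # 2859, matching "Gene Length"
print(protein_length_aa(length))  # 952, matching "Protein Length"
```

Because the gene is not spliced, the genomic span and the coding length coincide, so no intron bookkeeping is needed in this check.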
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0475567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0441926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAC AGAAGTTCAT CTCGGTGCGC GGCGCGCGCG AGCACAATCT CAAGGGCATC GATGTCGACA TCCCGCGGGA TCAGCTGGTG GTCATCACCG GCCTGTCGGG GTCGGGCAAG TCGAGCCTCG CCTTCGACAC GATCTATGCC GAGGGGCAGC GCCGCTATGT CGAGAGCCTC TCGGCCTATG CGCGGCAGTT CCTCGACATG ATGGGCAAGC CGGATGTGGA CCATATCTCG GGTCTGTCGC CGGCCATCTC CATCGAGCAG AAGACGACCT CGAAGAACCC GCGCTCGACC GTGGGGACCG TGACCGAGAT CTACGACTAC ATGCGCCTTC TGTGGGCGCG GGTCGGCACG CCCTACAGCC CGGCCACCGG CCTGCCCATC GCGGCGCAGC AGGTGCAGGA CATGGTCGAT GCGGTGATGG CAATGCCCGA GGGCGCGCGG GGCTATCTGC TCGCGCCCAT CGTCCGGGAC CGCAAGGGCG AATACAAGAA GGAGTTCATC GAGCTCCGCA AGCAGGGCTT CCAGCGCGTG AAGGTGAACG GCACCTTCCA CGAGCTGGAG GAGCCGCCGA CGCTGGACAA GAAGTTCCGC CACGACATCG ATGTGGTGGT GGACCGGATC GTGGTGCGCG AGGGGATCGA GACGCGGCTG GCCGACAGCT TCCGCACCGC GCTGAACCTC GCCGACGGGA TCGCGATCTT CGAGAGCGCG CCCGCCGAGG GCGAGCCGGT GCGCACGACC TTCTCCGAGA AATTCGCCTG CCCGGTCTCG GGCTTCACCA TCCCCGAGAT CGAGCCGCGG CTCTTTTCCT TCAACGCGCC CTTCGGCGCC TGCCCGGAAT GCGACGGGCT GGGGATGGAG CTCTTCTTCG ACGAGCGCCT CGTGGTGCCC GATCAGGGGC TCACGCTGAA GCAGGGGGCC ATCGCGCCCT GGGCCAAATC GAAATCGCCC TACTACACCC AGACCATCGA GGCGCTGGCG CGGCATTACG GCTTCGATCC GAAGAAGAAA TGGAAGGATC TTCCGTTCAA CGTGCAATCC GTCTTCCTGC GCGGCTCGGG CGAGGAGGAG ATCACCTTCC GCTACGACGA GGGCGGGCGG ATCTATCAGG TCAGCCGCAG CTTCGAGGGC GTGATCCCGA ACATGGAGCG CCGCTACCGC GAGACCGACT CGGCCTGGGT GCGCGAGGAA TTCGAGCGCT ACCAGAACAA CCGGCCCTGC CATGTCTGCG GCGGCTACCG GCTGAAGCCC GAGGCGCTAG CGGTGAAGAT CGGCGGCCTG CATATCGGGC AGGTGGTGCA GATGTCGATC AAGGAGGCCT TCGCCTGGAT CGAGACAGTG CCGGGCCATC TGACTGCGCA GAAGAACGAG ATCGCGCGCG CGATCCTCAA GGAGATCCGC GAGCGGCTGG GCTTCCTCGT CAATGTGGGG CTCGACTATC TGTCGATGAG CCGCGCGGCC GGCACGCTCT CGGGCGGGGA AAGCCAGCGG ATCCGGCTCG CCTCCCAGAT CGGCTCGGGG CTGACGGGCG TGCTCTATGT GCTCGACGAG CCCTCGATCG GGCTGCACCA GCGCGACAAC GACCGCCTGC TGACCACGCT GAAGAACCTG CGCGACCAGG GCAATTCGGT GCTGGTGGTG GAGCATGACG AGGATGCGAT CCGCGAGGCG GATTATGTGT TCGACGTGGG CCCGGGCGCG GGCGTCCATG GCGGGCAGGT GGTGGCGCAC GGCACGCCCG CCGAGATCGC CGCCGATCCG GCGAGCCTCA CCGGTCAGTA TCTCTCGGGC 
ACACGCGAAA TCTCGGTGCC CGCCGAGCGG CGGACGGGCA ACGGCAAGTC CCTGACGGTG GTGAAGGCCA GCGGCAACAA CCTCCACGAT GTGACGGTGG ACTTCCCCCT GGGCAAGTTC GTTTGCGTGA CCGGCGTCTC GGGCGGCGGC AAGTCGACGC TGACCATCGA GACGCTCTAC AAGACGGCGG CGATGCGGCT GAACGGGGCG CGCGAGACGC CGGCGCCCTG CGAGACGATC AAGGGCTTCG AGCAGCTCGA CAAGGTCATC GACATCGACC AGCGGCCGAT CGGGCGCACG CCGCGCTCGA ACCCCGCGAC CTACACCGGG GCCTTCACGC CGATCCGCGA CTGGTTCGCG GGCCTGCCCG AGTCGAAGGC GCGCGGCTAC CAGCCGGGCC GGTTCTCGTT CAACGTGAAG GGCGGGCGCT GCGAGGCCTG CCAGGGCGAT GGCGTCATCA AGATCGAGAT GCACTTCCTG CCCGACGTCT ATGTGACCTG CGAGACCTGC AAGGGCCACC GCTACAACCG CGAGACGCTG GAGATCAAGT TCAAGGGCAA GAGCATCGCC GACGTGCTCG AGATGACGGT CGAGGATGCG CAGGAGTTCT TCCAGGCGGT GCCCTCGATC CGCGAGAAGA TGGACGCGCT GATGCGGGTG GGCCTCGGCT ACATCAAGGT GGGCCAGCAG GCGACGACGC TCTCGGGCGG CGAGGCGCAG CGGGTGAAAC TCTCCAAGGA GCTGAGCCGC CGCGCCACGG GACGCACGCT CTACATTCTC GATGAGCCGA CGACGGGGCT GCATTTCGAG GATGTGAAAA AGCTGCTCGA GGTGCTGCAC GAGCTGGTGG AGCAGGGCAA CACGGTGGTG GTGATCGAGC ACAATCTCGA TGTGGTGAAG ACCGCGGACT GGATCATCGA CATCGGCCCG GAAGGCGGCG ACGGCGGCGG CAAGATCGTG GCCGAGGGAA CGCCCGAGGA GGTGGCCAAG GTCGAGGCCT CTCACACCGG CCGCTACCTC CGCGACATGC TGAAACCCCG ACGTCTGGCC GCGGAATAG
|
Protein sequence | MAEQKFISVR GAREHNLKGI DVDIPRDQLV VITGLSGSGK SSLAFDTIYA EGQRRYVESL SAYARQFLDM MGKPDVDHIS GLSPAISIEQ KTTSKNPRST VGTVTEIYDY MRLLWARVGT PYSPATGLPI AAQQVQDMVD AVMAMPEGAR GYLLAPIVRD RKGEYKKEFI ELRKQGFQRV KVNGTFHELE EPPTLDKKFR HDIDVVVDRI VVREGIETRL ADSFRTALNL ADGIAIFESA PAEGEPVRTT FSEKFACPVS GFTIPEIEPR LFSFNAPFGA CPECDGLGME LFFDERLVVP DQGLTLKQGA IAPWAKSKSP YYTQTIEALA RHYGFDPKKK WKDLPFNVQS VFLRGSGEEE ITFRYDEGGR IYQVSRSFEG VIPNMERRYR ETDSAWVREE FERYQNNRPC HVCGGYRLKP EALAVKIGGL HIGQVVQMSI KEAFAWIETV PGHLTAQKNE IARAILKEIR ERLGFLVNVG LDYLSMSRAA GTLSGGESQR IRLASQIGSG LTGVLYVLDE PSIGLHQRDN DRLLTTLKNL RDQGNSVLVV EHDEDAIREA DYVFDVGPGA GVHGGQVVAH GTPAEIAADP ASLTGQYLSG TREISVPAER RTGNGKSLTV VKASGNNLHD VTVDFPLGKF VCVTGVSGGG KSTLTIETLY KTAAMRLNGA RETPAPCETI KGFEQLDKVI DIDQRPIGRT PRSNPATYTG AFTPIRDWFA GLPESKARGY QPGRFSFNVK GGRCEACQGD GVIKIEMHFL PDVYVTCETC KGHRYNRETL EIKFKGKSIA DVLEMTVEDA QEFFQAVPSI REKMDALMRV GLGYIKVGQQ ATTLSGGEAQ RVKLSKELSR RATGRTLYIL DEPTTGLHFE DVKKLLEVLH ELVEQGNTVV VIEHNLDVVK TADWIIDIGP EGGDGGGKIV AEGTPEEVAK VEASHTGRYL RDMLKPRRLA AE
|
| |
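The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base, then second, then third, each over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_30 = "ATGGCCGAAC AGAAGTTCAT CTCGGTGCGC"
print(translate(first_30))  # MAEQKFISVR, the first ten residues of the protein
```

Passing the full gene sequence to `gc_fraction` should give a value close to the 66% reported in the GC content field, and `translate` should stop at the terminal TAG codon.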