Gene Information |
Locus tag | Rru_A1752 |
Symbol | uvrA |
ID | 3835174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2044331 |
End bp | 2047183 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637825849 |
Product | excinuclease ABC subunit A |
Protein accession | YP_426839 |
Protein GI | 83593087 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
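The coordinate and length fields in this record agree with each other, and a few lines of arithmetic confirm it. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2044331, End bp 2047183.
length = gene_length(2044331, 2047183)
print(length)                  # 2853, matching "Gene Length"
print(protein_length(length))  # 950, matching "Protein Length"
```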
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.72619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCACCC AGGAAATCCG CGTGCGCGGT GCGCGCGAAC ACAACCTTCG CAATGTCGAT GTGACCTTGC CCCGCGACAA ACTGGTCGTG ATCACCGGGC TGTCGGGTTC GGGGAAATCG AGTCTCGCTT TTGACACGAT CTATGCCGAA GGCCAGCGGC GCTATGTGGA ATCCCTGTCG GCCTATGCCC GCCAGTTCCT GGAGATGATG CAAAAGTCCG ATGTGGATTC GATCGAGGGG CTGTCGCCAG CGATTTCCAT CGAGCAGAAG ACCACCTCGC GCAATCCGCG CTCGACCGTC GGCACCGTGA CCGAGATCCA CGACTACATG CGCCTGCTGT GGGCGCGCAT CGGCGTTCCC CATTCCCCGG CCACCGGCCT GCCGATCGAA AGCCAGACGG TCAGCCAGAT GGTCGATCGC ACCCTGGCCC TGCCCGAAGG CACCCGGCTT TATCTGCTGG CCCCGGTGGC GCGCGGCCGC AAGGGCGAGT TCAAGAAGGA ACTGGCCGAG CTGCAGAAAA AGGGCTTCAG CCGGGTCAAG GTCGATGGCA CGATCTATGA GATCCCCGAG GTGCCCGCCC TCAACAAAAA GATCAAGCAC GATATCGAGG TGGTGGTCGA CCGTCTGGTG GTCCGCGCCG ATATCGCCAG CCGGCTGGCC GATTCCTTTG AAACCGCCCT TGAGCTCTCC GATGGACTGG TCTTCGCCGA AGACGCGGTC TCGGGCGAGC GCCACACCTT TTCCGCCCGC TTCGCCTGCC CGGTCAGCGG CTTCACCATC GACGAGATCG AACCCCGGCT GTTCTCGTTC AACAATCCCT TCGGCGCCTG TCCGACCTGT GACGGCCTGG GGGTGACGCT GTATTTCGAC CCCGAGCTGG TGGTGCCCGA TCCCAGCCGC ACCCTCAATC GCGGCGCCGT CGCCCCGTGG TCGGGACAAA CCCCGCCCTC GCCCTATTAC GCCCAGGCGC TGGCGAGCAT CGCCGCCCAT TTCGGCGCCG ATATGGACAC GCCGTGGAAG GATCTGCCCG AGGAGATGCG CCGGATCATC CTTGAAGGCT CGGGCAAGGA GATCATCCCG CTCAGCTTCG ATGACGGCAC GCGCAGCTAT CGCACCCAGA AGCCCTTCGA AGGCGTCATC CCCAATATCG CCCGGCGCTG GCGCGAGACC GAAAGCAACT GGATCCGCGA CGAATTATCG CGCTACCAGG GTTCGGCCCC CTGCCCGGCC TGCGGCGGCT ATCGCCTGAA GCCCCAGGCC CTGGCGGTCA AGATCAACGG CCGCCATATC GGCGAGGCCT CCGAGGTTTC GATCGCCGAG GCCCGGGCCT GGTTCGCCGG GCTCGAGGCC AAACTCAGCC CCAAGCACCG CGAGATCGCC GACCGCATCT TGCGCGAGAT CAACGAGCGC CTGGGCTTTC TCGGCAATGT CGGCCTTGAT TATCTCAGCT TGTCGCGCAA TTCGGGCACA CTCTCGGGCG GCGAAAGCCA GCGCATCCGC TTGGCCAGCC AGATCGGTTC GGGGTTGACC GGGGTTCTTT ATGTGCTCGA CGAGCCGTCG ATCGGCCTGC ACCAGCGCGA TAACGACCGC CTGCTGATCA CGCTCAAGCG CCTGCGCGAC ATCGGCAATA CGGTGATCGT CGTCGAGCAC GACGAGGACG CCATTCGCAA CGCCGATTAT CTGGTCGACA TGGGGCCCGG GGCGGGCGTC CACGGCGGCA CCATCGTCGC CCAGGGCACG CCCGAACAGG TGATGGCCAA TCCCGCCAGC CTGACCGGCC AGTATCTGAC CGGCAAGCGC 
AGCGTGCCGG TGCCCACGGT TCGCCGCCAG GGCAATGGCA AAGTCCTGAC CCTGCGCGGG GCGCGGGCCA ATAATCTGCA AAACGTCGAT GTGTCCATTC CGCTTGGCAC CTTCACCTGC ATCACCGGCG TCTCGGGCGG CGGCAAATCG ACCCTGGTTT TGGAAACCCT TTACAAGGCG TTGGCCCGTC AGCTTCACGG GGCGCGCGAT CTGCCCGGCG AGCATGACGC CATCGAAGGC GCCGAGCAGA TCGACAAGAT CGTCGATATC GACCAATCGC CGATCGGCCG CACGCCGCGC TCCAACCCCG CCACCTATAC GGGCGCCTTC ACCCCCATCC GCGACTGGTT CTCGGGCCTG CCCGAGGCCA AGGCCCGGGG CTATAAGCCC GGCCGCTTCT CGTTCAACGT CAAGGGCGGA CGCTGCGAAG CCTGCCAGGG CGACGGGCTG ATCAAGATCG AGATGCACTT CCTGCCCGAT GTCTATGTCA CCTGCGATGT CTGCAAGGGC AAGCGCTACA ACCGCGAAAC CCTGGATGTC ACCTTCAAGG GCAAATCGAT CGCCGATGTG TTGGATATGA CGATCGAAGA GGCCGGTGAC TTCTTCAAGG CGGTGCCGGC GGTGCGCGAC AAGATGGAGA TGCTCCAGCA GGTCGGGCTT GATTATATCC GCCTCGGCCA ACAGGCGACG ACCCTGTCGG GTGGCGAGGC CCAGCGCGTC AAGCTGGCCA AGGAACTGTC ACGCCGGGCG ACCGGGCGAA CGCTTTATAT CCTGGATGAG CCGACCACCG GCCTGCATTT CGAGGATGTG CGTAAGCTGA TGGAGGTGCT GCAGGCCCTG GTCGATACGG GCAATACGGT GGTGGTGATC GAGCATAACC TGGAAGTGAT CAAAACCGCC GACCATATCA TCGACATGGG GCCAGAAGGC GGATCGGGCG GCGGCCGGGT GGTGGCCCAA GGCACTCCCG AGGAGGTCGC GGCCAATCCG GCCAGCCATA CCGGCAGCTA TCTCAAGCCC TATCTCTCGG CCCTCGCCCG GCGCAGCGCG TAA
|
Protein sequence | MSTQEIRVRG AREHNLRNVD VTLPRDKLVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS AYARQFLEMM QKSDVDSIEG LSPAISIEQK TTSRNPRSTV GTVTEIHDYM RLLWARIGVP HSPATGLPIE SQTVSQMVDR TLALPEGTRL YLLAPVARGR KGEFKKELAE LQKKGFSRVK VDGTIYEIPE VPALNKKIKH DIEVVVDRLV VRADIASRLA DSFETALELS DGLVFAEDAV SGERHTFSAR FACPVSGFTI DEIEPRLFSF NNPFGACPTC DGLGVTLYFD PELVVPDPSR TLNRGAVAPW SGQTPPSPYY AQALASIAAH FGADMDTPWK DLPEEMRRII LEGSGKEIIP LSFDDGTRSY RTQKPFEGVI PNIARRWRET ESNWIRDELS RYQGSAPCPA CGGYRLKPQA LAVKINGRHI GEASEVSIAE ARAWFAGLEA KLSPKHREIA DRILREINER LGFLGNVGLD YLSLSRNSGT LSGGESQRIR LASQIGSGLT GVLYVLDEPS IGLHQRDNDR LLITLKRLRD IGNTVIVVEH DEDAIRNADY LVDMGPGAGV HGGTIVAQGT PEQVMANPAS LTGQYLTGKR SVPVPTVRRQ GNGKVLTLRG ARANNLQNVD VSIPLGTFTC ITGVSGGGKS TLVLETLYKA LARQLHGARD LPGEHDAIEG AEQIDKIVDI DQSPIGRTPR SNPATYTGAF TPIRDWFSGL PEAKARGYKP GRFSFNVKGG RCEACQGDGL IKIEMHFLPD VYVTCDVCKG KRYNRETLDV TFKGKSIADV LDMTIEEAGD FFKAVPAVRD KMEMLQQVGL DYIRLGQQAT TLSGGEAQRV KLAKELSRRA TGRTLYILDE PTTGLHFEDV RKLMEVLQAL VDTGNTVVVI EHNLEVIKTA DHIIDMGPEG GSGGGRVVAQ GTPEEVAANP ASHTGSYLKP YLSALARRSA
|
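The gene and protein sequences can be checked against each other by translating the DNA codon by codon. The sketch below builds a codon table from the standard NCBI amino-acid ordering (bases in T, C, A, G order). Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits. Because this gene begins with ATG, no start-codon special case is handled here. The `translate` helper is hypothetical, and the two fragments are copied from the start and end of the gene sequence in this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks of the record can be pasted
    directly. Trailing bases that do not fill a codon are dropped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCACCC AGGAAATCCG CGTGCGCGGT"))  # MSTQEIRVRG
# Last 12 bases of the gene sequence, including the TAA stop codon.
print(translate("CGCAGCGCGTAA"))  # RSA
```

Both outputs match the first ten and last three residues of the protein sequence listed in the record.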