Gene Rru_A1752 details


Gene Information

Locus tag: Rru_A1752
Symbol: uvrA
ID: 3835174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2044331
End bp: 2047183
Gene Length: 2853 bp
Protein Length: 950 aa
Translation table: 11
GC content: 64%
IMG OID: 637825849
Product: excinuclease ABC subunit A
Protein accession: YP_426839
Protein GI: 83593087
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
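
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2044331, 2047183)
print(length)                     # 2853
print(protein_length_aa(length))  # 950
```

Both results agree with the Gene Length and Protein Length fields listed in the record.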


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.72619
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCC AGGAAATCCG CGTGCGCGGT GCGCGCGAAC ACAACCTTCG CAATGTCGAT 
GTGACCTTGC CCCGCGACAA ACTGGTCGTG ATCACCGGGC TGTCGGGTTC GGGGAAATCG
AGTCTCGCTT TTGACACGAT CTATGCCGAA GGCCAGCGGC GCTATGTGGA ATCCCTGTCG
GCCTATGCCC GCCAGTTCCT GGAGATGATG CAAAAGTCCG ATGTGGATTC GATCGAGGGG
CTGTCGCCAG CGATTTCCAT CGAGCAGAAG ACCACCTCGC GCAATCCGCG CTCGACCGTC
GGCACCGTGA CCGAGATCCA CGACTACATG CGCCTGCTGT GGGCGCGCAT CGGCGTTCCC
CATTCCCCGG CCACCGGCCT GCCGATCGAA AGCCAGACGG TCAGCCAGAT GGTCGATCGC
ACCCTGGCCC TGCCCGAAGG CACCCGGCTT TATCTGCTGG CCCCGGTGGC GCGCGGCCGC
AAGGGCGAGT TCAAGAAGGA ACTGGCCGAG CTGCAGAAAA AGGGCTTCAG CCGGGTCAAG
GTCGATGGCA CGATCTATGA GATCCCCGAG GTGCCCGCCC TCAACAAAAA GATCAAGCAC
GATATCGAGG TGGTGGTCGA CCGTCTGGTG GTCCGCGCCG ATATCGCCAG CCGGCTGGCC
GATTCCTTTG AAACCGCCCT TGAGCTCTCC GATGGACTGG TCTTCGCCGA AGACGCGGTC
TCGGGCGAGC GCCACACCTT TTCCGCCCGC TTCGCCTGCC CGGTCAGCGG CTTCACCATC
GACGAGATCG AACCCCGGCT GTTCTCGTTC AACAATCCCT TCGGCGCCTG TCCGACCTGT
GACGGCCTGG GGGTGACGCT GTATTTCGAC CCCGAGCTGG TGGTGCCCGA TCCCAGCCGC
ACCCTCAATC GCGGCGCCGT CGCCCCGTGG TCGGGACAAA CCCCGCCCTC GCCCTATTAC
GCCCAGGCGC TGGCGAGCAT CGCCGCCCAT TTCGGCGCCG ATATGGACAC GCCGTGGAAG
GATCTGCCCG AGGAGATGCG CCGGATCATC CTTGAAGGCT CGGGCAAGGA GATCATCCCG
CTCAGCTTCG ATGACGGCAC GCGCAGCTAT CGCACCCAGA AGCCCTTCGA AGGCGTCATC
CCCAATATCG CCCGGCGCTG GCGCGAGACC GAAAGCAACT GGATCCGCGA CGAATTATCG
CGCTACCAGG GTTCGGCCCC CTGCCCGGCC TGCGGCGGCT ATCGCCTGAA GCCCCAGGCC
CTGGCGGTCA AGATCAACGG CCGCCATATC GGCGAGGCCT CCGAGGTTTC GATCGCCGAG
GCCCGGGCCT GGTTCGCCGG GCTCGAGGCC AAACTCAGCC CCAAGCACCG CGAGATCGCC
GACCGCATCT TGCGCGAGAT CAACGAGCGC CTGGGCTTTC TCGGCAATGT CGGCCTTGAT
TATCTCAGCT TGTCGCGCAA TTCGGGCACA CTCTCGGGCG GCGAAAGCCA GCGCATCCGC
TTGGCCAGCC AGATCGGTTC GGGGTTGACC GGGGTTCTTT ATGTGCTCGA CGAGCCGTCG
ATCGGCCTGC ACCAGCGCGA TAACGACCGC CTGCTGATCA CGCTCAAGCG CCTGCGCGAC
ATCGGCAATA CGGTGATCGT CGTCGAGCAC GACGAGGACG CCATTCGCAA CGCCGATTAT
CTGGTCGACA TGGGGCCCGG GGCGGGCGTC CACGGCGGCA CCATCGTCGC CCAGGGCACG
CCCGAACAGG TGATGGCCAA TCCCGCCAGC CTGACCGGCC AGTATCTGAC CGGCAAGCGC
AGCGTGCCGG TGCCCACGGT TCGCCGCCAG GGCAATGGCA AAGTCCTGAC CCTGCGCGGG
GCGCGGGCCA ATAATCTGCA AAACGTCGAT GTGTCCATTC CGCTTGGCAC CTTCACCTGC
ATCACCGGCG TCTCGGGCGG CGGCAAATCG ACCCTGGTTT TGGAAACCCT TTACAAGGCG
TTGGCCCGTC AGCTTCACGG GGCGCGCGAT CTGCCCGGCG AGCATGACGC CATCGAAGGC
GCCGAGCAGA TCGACAAGAT CGTCGATATC GACCAATCGC CGATCGGCCG CACGCCGCGC
TCCAACCCCG CCACCTATAC GGGCGCCTTC ACCCCCATCC GCGACTGGTT CTCGGGCCTG
CCCGAGGCCA AGGCCCGGGG CTATAAGCCC GGCCGCTTCT CGTTCAACGT CAAGGGCGGA
CGCTGCGAAG CCTGCCAGGG CGACGGGCTG ATCAAGATCG AGATGCACTT CCTGCCCGAT
GTCTATGTCA CCTGCGATGT CTGCAAGGGC AAGCGCTACA ACCGCGAAAC CCTGGATGTC
ACCTTCAAGG GCAAATCGAT CGCCGATGTG TTGGATATGA CGATCGAAGA GGCCGGTGAC
TTCTTCAAGG CGGTGCCGGC GGTGCGCGAC AAGATGGAGA TGCTCCAGCA GGTCGGGCTT
GATTATATCC GCCTCGGCCA ACAGGCGACG ACCCTGTCGG GTGGCGAGGC CCAGCGCGTC
AAGCTGGCCA AGGAACTGTC ACGCCGGGCG ACCGGGCGAA CGCTTTATAT CCTGGATGAG
CCGACCACCG GCCTGCATTT CGAGGATGTG CGTAAGCTGA TGGAGGTGCT GCAGGCCCTG
GTCGATACGG GCAATACGGT GGTGGTGATC GAGCATAACC TGGAAGTGAT CAAAACCGCC
GACCATATCA TCGACATGGG GCCAGAAGGC GGATCGGGCG GCGGCCGGGT GGTGGCCCAA
GGCACTCCCG AGGAGGTCGC GGCCAATCCG GCCAGCCATA CCGGCAGCTA TCTCAAGCCC
TATCTCTCGG CCCTCGCCCG GCGCAGCGCG TAA
 
Protein sequence
MSTQEIRVRG AREHNLRNVD VTLPRDKLVV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
AYARQFLEMM QKSDVDSIEG LSPAISIEQK TTSRNPRSTV GTVTEIHDYM RLLWARIGVP
HSPATGLPIE SQTVSQMVDR TLALPEGTRL YLLAPVARGR KGEFKKELAE LQKKGFSRVK
VDGTIYEIPE VPALNKKIKH DIEVVVDRLV VRADIASRLA DSFETALELS DGLVFAEDAV
SGERHTFSAR FACPVSGFTI DEIEPRLFSF NNPFGACPTC DGLGVTLYFD PELVVPDPSR
TLNRGAVAPW SGQTPPSPYY AQALASIAAH FGADMDTPWK DLPEEMRRII LEGSGKEIIP
LSFDDGTRSY RTQKPFEGVI PNIARRWRET ESNWIRDELS RYQGSAPCPA CGGYRLKPQA
LAVKINGRHI GEASEVSIAE ARAWFAGLEA KLSPKHREIA DRILREINER LGFLGNVGLD
YLSLSRNSGT LSGGESQRIR LASQIGSGLT GVLYVLDEPS IGLHQRDNDR LLITLKRLRD
IGNTVIVVEH DEDAIRNADY LVDMGPGAGV HGGTIVAQGT PEQVMANPAS LTGQYLTGKR
SVPVPTVRRQ GNGKVLTLRG ARANNLQNVD VSIPLGTFTC ITGVSGGGKS TLVLETLYKA
LARQLHGARD LPGEHDAIEG AEQIDKIVDI DQSPIGRTPR SNPATYTGAF TPIRDWFSGL
PEAKARGYKP GRFSFNVKGG RCEACQGDGL IKIEMHFLPD VYVTCDVCKG KRYNRETLDV
TFKGKSIADV LDMTIEEAGD FFKAVPAVRD KMEMLQQVGL DYIRLGQQAT TLSGGEAQRV
KLAKELSRRA TGRTLYILDE PTTGLHFEDV RKLMEVLQAL VDTGNTVVVI EHNLEVIKTA
DHIIDMGPEG GSGGGRVVAQ GTPEEVAANP ASHTGSYLKP YLSALARRSA
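
The sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not a reference implementation, of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the compact NCBI-style amino-acid string for translation table 11, whose codon assignments match the standard code; alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
first_line = "ATGAGCACCC AGGAAATCCG CGTGCGCGGT GCGCGCGAAC ACAACCTTCG CAATGTCGAT"
last_line = "TATCTCTCGG CCCTCGCCCG GCGCAGCGCG TAA"
print(translate(first_line))  # MSTQEIRVRGAREHNLRNVD
print(translate(last_line))   # YLSALARRSA
```

The two translated fragments match the first twenty and last ten residues of the protein sequence. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field in the record.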