Gene Rsph17029_1612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1612 
SymboluvrA 
ID4897209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1694610 
End bp1697468 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content66% 
IMG OID640112203 
Productexcinuclease ABC subunit A 
Protein accessionYP_001043494 
Protein GI126462380 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0475567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0441926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAC AGAAGTTCAT CTCGGTGCGC GGCGCGCGCG AGCACAATCT CAAGGGCATC 
GATGTCGACA TCCCGCGGGA TCAGCTGGTG GTCATCACCG GCCTGTCGGG GTCGGGCAAG
TCGAGCCTCG CCTTCGACAC GATCTATGCC GAGGGGCAGC GCCGCTATGT CGAGAGCCTC
TCGGCCTATG CGCGGCAGTT CCTCGACATG ATGGGCAAGC CGGATGTGGA CCATATCTCG
GGTCTGTCGC CGGCCATCTC CATCGAGCAG AAGACGACCT CGAAGAACCC GCGCTCGACC
GTGGGGACCG TGACCGAGAT CTACGACTAC ATGCGCCTTC TGTGGGCGCG GGTCGGCACG
CCCTACAGCC CGGCCACCGG CCTGCCCATC GCGGCGCAGC AGGTGCAGGA CATGGTCGAT
GCGGTGATGG CAATGCCCGA GGGCGCGCGG GGCTATCTGC TCGCGCCCAT CGTCCGGGAC
CGCAAGGGCG AATACAAGAA GGAGTTCATC GAGCTCCGCA AGCAGGGCTT CCAGCGCGTG
AAGGTGAACG GCACCTTCCA CGAGCTGGAG GAGCCGCCGA CGCTGGACAA GAAGTTCCGC
CACGACATCG ATGTGGTGGT GGACCGGATC GTGGTGCGCG AGGGGATCGA GACGCGGCTG
GCCGACAGCT TCCGCACCGC GCTGAACCTC GCCGACGGGA TCGCGATCTT CGAGAGCGCG
CCCGCCGAGG GCGAGCCGGT GCGCACGACC TTCTCCGAGA AATTCGCCTG CCCGGTCTCG
GGCTTCACCA TCCCCGAGAT CGAGCCGCGG CTCTTTTCCT TCAACGCGCC CTTCGGCGCC
TGCCCGGAAT GCGACGGGCT GGGGATGGAG CTCTTCTTCG ACGAGCGCCT CGTGGTGCCC
GATCAGGGGC TCACGCTGAA GCAGGGGGCC ATCGCGCCCT GGGCCAAATC GAAATCGCCC
TACTACACCC AGACCATCGA GGCGCTGGCG CGGCATTACG GCTTCGATCC GAAGAAGAAA
TGGAAGGATC TTCCGTTCAA CGTGCAATCC GTCTTCCTGC GCGGCTCGGG CGAGGAGGAG
ATCACCTTCC GCTACGACGA GGGCGGGCGG ATCTATCAGG TCAGCCGCAG CTTCGAGGGC
GTGATCCCGA ACATGGAGCG CCGCTACCGC GAGACCGACT CGGCCTGGGT GCGCGAGGAA
TTCGAGCGCT ACCAGAACAA CCGGCCCTGC CATGTCTGCG GCGGCTACCG GCTGAAGCCC
GAGGCGCTAG CGGTGAAGAT CGGCGGCCTG CATATCGGGC AGGTGGTGCA GATGTCGATC
AAGGAGGCCT TCGCCTGGAT CGAGACAGTG CCGGGCCATC TGACTGCGCA GAAGAACGAG
ATCGCGCGCG CGATCCTCAA GGAGATCCGC GAGCGGCTGG GCTTCCTCGT CAATGTGGGG
CTCGACTATC TGTCGATGAG CCGCGCGGCC GGCACGCTCT CGGGCGGGGA AAGCCAGCGG
ATCCGGCTCG CCTCCCAGAT CGGCTCGGGG CTGACGGGCG TGCTCTATGT GCTCGACGAG
CCCTCGATCG GGCTGCACCA GCGCGACAAC GACCGCCTGC TGACCACGCT GAAGAACCTG
CGCGACCAGG GCAATTCGGT GCTGGTGGTG GAGCATGACG AGGATGCGAT CCGCGAGGCG
GATTATGTGT TCGACGTGGG CCCGGGCGCG GGCGTCCATG GCGGGCAGGT GGTGGCGCAC
GGCACGCCCG CCGAGATCGC CGCCGATCCG GCGAGCCTCA CCGGTCAGTA TCTCTCGGGC
ACACGCGAAA TCTCGGTGCC CGCCGAGCGG CGGACGGGCA ACGGCAAGTC CCTGACGGTG
GTGAAGGCCA GCGGCAACAA CCTCCACGAT GTGACGGTGG ACTTCCCCCT GGGCAAGTTC
GTTTGCGTGA CCGGCGTCTC GGGCGGCGGC AAGTCGACGC TGACCATCGA GACGCTCTAC
AAGACGGCGG CGATGCGGCT GAACGGGGCG CGCGAGACGC CGGCGCCCTG CGAGACGATC
AAGGGCTTCG AGCAGCTCGA CAAGGTCATC GACATCGACC AGCGGCCGAT CGGGCGCACG
CCGCGCTCGA ACCCCGCGAC CTACACCGGG GCCTTCACGC CGATCCGCGA CTGGTTCGCG
GGCCTGCCCG AGTCGAAGGC GCGCGGCTAC CAGCCGGGCC GGTTCTCGTT CAACGTGAAG
GGCGGGCGCT GCGAGGCCTG CCAGGGCGAT GGCGTCATCA AGATCGAGAT GCACTTCCTG
CCCGACGTCT ATGTGACCTG CGAGACCTGC AAGGGCCACC GCTACAACCG CGAGACGCTG
GAGATCAAGT TCAAGGGCAA GAGCATCGCC GACGTGCTCG AGATGACGGT CGAGGATGCG
CAGGAGTTCT TCCAGGCGGT GCCCTCGATC CGCGAGAAGA TGGACGCGCT GATGCGGGTG
GGCCTCGGCT ACATCAAGGT GGGCCAGCAG GCGACGACGC TCTCGGGCGG CGAGGCGCAG
CGGGTGAAAC TCTCCAAGGA GCTGAGCCGC CGCGCCACGG GACGCACGCT CTACATTCTC
GATGAGCCGA CGACGGGGCT GCATTTCGAG GATGTGAAAA AGCTGCTCGA GGTGCTGCAC
GAGCTGGTGG AGCAGGGCAA CACGGTGGTG GTGATCGAGC ACAATCTCGA TGTGGTGAAG
ACCGCGGACT GGATCATCGA CATCGGCCCG GAAGGCGGCG ACGGCGGCGG CAAGATCGTG
GCCGAGGGAA CGCCCGAGGA GGTGGCCAAG GTCGAGGCCT CTCACACCGG CCGCTACCTC
CGCGACATGC TGAAACCCCG ACGTCTGGCC GCGGAATAG
 
Protein sequence
MAEQKFISVR GAREHNLKGI DVDIPRDQLV VITGLSGSGK SSLAFDTIYA EGQRRYVESL 
SAYARQFLDM MGKPDVDHIS GLSPAISIEQ KTTSKNPRST VGTVTEIYDY MRLLWARVGT
PYSPATGLPI AAQQVQDMVD AVMAMPEGAR GYLLAPIVRD RKGEYKKEFI ELRKQGFQRV
KVNGTFHELE EPPTLDKKFR HDIDVVVDRI VVREGIETRL ADSFRTALNL ADGIAIFESA
PAEGEPVRTT FSEKFACPVS GFTIPEIEPR LFSFNAPFGA CPECDGLGME LFFDERLVVP
DQGLTLKQGA IAPWAKSKSP YYTQTIEALA RHYGFDPKKK WKDLPFNVQS VFLRGSGEEE
ITFRYDEGGR IYQVSRSFEG VIPNMERRYR ETDSAWVREE FERYQNNRPC HVCGGYRLKP
EALAVKIGGL HIGQVVQMSI KEAFAWIETV PGHLTAQKNE IARAILKEIR ERLGFLVNVG
LDYLSMSRAA GTLSGGESQR IRLASQIGSG LTGVLYVLDE PSIGLHQRDN DRLLTTLKNL
RDQGNSVLVV EHDEDAIREA DYVFDVGPGA GVHGGQVVAH GTPAEIAADP ASLTGQYLSG
TREISVPAER RTGNGKSLTV VKASGNNLHD VTVDFPLGKF VCVTGVSGGG KSTLTIETLY
KTAAMRLNGA RETPAPCETI KGFEQLDKVI DIDQRPIGRT PRSNPATYTG AFTPIRDWFA
GLPESKARGY QPGRFSFNVK GGRCEACQGD GVIKIEMHFL PDVYVTCETC KGHRYNRETL
EIKFKGKSIA DVLEMTVEDA QEFFQAVPSI REKMDALMRV GLGYIKVGQQ ATTLSGGEAQ
RVKLSKELSR RATGRTLYIL DEPTTGLHFE DVKKLLEVLH ELVEQGNTVV VIEHNLDVVK
TADWIIDIGP EGGDGGGKIV AEGTPEEVAK VEASHTGRYL RDMLKPRRLA AE