Gene Rleg2_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1741 
SymboluvrA 
ID6980478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1781607 
End bp1784528 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content63% 
IMG OID643396464 
Productexcinuclease ABC subunit A 
Protein accessionYP_002281254 
Protein GI209549337 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.806378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC TGAAGACGAT CTCCATCCGC GGCGCGCGCG AGCACAATCT CAAGAGCATC 
GATCTCGATC TGCCGCGCAA CAAGCTGATC GTCATGACCG GGCTTTCGGG CTCCGGCAAG
TCCTCGCTTG CCTTCGACAC GATCTATGCC GAAGGCCAGC GCCGTTATGT CGAGAGCCTG
TCGGCCTATG CCCGGCAGTT CCTCGAAATG ATGCAAAAGC CCGACGTCGA CCAGATCGAC
GGGCTGTCGC CGGCGATCTC GATCGAGCAG AAGACCACCT CGCGCAACCC GCGCTCGACG
GTCGGCACGG TCACCGAGAT CTACGACTAT ATGCGCCTGC TCTTTGCCCG CGTCGGCGTT
CCCTATTCGC CGGCGACCGG CCTGCCGATC GAGAGCCAGA CGGTCAGCCA GATGGTCGAC
CGCATCCTCG ATTTCGGCGA GGGCACCCGT CTTTATATTC TCGCGCCGCT CGTGCGCGGG
CGCAAGGGCG AATACAAGAA GGAACTGGCG GAGCTGATGA AGAAGGGCTT CCAGCGCGTC
AAAGTCGACG GGCAGTTCTA CGAGATCGCT GAGGCGCCGG TACTCGACAA GAAATACAAG
CACGACATCG ATGTCGTGGT CGACCGCATC GTCGTGCGCT CGGATGTCTC GGCCCGCCTG
GCCGACAGCC TGGAAACCTG CCTGAAGCTC GCCGACGGGC TGGCGGTTGC CGAATTTGCC
GACAAGCCGC TGCCGCCGGA AGAGACCTCG GCCGGCGGCT CGGCCAACAA GTCGCTGAAC
GAGACGCATG AGCGCGTACT GTTTTCGGAG AAATTCGCCT GCCCGGTTTC CGGCTTCACC
ATTCCCGAGA TCGAGCCCAG GCTGTTCTCC TTCAACAATC CCTTCGGCGC CTGCCCGACC
TGCGACGGCC TCGGCGCCCA GCAGAAGATC GATCCGGATC TGATCGTGCC CGAGCCCGAA
CGGACGTTGC GCGACGGCGC GATCGCTCCC TGGGCCAAGT CGACCTCGCC CTATTACAAC
CAGACGCTCG AGGCGCTCGG CAAACATTAC GGCTTCAAGC TCGGCACCCG CTGGAACGAT
CTTTCCGACG AGGCCAAGGA CGTCATCCTC AACGGCACCG AGGACAAGAT CGAATTTCAT
TATGCCGACG GCGCCCGCTC CTATACGACG CAGAAGAATT TCGAGGGCAT CATCACCAAT
CTCGAGCGCC GCTGGAAGGA GACCGATTCC GCCTGGGCGC GCGAGGATAT CGAGCGCTTC
ATGTCGGCAG CCCCCTGCCC TGTTTGTAAC GGCTTCCGCC TGAAGCCGGA AGCGCTGGCG
GTGAAGATCA ACACGCTGCA TATCGGTGAG GTCACCGGCA TGTCGATCCG CGTCGCCCGC
GACTGGTTCG AGACGCTGCC GGCAAGCTTC AACGCCAAGC AGAACGAAAT CGCTGTGCGC
ATCCTCAAGG AAATCCGCGA CCGGCTGCGC TTCCTCAACG ATGTCGGCCT GGAATATCTG
AGCCTGTCGC GCAACTCCGG CACGCTGTCG GGCGGCGAAA GCCAGCGTAT CCGGCTGGCC
TCGCAGATCG GCTCGGGCTT GACGGGCGTG CTCTACGTGC TCGACGAGCC GTCGATTGGC
CTGCATCAGC GCGACAATGC CCGGCTGCTC GACACCCTGA AGCACCTGCG CGACATCGGC
AACACCGTCA TCGTCGTCGA ACATGACGAG GATGCGATCA TGACGGCCGA CGACGTGGTC
GATATCGGCC CCGCCGCCGG CATTCACGGC GGCCAGGTCA TCGCCCACGG TACGCCGCAG
GATATTATGG ACAATCCGCA GTCGCTGACC GGCAAATACC TGTCCGGCGA GCTCGGCGTT
CCCGTTCCCC ACGAGCGCCG CAAGCAGAAG AAAGGCCGCG AGATCAAGGT GGTCGGGGCG
CGCGGCAACA ATCTGAAGAA CGTCACGGCG GCAATTCCGC TCGGCGTGTT CACGGCGGTG
ACCGGCGTTT CCGGCGGCGG CAAATCCACC TTCCTGATCG AGACGCTGTA TAAATCGGCC
GCAAGGCGGG TCATGGGCGC ACGCGAAAAC CCCGCCGATC ACGACCGCAT CGACGGCTTC
GAGCATATCG ACAAGGTTAT CGACATCGAC CAGTCGCCGA TCGGCCGCAC GCCGCGCTCG
AACCCGGCGA CCTATACCGG TGCCTTTACA CCGATCCGCG ACTGGTTCGC CGGCCTGCCG
GAAGCAAAAG CGCGCGGCTA CCAGCCGGGC CGCTTCTCCT TCAACGTCAA GGGCGGGCGC
TGCGAGGCCT GCCAGGGCGA TGGTGTCATC AAGATCGAAA TGCACTTCCT GCCCGATGTC
TACGTCACCT GCGACGTCTG CCACGGAAAA CGATACAATC GCGAGACGCT CGACGTCACC
TTCAAGCAGA AGTCGATTGC CGATGTGCTC GACATGACGG TGGAGGAAGG TGTCGATTTC
TTCGCGGCAG TACCCGCCGT GCGCGACAAG CTGCAGGCGC TGAAGGATGT CGGACTCGGT
TACATCAAGG TCGGCCAGCA GGCGAACACA CTTTCCGGCG GCGAAGCGCA GCGCGTCAAG
CTCGCCAAGG AACTGTCGAA ACGCTCGACG GGGCGCACGC TCTATATTCT CGACGAACCG
ACGACCGGCC TGCATTTCCA CGACGTGGCC AAGCTGCTCG AAATGCTGCA CGAACTGGTC
AACCAGGGCA ATTCCGTGGT GGTGATCGAG CACAATCTCG AAGTCATCAA GACGGCCGAC
TGGGTGCTCG ATTTCGGCCC CGAAGGCGGC GATGGCGGCG GCGAGATCGT GGCGTTCGGC
ACGCCGGAGG CAATCGTCAA GGAGAAGCGC TCCTATACCG GACAGTTCCT CAAGGAATTG
CTGGAGCGGC GGCCGGCAAA GAGGGCGGCT GCAGCGGAAT GA
 
Protein sequence
MSELKTISIR GAREHNLKSI DLDLPRNKLI VMTGLSGSGK SSLAFDTIYA EGQRRYVESL 
SAYARQFLEM MQKPDVDQID GLSPAISIEQ KTTSRNPRST VGTVTEIYDY MRLLFARVGV
PYSPATGLPI ESQTVSQMVD RILDFGEGTR LYILAPLVRG RKGEYKKELA ELMKKGFQRV
KVDGQFYEIA EAPVLDKKYK HDIDVVVDRI VVRSDVSARL ADSLETCLKL ADGLAVAEFA
DKPLPPEETS AGGSANKSLN ETHERVLFSE KFACPVSGFT IPEIEPRLFS FNNPFGACPT
CDGLGAQQKI DPDLIVPEPE RTLRDGAIAP WAKSTSPYYN QTLEALGKHY GFKLGTRWND
LSDEAKDVIL NGTEDKIEFH YADGARSYTT QKNFEGIITN LERRWKETDS AWAREDIERF
MSAAPCPVCN GFRLKPEALA VKINTLHIGE VTGMSIRVAR DWFETLPASF NAKQNEIAVR
ILKEIRDRLR FLNDVGLEYL SLSRNSGTLS GGESQRIRLA SQIGSGLTGV LYVLDEPSIG
LHQRDNARLL DTLKHLRDIG NTVIVVEHDE DAIMTADDVV DIGPAAGIHG GQVIAHGTPQ
DIMDNPQSLT GKYLSGELGV PVPHERRKQK KGREIKVVGA RGNNLKNVTA AIPLGVFTAV
TGVSGGGKST FLIETLYKSA ARRVMGAREN PADHDRIDGF EHIDKVIDID QSPIGRTPRS
NPATYTGAFT PIRDWFAGLP EAKARGYQPG RFSFNVKGGR CEACQGDGVI KIEMHFLPDV
YVTCDVCHGK RYNRETLDVT FKQKSIADVL DMTVEEGVDF FAAVPAVRDK LQALKDVGLG
YIKVGQQANT LSGGEAQRVK LAKELSKRST GRTLYILDEP TTGLHFHDVA KLLEMLHELV
NQGNSVVVIE HNLEVIKTAD WVLDFGPEGG DGGGEIVAFG TPEAIVKEKR SYTGQFLKEL
LERRPAKRAA AAE