Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1741 |
Symbol | uvrA |
ID | 6980478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1781607 |
End bp | 1784528 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396464 |
Product | excinuclease ABC subunit A |
Protein accession | YP_002281254 |
Protein GI | 209549337 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.806378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAC TGAAGACGAT CTCCATCCGC GGCGCGCGCG AGCACAATCT CAAGAGCATC GATCTCGATC TGCCGCGCAA CAAGCTGATC GTCATGACCG GGCTTTCGGG CTCCGGCAAG TCCTCGCTTG CCTTCGACAC GATCTATGCC GAAGGCCAGC GCCGTTATGT CGAGAGCCTG TCGGCCTATG CCCGGCAGTT CCTCGAAATG ATGCAAAAGC CCGACGTCGA CCAGATCGAC GGGCTGTCGC CGGCGATCTC GATCGAGCAG AAGACCACCT CGCGCAACCC GCGCTCGACG GTCGGCACGG TCACCGAGAT CTACGACTAT ATGCGCCTGC TCTTTGCCCG CGTCGGCGTT CCCTATTCGC CGGCGACCGG CCTGCCGATC GAGAGCCAGA CGGTCAGCCA GATGGTCGAC CGCATCCTCG ATTTCGGCGA GGGCACCCGT CTTTATATTC TCGCGCCGCT CGTGCGCGGG CGCAAGGGCG AATACAAGAA GGAACTGGCG GAGCTGATGA AGAAGGGCTT CCAGCGCGTC AAAGTCGACG GGCAGTTCTA CGAGATCGCT GAGGCGCCGG TACTCGACAA GAAATACAAG CACGACATCG ATGTCGTGGT CGACCGCATC GTCGTGCGCT CGGATGTCTC GGCCCGCCTG GCCGACAGCC TGGAAACCTG CCTGAAGCTC GCCGACGGGC TGGCGGTTGC CGAATTTGCC GACAAGCCGC TGCCGCCGGA AGAGACCTCG GCCGGCGGCT CGGCCAACAA GTCGCTGAAC GAGACGCATG AGCGCGTACT GTTTTCGGAG AAATTCGCCT GCCCGGTTTC CGGCTTCACC ATTCCCGAGA TCGAGCCCAG GCTGTTCTCC TTCAACAATC CCTTCGGCGC CTGCCCGACC TGCGACGGCC TCGGCGCCCA GCAGAAGATC GATCCGGATC TGATCGTGCC CGAGCCCGAA CGGACGTTGC GCGACGGCGC GATCGCTCCC TGGGCCAAGT CGACCTCGCC CTATTACAAC CAGACGCTCG AGGCGCTCGG CAAACATTAC GGCTTCAAGC TCGGCACCCG CTGGAACGAT CTTTCCGACG AGGCCAAGGA CGTCATCCTC AACGGCACCG AGGACAAGAT CGAATTTCAT TATGCCGACG GCGCCCGCTC CTATACGACG CAGAAGAATT TCGAGGGCAT CATCACCAAT CTCGAGCGCC GCTGGAAGGA GACCGATTCC GCCTGGGCGC GCGAGGATAT CGAGCGCTTC ATGTCGGCAG CCCCCTGCCC TGTTTGTAAC GGCTTCCGCC TGAAGCCGGA AGCGCTGGCG GTGAAGATCA ACACGCTGCA TATCGGTGAG GTCACCGGCA TGTCGATCCG CGTCGCCCGC GACTGGTTCG AGACGCTGCC GGCAAGCTTC AACGCCAAGC AGAACGAAAT CGCTGTGCGC ATCCTCAAGG AAATCCGCGA CCGGCTGCGC TTCCTCAACG ATGTCGGCCT GGAATATCTG AGCCTGTCGC GCAACTCCGG CACGCTGTCG GGCGGCGAAA GCCAGCGTAT CCGGCTGGCC TCGCAGATCG GCTCGGGCTT GACGGGCGTG CTCTACGTGC TCGACGAGCC GTCGATTGGC CTGCATCAGC GCGACAATGC CCGGCTGCTC GACACCCTGA AGCACCTGCG CGACATCGGC AACACCGTCA TCGTCGTCGA ACATGACGAG GATGCGATCA TGACGGCCGA CGACGTGGTC GATATCGGCC CCGCCGCCGG CATTCACGGC GGCCAGGTCA TCGCCCACGG TACGCCGCAG GATATTATGG ACAATCCGCA GTCGCTGACC GGCAAATACC TGTCCGGCGA GCTCGGCGTT CCCGTTCCCC ACGAGCGCCG CAAGCAGAAG AAAGGCCGCG AGATCAAGGT GGTCGGGGCG CGCGGCAACA ATCTGAAGAA CGTCACGGCG GCAATTCCGC TCGGCGTGTT CACGGCGGTG ACCGGCGTTT CCGGCGGCGG CAAATCCACC TTCCTGATCG AGACGCTGTA TAAATCGGCC GCAAGGCGGG TCATGGGCGC ACGCGAAAAC CCCGCCGATC ACGACCGCAT CGACGGCTTC GAGCATATCG ACAAGGTTAT CGACATCGAC CAGTCGCCGA TCGGCCGCAC GCCGCGCTCG AACCCGGCGA CCTATACCGG TGCCTTTACA CCGATCCGCG ACTGGTTCGC CGGCCTGCCG GAAGCAAAAG CGCGCGGCTA CCAGCCGGGC CGCTTCTCCT TCAACGTCAA GGGCGGGCGC TGCGAGGCCT GCCAGGGCGA TGGTGTCATC AAGATCGAAA TGCACTTCCT GCCCGATGTC TACGTCACCT GCGACGTCTG CCACGGAAAA CGATACAATC GCGAGACGCT CGACGTCACC TTCAAGCAGA AGTCGATTGC CGATGTGCTC GACATGACGG TGGAGGAAGG TGTCGATTTC TTCGCGGCAG TACCCGCCGT GCGCGACAAG CTGCAGGCGC TGAAGGATGT CGGACTCGGT TACATCAAGG TCGGCCAGCA GGCGAACACA CTTTCCGGCG GCGAAGCGCA GCGCGTCAAG CTCGCCAAGG AACTGTCGAA ACGCTCGACG GGGCGCACGC TCTATATTCT CGACGAACCG ACGACCGGCC TGCATTTCCA CGACGTGGCC AAGCTGCTCG AAATGCTGCA CGAACTGGTC AACCAGGGCA ATTCCGTGGT GGTGATCGAG CACAATCTCG AAGTCATCAA GACGGCCGAC TGGGTGCTCG ATTTCGGCCC CGAAGGCGGC GATGGCGGCG GCGAGATCGT GGCGTTCGGC ACGCCGGAGG CAATCGTCAA GGAGAAGCGC TCCTATACCG GACAGTTCCT CAAGGAATTG CTGGAGCGGC GGCCGGCAAA GAGGGCGGCT GCAGCGGAAT GA
|
Protein sequence | MSELKTISIR GAREHNLKSI DLDLPRNKLI VMTGLSGSGK SSLAFDTIYA EGQRRYVESL SAYARQFLEM MQKPDVDQID GLSPAISIEQ KTTSRNPRST VGTVTEIYDY MRLLFARVGV PYSPATGLPI ESQTVSQMVD RILDFGEGTR LYILAPLVRG RKGEYKKELA ELMKKGFQRV KVDGQFYEIA EAPVLDKKYK HDIDVVVDRI VVRSDVSARL ADSLETCLKL ADGLAVAEFA DKPLPPEETS AGGSANKSLN ETHERVLFSE KFACPVSGFT IPEIEPRLFS FNNPFGACPT CDGLGAQQKI DPDLIVPEPE RTLRDGAIAP WAKSTSPYYN QTLEALGKHY GFKLGTRWND LSDEAKDVIL NGTEDKIEFH YADGARSYTT QKNFEGIITN LERRWKETDS AWAREDIERF MSAAPCPVCN GFRLKPEALA VKINTLHIGE VTGMSIRVAR DWFETLPASF NAKQNEIAVR ILKEIRDRLR FLNDVGLEYL SLSRNSGTLS GGESQRIRLA SQIGSGLTGV LYVLDEPSIG LHQRDNARLL DTLKHLRDIG NTVIVVEHDE DAIMTADDVV DIGPAAGIHG GQVIAHGTPQ DIMDNPQSLT GKYLSGELGV PVPHERRKQK KGREIKVVGA RGNNLKNVTA AIPLGVFTAV TGVSGGGKST FLIETLYKSA ARRVMGAREN PADHDRIDGF EHIDKVIDID QSPIGRTPRS NPATYTGAFT PIRDWFAGLP EAKARGYQPG RFSFNVKGGR CEACQGDGVI KIEMHFLPDV YVTCDVCHGK RYNRETLDVT FKQKSIADVL DMTVEEGVDF FAAVPAVRDK LQALKDVGLG YIKVGQQANT LSGGEAQRVK LAKELSKRST GRTLYILDEP TTGLHFHDVA KLLEMLHELV NQGNSVVVIE HNLEVIKTAD WVLDFGPEGG DGGGEIVAFG TPEAIVKEKR SYTGQFLKEL LERRPAKRAA AAE
|
| |