Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2199 |
Symbol | |
ID | 6980938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2253548 |
End bp | 2256520 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396917 |
Product | excinuclease ABC subunit B |
Protein accession | YP_002281705 |
Protein GI | 209549788 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.166657 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAT CTCCGAAGAA ATCCCCCGCC CCGAATGGCT TCGAGGAAGC CCCGCAGTCG TCCTTCGAGG GCGCTCCCCT GTCCGGCTCC GTTACCGATT GGGTGAAGCA GCTGGAGGCC GATGCCGAAG CGTCGGGCGT CGAGACCCAG CGCCAGATCG CCTCCAAGGC CGGCAAGCAC CGCAAGAAGG TGGAAATCGC GGCTTCGAAA TCGGCGCGTG GCACTTCGAT GGGCGGCTCG ACCGACCCGA AGACGCGCGC GGCCGCAGGC CTCAACCCCG TCGCCGGCAT GAATACGACG CTGGAGGAAG CCTCGTCGCT GCAGGCCGGC ACCGCCGTCA CCGCCACCGT CGAAGCGCTG TCGGCGCTGA TCGAGAGCGG CAACCCGCTG CACAAGAACG GCAAGATCTG GACGCCGCAC CGCCCGGCCC GGCCCGACAA GTCCGAAGGT GGCATCCGCA TCCTGATGAA GTCGGATTAT GAGCCGGCCG GCGACCAGCC GACGGCCATC CGCGATCTCG TCGAGGGACT GGAGAATGGC GACCGCAGCC AGGTGCTGCT CGGCGTTACC GGCTCCGGCA AGACCTTCAC CATGGCCAAG GTGATCGAGG CGACGCAGCG CCCGGCCGTC ATCCTGGCGC CGAACAAGAC GCTGGCCGCC CAGCTCTATT CGGAATTCAA GAATTTCTTC CCCGACAATG CGGTGGAATA TTTCGTTTCC TACTACGATT ATTATCAGCC GGAAGCCTAT GTGCCGCGCT CCGACACCTA TATCGAGAAG GAAAGCTCGA TCAACGAGCA GATCGACCGC ATGCGCCACT CGGCGACGCG CTCGCTGCTC GAACGTGACG ACTGCATCAT CGTCGCCTCG GTCTCCTGCA TCTACGGTAT CGGCTCGGTC GAAACCTATA CGGCGATGAC CTTCCAGATG TCGGTCGGCG ACCGGCTCGA CCAGCGCCAG TTGCTGGCCG ACCTTGTCGC CCAGCAATAT AAGCGCCGCG ACATGGATTT CACCCGCGGT TCTTTCCGCG TGCGCGGCGA TACGATCGAG CTCTTCCCCG CCCACTTGGA GGATGCCGCC TGGCGCATCT CGATGTTCGG TGACGAGATC GACGCCATCA CCGAGTTCGA TCCGCTCACC GGCCAGAAGG TCGGTGATCT GAAATCGGTG AAAATCTACG CCAATTCGCA CTATGTCACG CCGCGCCCGA CGCTGAACGG CGCCATCAAA TCGATCAAGG AAGAGCTCAG GCTTCGCCTC GCCGAGCTGG AGAAGGCCGG CCGCCTGCTG GAGGCCCAGC GCCTGGAGCA GCGCACCCGC TACGACATCG AAATGCTCGA AGCCACCGGC TCCTGCCAGG GCATCGAGAA TTATTCGCGT TATCTCACCG GCCGCGACCC CGGCGATCCC CCGCCGACGC TGTTCGAGTA TATCCCCGAT AACGCCCTCG TTTTCATCGA CGAGAGCCAT GTCACCGTGC CGCAGATCGG CGGCATGTAC CGGGGCGACT TCCGCCGTAA GGCGACGCTG GCCGAATACG GCTTCCGCCT GCCCTCCTGT ATGGATAACC GGCCGCTGCG CTTCGAGGAA TGGGACGCCA TGCGCCCCGA CACCATTGCC GTCTCGGCCA CGCCAGGCGG CTGGGAGATG GAGCAGTCGG GCGGCGTCTT CGCCGAACAG GTGATCCGGC CCACGGGCCT TATCGACCCG CCGGTCGAGG TCCGCTCGGC CCGCACCCAG GTCGACGACG TGCTCGGCGA GATCCGCGAA ACCGCCGCCA AGGGCTACCG CACCCTTTGC ACCGTGCTGA CCAAGCGCAT GGCCGAAGAC CTGACCGAAT ATCTGCATGA GCAGGGCGTG CGCGTCCGCT ACATGCACTC CGACATCGAC ACGCTTGAAC GTATCGAGAT CCTCCGCGAT CTTCGTCTGG GTGCTTTCGA CGTGCTCGTC GGCATCAACC TGCTGCGCGA GGGTCTCGAC ATTCCCGAAT GCGGCTTCGT CGCCATCCTC GACGCCGACA AGGAAGGCTT CCTGCGCTCG GAGACCTCGC TGATCCAGAC GATCGGCCGC GCCGCGCGCA ACGTCGACGG CAAGGTCATT CTCTATGCCG ACCAGGTCAC CGGCTCGATG AAGCGGGCGA TGGAGGAAAC CGGCCGCCGC CGCGAAAAGC AGATGATCTA CAATCAGGAA CACGGCATCA CCCCTGAATC CGTCAAGGCC AGGATCTCCG ACATCCTCGA CAGCGTCTAC GAACGCGACC ACGTCCGCGC CGATATCTCG GGCGTCTCGG GCAAGGGCTT TGCCGATGGC GGCAACCTGG TCGGCAACAA CCTCCAGACC CATCTCAACG CGCTCGAAAA AAGCATGCGT GACGCCGCCG CCGACCTCGA CTTCGAAAAA GCCGCCCGCC TCCGCGACGA AATCAAACGC CTCAAGGCCG CCGAACTGGC CGTCATGGAT GATCCGATGG CACGCGAAGA GTCGAAGGCA ATGGAAGGTC GCGGCAAGAA GAAGACGGGC GCGGCAAACG CAACCGGCTC CCTCCCCCCT GTGGGGAGGG TTGGGGAGGG GAAGGTCACA GATGCAGGGT CCGCCTCCTA CTTCTCCAGA CCCAGCCTCG ATGACATGGG CCCGGGCACC GACACCGCCA CCCCGCTCTT CCGAAAACCC GCCCTCGACG AGATGGGCCG CGACCCCACC ACCCCCGCTG GCAAGAGCCT CTTCCGCAAG AACGACCTCG ACGAGATGAC GGTTGGGCGA ACGGAAAAAC CGGTGGTCGG CCATGTGCCG GAAAAGCCGG AAGCCTCCAA GGGCACAAAG CGATTTTCGC CGTTGCTTGA AGGGCAACCG GAACGCGATG ACGATGTGCG GCCGGTGGTG CGCGGCAAGG CGGGTGTCGG CAGCTATGAG GATCCGGGCG AGCAGAAGCG CAAGGGCCGG ACGAAGGGCA AGACTGGGCG GCCGGGGCGG TGA
|
Protein sequence | MAKSPKKSPA PNGFEEAPQS SFEGAPLSGS VTDWVKQLEA DAEASGVETQ RQIASKAGKH RKKVEIAASK SARGTSMGGS TDPKTRAAAG LNPVAGMNTT LEEASSLQAG TAVTATVEAL SALIESGNPL HKNGKIWTPH RPARPDKSEG GIRILMKSDY EPAGDQPTAI RDLVEGLENG DRSQVLLGVT GSGKTFTMAK VIEATQRPAV ILAPNKTLAA QLYSEFKNFF PDNAVEYFVS YYDYYQPEAY VPRSDTYIEK ESSINEQIDR MRHSATRSLL ERDDCIIVAS VSCIYGIGSV ETYTAMTFQM SVGDRLDQRQ LLADLVAQQY KRRDMDFTRG SFRVRGDTIE LFPAHLEDAA WRISMFGDEI DAITEFDPLT GQKVGDLKSV KIYANSHYVT PRPTLNGAIK SIKEELRLRL AELEKAGRLL EAQRLEQRTR YDIEMLEATG SCQGIENYSR YLTGRDPGDP PPTLFEYIPD NALVFIDESH VTVPQIGGMY RGDFRRKATL AEYGFRLPSC MDNRPLRFEE WDAMRPDTIA VSATPGGWEM EQSGGVFAEQ VIRPTGLIDP PVEVRSARTQ VDDVLGEIRE TAAKGYRTLC TVLTKRMAED LTEYLHEQGV RVRYMHSDID TLERIEILRD LRLGAFDVLV GINLLREGLD IPECGFVAIL DADKEGFLRS ETSLIQTIGR AARNVDGKVI LYADQVTGSM KRAMEETGRR REKQMIYNQE HGITPESVKA RISDILDSVY ERDHVRADIS GVSGKGFADG GNLVGNNLQT HLNALEKSMR DAAADLDFEK AARLRDEIKR LKAAELAVMD DPMAREESKA MEGRGKKKTG AANATGSLPP VGRVGEGKVT DAGSASYFSR PSLDDMGPGT DTATPLFRKP ALDEMGRDPT TPAGKSLFRK NDLDEMTVGR TEKPVVGHVP EKPEASKGTK RFSPLLEGQP ERDDDVRPVV RGKAGVGSYE DPGEQKRKGR TKGKTGRPGR
|
| |