Gene Rleg2_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2199 
Symbol 
ID6980938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2253548 
End bp2256520 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content64% 
IMG OID643396917 
Productexcinuclease ABC subunit B 
Protein accessionYP_002281705 
Protein GI209549788 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.166657 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAAT CTCCGAAGAA ATCCCCCGCC CCGAATGGCT TCGAGGAAGC CCCGCAGTCG 
TCCTTCGAGG GCGCTCCCCT GTCCGGCTCC GTTACCGATT GGGTGAAGCA GCTGGAGGCC
GATGCCGAAG CGTCGGGCGT CGAGACCCAG CGCCAGATCG CCTCCAAGGC CGGCAAGCAC
CGCAAGAAGG TGGAAATCGC GGCTTCGAAA TCGGCGCGTG GCACTTCGAT GGGCGGCTCG
ACCGACCCGA AGACGCGCGC GGCCGCAGGC CTCAACCCCG TCGCCGGCAT GAATACGACG
CTGGAGGAAG CCTCGTCGCT GCAGGCCGGC ACCGCCGTCA CCGCCACCGT CGAAGCGCTG
TCGGCGCTGA TCGAGAGCGG CAACCCGCTG CACAAGAACG GCAAGATCTG GACGCCGCAC
CGCCCGGCCC GGCCCGACAA GTCCGAAGGT GGCATCCGCA TCCTGATGAA GTCGGATTAT
GAGCCGGCCG GCGACCAGCC GACGGCCATC CGCGATCTCG TCGAGGGACT GGAGAATGGC
GACCGCAGCC AGGTGCTGCT CGGCGTTACC GGCTCCGGCA AGACCTTCAC CATGGCCAAG
GTGATCGAGG CGACGCAGCG CCCGGCCGTC ATCCTGGCGC CGAACAAGAC GCTGGCCGCC
CAGCTCTATT CGGAATTCAA GAATTTCTTC CCCGACAATG CGGTGGAATA TTTCGTTTCC
TACTACGATT ATTATCAGCC GGAAGCCTAT GTGCCGCGCT CCGACACCTA TATCGAGAAG
GAAAGCTCGA TCAACGAGCA GATCGACCGC ATGCGCCACT CGGCGACGCG CTCGCTGCTC
GAACGTGACG ACTGCATCAT CGTCGCCTCG GTCTCCTGCA TCTACGGTAT CGGCTCGGTC
GAAACCTATA CGGCGATGAC CTTCCAGATG TCGGTCGGCG ACCGGCTCGA CCAGCGCCAG
TTGCTGGCCG ACCTTGTCGC CCAGCAATAT AAGCGCCGCG ACATGGATTT CACCCGCGGT
TCTTTCCGCG TGCGCGGCGA TACGATCGAG CTCTTCCCCG CCCACTTGGA GGATGCCGCC
TGGCGCATCT CGATGTTCGG TGACGAGATC GACGCCATCA CCGAGTTCGA TCCGCTCACC
GGCCAGAAGG TCGGTGATCT GAAATCGGTG AAAATCTACG CCAATTCGCA CTATGTCACG
CCGCGCCCGA CGCTGAACGG CGCCATCAAA TCGATCAAGG AAGAGCTCAG GCTTCGCCTC
GCCGAGCTGG AGAAGGCCGG CCGCCTGCTG GAGGCCCAGC GCCTGGAGCA GCGCACCCGC
TACGACATCG AAATGCTCGA AGCCACCGGC TCCTGCCAGG GCATCGAGAA TTATTCGCGT
TATCTCACCG GCCGCGACCC CGGCGATCCC CCGCCGACGC TGTTCGAGTA TATCCCCGAT
AACGCCCTCG TTTTCATCGA CGAGAGCCAT GTCACCGTGC CGCAGATCGG CGGCATGTAC
CGGGGCGACT TCCGCCGTAA GGCGACGCTG GCCGAATACG GCTTCCGCCT GCCCTCCTGT
ATGGATAACC GGCCGCTGCG CTTCGAGGAA TGGGACGCCA TGCGCCCCGA CACCATTGCC
GTCTCGGCCA CGCCAGGCGG CTGGGAGATG GAGCAGTCGG GCGGCGTCTT CGCCGAACAG
GTGATCCGGC CCACGGGCCT TATCGACCCG CCGGTCGAGG TCCGCTCGGC CCGCACCCAG
GTCGACGACG TGCTCGGCGA GATCCGCGAA ACCGCCGCCA AGGGCTACCG CACCCTTTGC
ACCGTGCTGA CCAAGCGCAT GGCCGAAGAC CTGACCGAAT ATCTGCATGA GCAGGGCGTG
CGCGTCCGCT ACATGCACTC CGACATCGAC ACGCTTGAAC GTATCGAGAT CCTCCGCGAT
CTTCGTCTGG GTGCTTTCGA CGTGCTCGTC GGCATCAACC TGCTGCGCGA GGGTCTCGAC
ATTCCCGAAT GCGGCTTCGT CGCCATCCTC GACGCCGACA AGGAAGGCTT CCTGCGCTCG
GAGACCTCGC TGATCCAGAC GATCGGCCGC GCCGCGCGCA ACGTCGACGG CAAGGTCATT
CTCTATGCCG ACCAGGTCAC CGGCTCGATG AAGCGGGCGA TGGAGGAAAC CGGCCGCCGC
CGCGAAAAGC AGATGATCTA CAATCAGGAA CACGGCATCA CCCCTGAATC CGTCAAGGCC
AGGATCTCCG ACATCCTCGA CAGCGTCTAC GAACGCGACC ACGTCCGCGC CGATATCTCG
GGCGTCTCGG GCAAGGGCTT TGCCGATGGC GGCAACCTGG TCGGCAACAA CCTCCAGACC
CATCTCAACG CGCTCGAAAA AAGCATGCGT GACGCCGCCG CCGACCTCGA CTTCGAAAAA
GCCGCCCGCC TCCGCGACGA AATCAAACGC CTCAAGGCCG CCGAACTGGC CGTCATGGAT
GATCCGATGG CACGCGAAGA GTCGAAGGCA ATGGAAGGTC GCGGCAAGAA GAAGACGGGC
GCGGCAAACG CAACCGGCTC CCTCCCCCCT GTGGGGAGGG TTGGGGAGGG GAAGGTCACA
GATGCAGGGT CCGCCTCCTA CTTCTCCAGA CCCAGCCTCG ATGACATGGG CCCGGGCACC
GACACCGCCA CCCCGCTCTT CCGAAAACCC GCCCTCGACG AGATGGGCCG CGACCCCACC
ACCCCCGCTG GCAAGAGCCT CTTCCGCAAG AACGACCTCG ACGAGATGAC GGTTGGGCGA
ACGGAAAAAC CGGTGGTCGG CCATGTGCCG GAAAAGCCGG AAGCCTCCAA GGGCACAAAG
CGATTTTCGC CGTTGCTTGA AGGGCAACCG GAACGCGATG ACGATGTGCG GCCGGTGGTG
CGCGGCAAGG CGGGTGTCGG CAGCTATGAG GATCCGGGCG AGCAGAAGCG CAAGGGCCGG
ACGAAGGGCA AGACTGGGCG GCCGGGGCGG TGA
 
Protein sequence
MAKSPKKSPA PNGFEEAPQS SFEGAPLSGS VTDWVKQLEA DAEASGVETQ RQIASKAGKH 
RKKVEIAASK SARGTSMGGS TDPKTRAAAG LNPVAGMNTT LEEASSLQAG TAVTATVEAL
SALIESGNPL HKNGKIWTPH RPARPDKSEG GIRILMKSDY EPAGDQPTAI RDLVEGLENG
DRSQVLLGVT GSGKTFTMAK VIEATQRPAV ILAPNKTLAA QLYSEFKNFF PDNAVEYFVS
YYDYYQPEAY VPRSDTYIEK ESSINEQIDR MRHSATRSLL ERDDCIIVAS VSCIYGIGSV
ETYTAMTFQM SVGDRLDQRQ LLADLVAQQY KRRDMDFTRG SFRVRGDTIE LFPAHLEDAA
WRISMFGDEI DAITEFDPLT GQKVGDLKSV KIYANSHYVT PRPTLNGAIK SIKEELRLRL
AELEKAGRLL EAQRLEQRTR YDIEMLEATG SCQGIENYSR YLTGRDPGDP PPTLFEYIPD
NALVFIDESH VTVPQIGGMY RGDFRRKATL AEYGFRLPSC MDNRPLRFEE WDAMRPDTIA
VSATPGGWEM EQSGGVFAEQ VIRPTGLIDP PVEVRSARTQ VDDVLGEIRE TAAKGYRTLC
TVLTKRMAED LTEYLHEQGV RVRYMHSDID TLERIEILRD LRLGAFDVLV GINLLREGLD
IPECGFVAIL DADKEGFLRS ETSLIQTIGR AARNVDGKVI LYADQVTGSM KRAMEETGRR
REKQMIYNQE HGITPESVKA RISDILDSVY ERDHVRADIS GVSGKGFADG GNLVGNNLQT
HLNALEKSMR DAAADLDFEK AARLRDEIKR LKAAELAVMD DPMAREESKA MEGRGKKKTG
AANATGSLPP VGRVGEGKVT DAGSASYFSR PSLDDMGPGT DTATPLFRKP ALDEMGRDPT
TPAGKSLFRK NDLDEMTVGR TEKPVVGHVP EKPEASKGTK RFSPLLEGQP ERDDDVRPVV
RGKAGVGSYE DPGEQKRKGR TKGKTGRPGR