Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4287 |
Symbol | |
ID | 8015065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4391774 |
End bp | 4394968 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644826857 |
Product | double-strand break repair protein AddB |
Protein accession | YP_002978066 |
Protein GI | 241206970 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3893] Inactivated superfamily I helicase |
TIGRFAM ID | [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.235496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.271483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAGC GGCACCAGCC ACGCATCGTG ACGATCCCCG CAGGCCTCTC CTTCCTGAAA ACGCTGGCGA CGACACTTTG CGACGGCCGA TTGACGCCGA TTTTCCGGCA CGACGCCGAT GATCCGCTAT CGCTTGCCAA AGTCACGATC TACCTGCCGA CCCGGCGCGC CGTGCGTGTG CTGCGTTCCG AATTCGTCGA CCTGCTCGGC GGCCGCTCGG CGATCCTCCC CATGATCCGC CCGCTCGGCG AAACCGATGA CGATAGCGGC TATTTCGACG AGGCGCTGCC GGCAACGATC GATCTCGCCC AGCCACTGTC GAATACCGCC CGTCTGCTGG AGCTTGCGCG CCTGATCCTC GCCTGGCGAA ACAAGCTGCC GGAGATCGTC CGCCACATCC ACTCGGACTC GCCGCTCGTC GCGCCGGCAA GCCCGGCGGA TGCGATCTGG CTCGCCCGCA ATCTTGCGGA ATTGATCGAT TCCATCGAGA CCGAGGATCT TGACTGGTCG GAGCTGTCGA AACTCGATAC CGGCGATTAT GCCGCCTGGT GGCAGTTGAC GGCGGAGTTC CTGCAGATCG CCAGCGCCTT CTGGCCCGAG CGGCTGGCCG AACTCGGCAA ATCCTCGCCG GCGCGGCACA GAAACGCCAT TCTCCGGGCC GAAGCAAGCC GGCTTTCGGC GACGAAACCC GCCGGGCCGA TCATCATCGC CGGTTCGACG GGTTCCGTTC CCGCCACCGC CGATCTCATT GCCGCCGTCG CCCATCTGCC GGAAGGCGTG ATCGTGCTTC CAGGTCTCGA TCTCTCCATG CCCGAAAGGC ACTGGCAGAT GGTCGCGCCG GAACCGGCGC CCGGCCAACA TGCCAATCCG GCAAGCCGGA GCCATCCGCA GTATGGTCTG TCTTCGCTGC TCAAGCGGCT GAAGCTGACG CGGGCCGACC TCACGCTCCT CGACAGACCG GAGGCCGATC TTGAGCGGCG TGCCGAAATC CTGTCGCAGG CCCTTGCTCC GGCGGAGGCG ACCAGTGACT GGGGAGCTTG GAAAACCGAC CTGCCGGCAG GCGCGCTGTC TTCGTCCTTC TCTGATGTCT CGCTGATTGA AGCCGCCAAT GAGCGCGAGG AAGCAACCGC GATTGCCATC GCGCTCCGGC TGGCGCTGGA AAGACCGGGA CAGGACAGCG AGAGCCGGGC AGCACTCATA ACCCCGGATC GCAATCTTGC GCGGCGGGTG ATGGCCGAGC TTTCCCGCTT CGGCATCCTC GCCGACGATT CGGCCGGTAC GCCGCTTTCG GCCATGCCGC AGGGCACCTT GCTGCAATTG CTGCTGGAGG CAGCGCTGCG GCCGGGCGAT CCGGTGGCGA TCATCTCGCT GCTCAAACAT CCGCTTGCCC GCTTCGGCCT GGAACGCGGC GCATTGATTT CCGCTACCGA GGCGCTCGAG CTGCTGGCAC TGCGCGGTGG CGTGGCAGAG GTGGATATCA GCACGCTGGA ACCGCTGCTC GCTCACCAAC TTGCCGAACA GGCCCTGGAC AGGCACGCGC CGCAATGGCG AAAAGCGCTT TCGCCCGAGG CCGCCGACGC CGCATACGAC CTTGCCCGGC GGGTGACACA AGCGACCGAG CCTCTGGCTT CGGCGCTGAT GCGGGAGCGG CCGGAAGATC GTGGAAGAAC AGCGCGCTTC ACATTGTCGG AATGGGCGAA GCGCACCGGC CGTTCGCTTG AAGCGGTCGC GGTTGATCCG CACGGCAATC TCGCCGATCT CTGGTCGAAC GAAGCGGGAG ACGCCCTCGC CGCCCTGCTT GGTGAAGTGA TCGACACGGA CGGCCAGATG GAGGCCGACG GACCGCAATG GATCGACATC ATGGCAGCCC TTGCCGCCGG TCATGCGGTG AAGCCACGGG CGCTCAGCCA TCCGAGGCTC TTCATCTTCG GCACGCTGGA AGCCCGCCTG CAGAGCGTCG ATACGTTGAT CCTCGGCGGC CTGAACGAAG GCACGTGGCC GGGACAGACC GCCAATAATC CCTTCATTCC GCGCATGATG AAGACGGAGA TCGGCCTCGA GCCTCCGGAG CGGCGCATCG GCCAGCTGGC GCATGATTTC GAGATGGCAA ACGGTACGCG CCATCTGATC TATTCGCGCG CGCTGCGCCA GGGCTCAACG CCAACCGTCG CCTCGCGTTG GCTGCAGCGG CTGCTGGCGC TCGGCGGAGA GGCGTTCGAA GCGGAATTGA AGGGACGCGG TAATCGGTTC CTCCAATGGG CCGCTCTCAT CGATCGAGGA GATGCTCAGG CGCCGGCGCA GCGCCCCTCG CCGAAACCGC CGCTGGCGCT GCAGCCGAAA TCCTATTCCT TCAGTGAAGT CGGCCGGCTG CGCCGAGATC CCTATGCCAT CTATGCCCGC CGTGTGTTGC GGCTCGACCC GGTCGAGCCG TTCAATCGCG ATCCAGGGGC CGCCGAACGC GGAACGCTCT ACCACAAGAT CATCGACCGT TTCATCCGCG AAGCCCATAT CGCCGGCACG CCGGATGCGG CAGCAGCGAT GGAGGCCATC CTTTCAGAGC TTTTCGACAT GGAAAAATTG CCGCCGCATA TCGATGCCGT GTGGCGGCCG CGCTTTCGCG CGGTGGCCCG CGCCTTCCTT GAATGGGAGG CTGGACGCCG GCATGGCATC CTGAAAACGC TGACGGAAGT ACGGGGCGGC ATGGAGCTGG AGCCGATCAA TATCCGGCTC ACCGGCGTCG CAGACCGGAT CGACGTCACG GGGCCACACT CGGCCGATAT CATCGACTAC AAGACCGGCT TCAATCCCTC GCCGGCGCAG GCACGCGTGC TTCTCGATCC ACAGCTTGCG CTTGAAGCGG CCGCCTTGAG CGCCGGTGCC TTCCGCGATG CCGGCAGCCT CATGCCGCAG GACCTTCTCT ATGTGCGTCT GCGTCCGGGA AGCCGCTTCC AGGTCGATAC CGTCAACAAT GAGAGTTCTG CCCGCAGCGA CAAGGCGAAA TCGGCGATGG ATCTCGCCGC CGAATCGATC GACCAGCTGG TCAAATTCGT GGGGTTGCTG CAATCCAACG AAAGAGGCTT TACCTCGCGG CTGATCCCGG CCCAGCAATT CGATTTCGGC GGCGACTACG ATCATCTCGC CCGCGTTTCC GAATGGTCGA CGGCCGAAAC CGAAGAGGGC GGCGGCGATG AGTGA
|
Protein sequence | MAERHQPRIV TIPAGLSFLK TLATTLCDGR LTPIFRHDAD DPLSLAKVTI YLPTRRAVRV LRSEFVDLLG GRSAILPMIR PLGETDDDSG YFDEALPATI DLAQPLSNTA RLLELARLIL AWRNKLPEIV RHIHSDSPLV APASPADAIW LARNLAELID SIETEDLDWS ELSKLDTGDY AAWWQLTAEF LQIASAFWPE RLAELGKSSP ARHRNAILRA EASRLSATKP AGPIIIAGST GSVPATADLI AAVAHLPEGV IVLPGLDLSM PERHWQMVAP EPAPGQHANP ASRSHPQYGL SSLLKRLKLT RADLTLLDRP EADLERRAEI LSQALAPAEA TSDWGAWKTD LPAGALSSSF SDVSLIEAAN EREEATAIAI ALRLALERPG QDSESRAALI TPDRNLARRV MAELSRFGIL ADDSAGTPLS AMPQGTLLQL LLEAALRPGD PVAIISLLKH PLARFGLERG ALISATEALE LLALRGGVAE VDISTLEPLL AHQLAEQALD RHAPQWRKAL SPEAADAAYD LARRVTQATE PLASALMRER PEDRGRTARF TLSEWAKRTG RSLEAVAVDP HGNLADLWSN EAGDALAALL GEVIDTDGQM EADGPQWIDI MAALAAGHAV KPRALSHPRL FIFGTLEARL QSVDTLILGG LNEGTWPGQT ANNPFIPRMM KTEIGLEPPE RRIGQLAHDF EMANGTRHLI YSRALRQGST PTVASRWLQR LLALGGEAFE AELKGRGNRF LQWAALIDRG DAQAPAQRPS PKPPLALQPK SYSFSEVGRL RRDPYAIYAR RVLRLDPVEP FNRDPGAAER GTLYHKIIDR FIREAHIAGT PDAAAAMEAI LSELFDMEKL PPHIDAVWRP RFRAVARAFL EWEAGRRHGI LKTLTEVRGG MELEPINIRL TGVADRIDVT GPHSADIIDY KTGFNPSPAQ ARVLLDPQLA LEAAALSAGA FRDAGSLMPQ DLLYVRLRPG SRFQVDTVNN ESSARSDKAK SAMDLAAESI DQLVKFVGLL QSNERGFTSR LIPAQQFDFG GDYDHLARVS EWSTAETEEG GGDE
|
| |