Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3568 |
Symbol | |
ID | 6982329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3695964 |
End bp | 3698657 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398293 |
Product | hypothetical protein |
Protein accession | YP_002283061 |
Protein GI | 209551144 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR02302] conserved hypothetical protein TIGR02302 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.159826 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGCC CCTCAAGGCA GAGGAAAGGT GCGTTTGCGC TCCGCCCCTC GCTTGCCCGG CTGGTGACGA CAAAACGCCT GCTGGCGCGC GTGGTGCTGT TTTTCGAGCA GTTGCTTCCG CCCTTGATGC CGGTCCTGGC AGTCATCGCC CTTTATCTCG CAGCCTCCTG GTTCGGCCTC TTCCGCAGTG TGCCGGACTG GCTGCGCATC CTGCTGCTGA TCGCCTTTGT CGCCGCTTTT CTCGTCTCGC TGCTGCCCTT GCGCAATCTG CGCTGGCCGG GGGTCGCCGA AGCCGACCGC ATGCTGGAAG AGCGCAACGG CCTGCCGCAT CAGCCGGTTA CCGTCCAGGA AGACGAGCCG GCCTTCGATA CGCCCTTCGC CCGGGCGCTC TGGCGCGAGC ACCAGACCCG CATGGCCGAA AAGATCGCTG CCCTCGATGC CGGACTGCCG CGGCCGGATA TCGCCGCGCA TGACCGTTTT GCGCTGCGCG CTGTACCGGC GCTGCTGCTG GTCACCGCCT TTGCCTATTC GCTGTCGATC AACGGCGGCT CGCTCGGTGA CGCGTTTCAG TCAGCGCCCG AGCAGGTGGT CGTCGATCCG GCGGTGCGCA TCGACGCCTG GGTGACGCCG CCCTCCTATA CCGGCCGCGC CCCGGTCTAT CTCACCGCTG ACGGCAGCGA GCAGGCGCCG ATCGGCATTC CGCAGTTTTC GGGCCTCACC GTTCGTGTCA GCGGCGGAAA GACCGCCGAA AAGGTCGTGT TCCGCAAGGC GAACGGCCAG GCGCAGGATA TCGCCGTGCA GGCGGATACC AAGCCGCAAC AGGTGGCGTC AGGCAGCGAA CAGCCGAAGA CGGCCCCGAC CAGCCAGGCA TCCACCGGCC AAGCTCCCGC TGACCAGGCC GCCACCAGCC AGGCACTTGT GGCGCAGACG CATGTGATGA AGCTCGAAGA AAATGGCGCC CTCGAAGTCA ACGGCCGCCG CTGGAGCTTC GATGTCCTCC CCGACAAGGC GCCGGAGATC GCCTTCGACG GTTTGCCGAA GCCCAGCGTC AACGGCGCGC TTGAAATCGG CTTTACCGTC AAGGACGATT ACGGCGTCCA GGAAGCGCAT GCGGAGATCG TTCCTCTGGA AAACGATCCG ACGGCGACGC CGCTCTATCC GCTGCCGGAA TACCGGCTGG ATATTCCCCG CCGCAACGCC CGCGACGCCA AGGGCGTGAC CAGCCGCAAC CTGACCGAAC ATCCGCTTGC CGGCAAGCGC GTGCGCGTGA CGCTCGTCGC CAAGGATGGC GCCGGCCAGA CCGGCCGCAG CCCGCCGCAT GAAATGATCC TGCCGTCGCG GCCCTTCAAC GAGCCGCTTG CTGCGGCCGT CGCCGAGGAG CGGCAGGTTT TCGCGCTCGA CACCCGCAAG ATGCCGCAGG CGATCGCGCT GAACGAGGCG CTGACCATCC GGCCTGAGGA GACCATTCCC AAGCTCACCA ATTATCTGCT GCTCGAATCC GCCTTGACGC GCATGAAGCT TGCAAAGGGT GACGAGGCAC TGAAGGACAC GGCCCAGTAT CTCTGGGAGA TCGCACTCGG CATGGAGGAC GGCGATCTTT CGCTTGCCGA GCGCAAGCTG CGCGAGGCCC AGCAGAAACT TGCCGACGCA TTGGACCGCA ACGCGCCGGA CGAGGAGATC AAGAAGCTGA TGGATGAGCT GCGCAAGGCA ATGCAGGACT ATATGACCGA GCTTGCCCAA CGCATGCAGA ACGCGCCGAT GCAGCCGAAT CAGAACGCCC AGAACATCCT GCGCCAGCAG GATCTGGAAC GGATGATGGA CCAGATCGAA AATCTCGCCC GCTCCGGCAA TCGCGACGCA GCCCAGCAGA TGCTGTCGGA ATTGCAGCGC ATGATGAACA ACCTGCAGGC CGGCAGGCCG CAGCGCGGCC AGCAGAGCCA GGAAAACAGC GAAGCCCGCA AACAGATCGA CAAGCTCGGC GAGATCCTGC GCGACCAGCA GAAGCTGATG GAACAGACCT TCCGTCTCGA TCAGCAGCTG AGGGACCGCA TGCAGCGCGG CGAACCCGAC ATGGGCGAAA ACGATCCGCT GCTCGACGAG ATGAACCCTG GCGAGAACGG CGAGCCGCAG GATCAGCAGC AGGGCCAGCA AGGCAAGGAA GGCCAGCAGC CCTCCGACCA GATGACCGCC GAGCAGCTGC GCGAGGCGCT GAAACAGCTG CGCGCCCAGC AGGATGCGCT CGGCAAGCAG CTCGGCGAAT TGCAGAAGAG CCTCGGCGAG TTGGGCATGA AACCCGGCCC CGGCTTCGGC CAGGCGCAAC GCGAGATGGA AGGTGCAGGC CGTGAGCTCG GCCAGGGCCG CGGCCAGCCG GCCATCGAGG GCCAGGGCCG CGCGCTCGAA GCGCTTCGCC AAGGCGCCCG CGATATGATG AACCAGATGA TGCAGGCCCA ACAGGGCCAG CAAGGCCAGG GTCCCAACGG TCAGGTCGGT CAGGGGGATC AAAACGGCCG CGACCCACTC GGCCGCCCGC GCCGCGTCCA GGGGCCGGAT TTCGGCGACG ACGTGAAAGT GCCCGACGAG ATCGACGTTC AGCGTGCCCG GGAAATCCTC GACGCGATCC GCGAAAAGCT CGGCAACAAT CCGCCGCAGG AAATGGAACG GCGGTATCTC GAACGGTTGC TGGACATTCA GTAG
|
Protein sequence | MTSPSRQRKG AFALRPSLAR LVTTKRLLAR VVLFFEQLLP PLMPVLAVIA LYLAASWFGL FRSVPDWLRI LLLIAFVAAF LVSLLPLRNL RWPGVAEADR MLEERNGLPH QPVTVQEDEP AFDTPFARAL WREHQTRMAE KIAALDAGLP RPDIAAHDRF ALRAVPALLL VTAFAYSLSI NGGSLGDAFQ SAPEQVVVDP AVRIDAWVTP PSYTGRAPVY LTADGSEQAP IGIPQFSGLT VRVSGGKTAE KVVFRKANGQ AQDIAVQADT KPQQVASGSE QPKTAPTSQA STGQAPADQA ATSQALVAQT HVMKLEENGA LEVNGRRWSF DVLPDKAPEI AFDGLPKPSV NGALEIGFTV KDDYGVQEAH AEIVPLENDP TATPLYPLPE YRLDIPRRNA RDAKGVTSRN LTEHPLAGKR VRVTLVAKDG AGQTGRSPPH EMILPSRPFN EPLAAAVAEE RQVFALDTRK MPQAIALNEA LTIRPEETIP KLTNYLLLES ALTRMKLAKG DEALKDTAQY LWEIALGMED GDLSLAERKL REAQQKLADA LDRNAPDEEI KKLMDELRKA MQDYMTELAQ RMQNAPMQPN QNAQNILRQQ DLERMMDQIE NLARSGNRDA AQQMLSELQR MMNNLQAGRP QRGQQSQENS EARKQIDKLG EILRDQQKLM EQTFRLDQQL RDRMQRGEPD MGENDPLLDE MNPGENGEPQ DQQQGQQGKE GQQPSDQMTA EQLREALKQL RAQQDALGKQ LGELQKSLGE LGMKPGPGFG QAQREMEGAG RELGQGRGQP AIEGQGRALE ALRQGARDMM NQMMQAQQGQ QGQGPNGQVG QGDQNGRDPL GRPRRVQGPD FGDDVKVPDE IDVQRAREIL DAIREKLGNN PPQEMERRYL ERLLDIQ
|
| |