Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5159 |
Symbol | |
ID | 6978253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 796713 |
End bp | 799976 |
Gene Length | 3264 bp |
Protein Length | 1087 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643394287 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_002279105 |
Protein GI | 209547187 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.572913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.312122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTACG CCGAACTGCA GGTCACCACC CACTTCTCTT TCCTTCGCGG CGCAAGCTCG GCGGAGGAAC TCTTCGCCAC CGCCAAGCTG ATGGGCATCG AGGCGCTCGG CATCGTCGAC CGCAACAGCC TGGCCGGCAT CGTCCGGGCG CTCGAAGCCT CGCGTGCCAC CGGCCTGCGG CTGGTGGTTG GTTGCCGGCT CGATCTCGCT GACGGCATGT CGATGCTCGT CTATCCCACC GACCGCGCTG CCTATTCGCG GCTGACACGG CTGCTGACCT TGGGCAAGGG CAGGGGCGGG AAGGCCAAGT GCATCCTGCA GCTCGATGAT GTGGCGCTCT ATGCCGAAGG CCTGATCGGC GTCCTGGTGC CGGACATGGC CGACGAGACC TGCGCCGTGC AACTCAGGAA GATGGCTGAA ATCTTCGGCG ACCGCGCCTA TGTCTCGCTC TGTCTGCGTC GCCGCCCCAA CGATCAGCTT CGATTGCACG AGCTGTCGAA CCTGGCTGTA AAGCACCGGG TGAAGACCGT CATCACCAAT GACGTGCTGT TTCATGAACC CGGCCGGCGG CAGCTGCAGG ATGTCGTCAC CTGCATCCGC ACCGGCACCA CCATCGACGA TGTCGGCTTC GAGCGCGAGC GCCATGCCGA TCGCTTCCTC AAACCGCCCG AAGAAATGGC GCGTCTATTT CCCCGGTACC CAGAAGCACT GGCACGCACG ATGGACATCG TCGAGCGCTG CAGGTTCAGT TTGGAAGAGC TCGTCTATCA ATATCCCGAG GAAGCACTGA TCCCGGGCAT GACCGCGCAG GAGTCGCTGG AGCATTACAC CTGGGAAGGC GTGACGACGC GCTATCCGGA AGGGCTGCCA GCGCATGTCG AGAAAACGAT CCGGCATGAA CTGGCGCTGA TCGAGACGAT GAAATACGCC CCCTACTTCC TGACGGTGTT CTCGATCGTC CGTTATGCCC GAGCGCAAGG CATTCTCTGC CAGGGCCGGG GATCGGCCGC CAATTCCGCC GTTTGCTATG TGCTCGGCAT CACCTCGATC GACCCGGAAA CAAACGATCT CCTCTTTGAG CGCTTTGTGT CCCAGGAGCG TGACGAGCCG CCCGACATCG ATGTCGATTT CGAGCACGAA AGACGGGAGG AGGTGATCCA GTGGATCTAC AAGACCTACG GCCATGACAA GGCCGCACTC TGTTCGACGG TGACCCGCTA TCGCGCCAAA GGCGCTATTC GCGACGTCGG CAAGGCCCTT GGCCTGCCCG AGGATCTGAT CAACTCGCTT TCCTCCGGCA TCTGGTCCTG GTCGGAAACG GTCGGCGAGA ACCAGGTGCG CGCCCTCGGC CTCAATCCGG AAGACCGTCG TCTCGCCTTA ACGCTGAGGC TTGCCCAGCA GCTGATGGGA GCACCGCGGC ATCTTGGCCA ACATCCCGGC GGCTTCGTCT TGACCCATGA CAGGCTCGAT CATCTCGTGC CCATCGAACC CGCGGCGATG GTCGACCGCC AGGTGATCGA ATGGGACAAG GACGACGTCG AGGCGCTCAA GATGATGAAG GTCGATGTGC TGGCATTGGG CATGCTGACC TGCATGGCCA AGGCCTTTGC CCTCATTGGC GAGCACAAAC ATCAGGATCT CTACCTCGCC ACCATTCCAC AGGAGGATCC CGCGACCTAT GCGATGATCC GCAAGGCCGA TACTCTCGGG ACATTCCAGA TCGAATCCCG CGCCCAAATG TCGATGCTCC CACGGATGAA ACCGAAAACC TTTTATGATC TCGTCATCCA GGTGGCGATC GTCCGCCCCG GACCGATCCA GGGCGATATG GTCCATCCCT ATCTGCGCCG GCGCGAAGGC AAGGAGAAGG TCGAATATCC GACGCCCGAA CTGGAGGCGG TCCTCCACAA GACGCTGGGT GTGCCGCTGT TTCAGGAGTC GGCGATGAAG GTGGCGATGG TCTGTGCCGG GTTTACCGGC GGTGAAGCCG ACCAGCTCAG GAAATCCATG GCGACCTTCA AGTTCACCGG TGGCGTTTCC CGCTTCAAGG ACAAGCTCGT CTCCGGGATG ATCAAAAGCG GCTACTCGCC GGAATTCGCC GAAAAGACTT TCGGCCAGCT GGAAGGTTTT GGCTCCTATG GCTTTCCCGA AAGCCATGCC GCCTCCTTCG CACTGATCGC CTATGCCTCA AACTACGTGA AATGCCATTT TCCGGATGTC TTTTGCGCAG CACTTCTGAA CTCGCAGCCG ATGGGCTTTT ATGCGCCGGC ACAGATCGTT TCCGACGCGA GGAAACACGA CGTCGAGGTC TGCCCGATCT GCATCAACCG CTCGCGCTGG GACTGCACGC TCGAGGAGGT GGAGGGTACA GGCCGGCATG CGGTGCGGCT CGGCATGCGC CTGGTGCGAG GATTGGCGAC CGCTGACGCC GCACGCATTG TCGCCGCGCG CGCCGATGAA CCCTTCGCAT CGGTGGACGA CATGTGGCGG CGGTCGGGTG TACCCGTGGC CTCCCTTGTC GAGCTCGCCG AGGCCGACGC GTTCCTGCCG TCATTGCGGC TCGAGCGGCG CGATGCGCTC TGGGCGATCA AGGCGCTGCG CGACGAGCCG CTGCCTCTGT TTACGTCCGC AGCTGAACGC GAGGCGCGGG CGATTGCCGA GCAGCAGGAG CCGGAAGTCG AGCTCAGGCA GATGACCGAT GGGCACAACG TCGTTGAGGA TTACAGCCAC ATCGGACTGA CACTGCGGGA GCATCCGTTA CGATTCCTGC GGGCGGATCT CACAAAGCGC CAGATCGTCA CCTGTGCCAA GGCGATGACG GCGCGCGATG GCCAGTGGCT GATGGCGGCC GGCCTGGTGC TGGTGCGGCA GCGGCCGGGT TCGGCGAAGG GTGTGATGTT CATCACCATC GAGGATGAGA CCGGCATTGC CAATATCGTC GTCTGGCCGA AGCTGTTCGA GCGCTCACGC CGCGTGGTGC TCGGCGCCAG TATGATGGCG ATCAATGGCA GGATCCAGCG GGAAGGGGAA GTGGTTCACC TGGTCGCCCA GCAGCTCTTC GATTTCTCGG CCGATCTGTC AGGACTTGCC GGGCGGGACG GCGCTTTCCG TGCCTCCACC GGCCGCGGCG ATGAGTTCGC CCATGGCTCT CCTGGGAGCC CGGATTCTCG CGAAAAAGCG CCTCCGGGGG TTCGGGCGAG AGATATGTTC ACACCCGATC TTCATATCGA CACGCTGAAG ATTAAGAGCC GGAATTTTCA GTAG
|
Protein sequence | MRYAELQVTT HFSFLRGASS AEELFATAKL MGIEALGIVD RNSLAGIVRA LEASRATGLR LVVGCRLDLA DGMSMLVYPT DRAAYSRLTR LLTLGKGRGG KAKCILQLDD VALYAEGLIG VLVPDMADET CAVQLRKMAE IFGDRAYVSL CLRRRPNDQL RLHELSNLAV KHRVKTVITN DVLFHEPGRR QLQDVVTCIR TGTTIDDVGF ERERHADRFL KPPEEMARLF PRYPEALART MDIVERCRFS LEELVYQYPE EALIPGMTAQ ESLEHYTWEG VTTRYPEGLP AHVEKTIRHE LALIETMKYA PYFLTVFSIV RYARAQGILC QGRGSAANSA VCYVLGITSI DPETNDLLFE RFVSQERDEP PDIDVDFEHE RREEVIQWIY KTYGHDKAAL CSTVTRYRAK GAIRDVGKAL GLPEDLINSL SSGIWSWSET VGENQVRALG LNPEDRRLAL TLRLAQQLMG APRHLGQHPG GFVLTHDRLD HLVPIEPAAM VDRQVIEWDK DDVEALKMMK VDVLALGMLT CMAKAFALIG EHKHQDLYLA TIPQEDPATY AMIRKADTLG TFQIESRAQM SMLPRMKPKT FYDLVIQVAI VRPGPIQGDM VHPYLRRREG KEKVEYPTPE LEAVLHKTLG VPLFQESAMK VAMVCAGFTG GEADQLRKSM ATFKFTGGVS RFKDKLVSGM IKSGYSPEFA EKTFGQLEGF GSYGFPESHA ASFALIAYAS NYVKCHFPDV FCAALLNSQP MGFYAPAQIV SDARKHDVEV CPICINRSRW DCTLEEVEGT GRHAVRLGMR LVRGLATADA ARIVAARADE PFASVDDMWR RSGVPVASLV ELAEADAFLP SLRLERRDAL WAIKALRDEP LPLFTSAAER EARAIAEQQE PEVELRQMTD GHNVVEDYSH IGLTLREHPL RFLRADLTKR QIVTCAKAMT ARDGQWLMAA GLVLVRQRPG SAKGVMFITI EDETGIANIV VWPKLFERSR RVVLGASMMA INGRIQREGE VVHLVAQQLF DFSADLSGLA GRDGAFRAST GRGDEFAHGS PGSPDSREKA PPGVRARDMF TPDLHIDTLK IKSRNFQ
|
| |