Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4103 |
Symbol | |
ID | 6982875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4280807 |
End bp | 4283806 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643398833 |
Product | DNA polymerase I |
Protein accession | YP_002283591 |
Protein GI | 209551674 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG GCGATCACCT CTTCCTAGTC GATGGTTCCG GGTTCATCTT CCGGGCGTTT CATGCACTTC CGCCGCTGAC CCGCAAGACC GACGGCCTGC CGATCGGCGC CGTTTCCGGT TTCTGCAACA TGCTGTGGAA GCTGTTGCGC GATGCGCGCA ACACCGATGT CGGGGTGACG CCGACGCATC TTGCCGTCAT CTTCGACTAT TCCGCCAAGA CCTTTCGCAA GGATCTCTAC GACGCCTATA AGGCGAACCG CTCCGCCCCG CCGGAAGAGC TCATCCCGCA ATTCGGCCTG ATCCGCGAGG CGACCCGCGC CTTCAATCTG CCCTGCATCG AGACCGAAGG TTTTGAGGCC GACGATATCA TCGCCACCTA TGCCCGTCAG GCCGAGGCAT CGGGCGCCGA TGTCACCATC GTCTCTTCCG ACAAGGACCT GATGCAGCTC GTCAGCCCGA ATGTCCACAT GTATGACAGC ATGAAGGACA AGCAGATCGG CATTCCCGAT GTCATCGAGA AATGGGGCGT GCCGCCGGAA AAGATGATCG ACCTGCAGGC GATGACCGGC GATTCCGTCG ACAATGTTCC CGGCATTCCC GGCATCGGCC CGAAGACCGC CGCCCAGCTG CTCGAGGAAT ACGGCGATCT CGATACGCTG CTCGACCGCG CCACCGAGAT CAAGCAGGTC AAGCGCCGCG AAACCATTCT CGCCAATATC GATATGGCCC GGCTCTCGCG CGACCTCGTG CGGCTGCGCA CCGATGTGCC GCTCGACCTC GATCTCGACG CGCTGGTGCT GGAGCCGCAG AACGGCCCGA AGCTGATCGG CTTCCTCAAG ACGATGGAAT TCACGACGCT GACCCGCCGC GTCGCCGAGG TCTGCGATTG CGATGCGAGT GCCATCGAAC CGGCGATCGT CAATGTCGAA TGGGGCAAGG CTGCCCATGG TCCCGATCTC GATGCGGCCG AACCTGCCCC GGTTGCCGGC GGCATCCCCG AGGTTTCCGG CGAATCGGCG CCGGTGCCGC CGCGCGCGAA GGCCAAGGCT TCCGTCGAAG GTGCCTTTTC GCCCGCCGAT CTGGCCAAGG CGCGGGCCGA AGCCTTTGCG ACGTTGCCCT TCGATCATAC GGCCTATGTC ACGATCCGCG ATCTCGCCAC GCTCGACCGG TGGATCGCCG ATGCGCGCGC CACGGGCCTC GTTGCTTTCG ACACCGAGAC CACGTCGCTG GATGCGATGC AGGCCGAGCT TGTCGGCTTC TCGCTGGCGA TTGCCGACAA TACTGCCGAT CCCACCGGCA CGAAGATCCG CGCCGCCTAT ATACCGCTCG CCCATAAGAA CGGCGTCGGC GATCTGCTCG GCGGCGGCCT TGCCGACAAC CAGATCTCCA TGCGCGATGC CCTGCCGCGG TTGAAGTCCC TGCTGGAGGA CGCTGCGGTT CTCAAGGTCG CGCAGAACCT CAAATACGAC TACCTGCTGA TGCAGCGCTA CGGCATCGAG ACCAGGAGTT TCGACGACAC GATGCTGATC TCCTACGTGC TCGACGCCGG CACCGGCGCC CACGGCATGG ACCCGCTCTC GGAAAAATTC CTCGGCCACA CGCCGATCCC CTACAAGGAT GTCGCAGGCA GCGGCAAGGC GAACGTCACC TTCGATCTGG TCGATATCGA CCGCGCCACC CATTATGCGG CCGAAGATGC CGACGTGACG CTGCGGCTCT GGCTGGTGCT GAAGCCGCGG CTGGCGGCAG CCGGGCTGAC CAGCGTCTAT GAACGGCTGG AACGGCCGCT GCTGCCGGTG CTGGCGCGCA TGGAAGCGCG CGGCATCACG GTCGACCGGC AGATCCTGTC GCGCCTGTCC GGCGAGCTCG CCCAAGGTGC TGCGCGCCTG GAAGACGAGA TCTATGTGCT CGCCGGCGAG CGGTTCAATA TCGGCTCGCC GAAGCAGCTG GGCGATATCC TGTTCGGCAA GATGGGCCTT GCCGGCGGCA GCAAGACGAA AACCGGGCAA TGGTCCACCT CGGCGCAGGT GCTCGAGGAT CTGGCCGCCG CCGGTTTCGA GCTGCCGCGC AAGATCGTCG ACTGGCGCCA GCTCACCAAG CTGAAATCCA CCTATACCGA CGCGCTTCCC GGCTATGTCC ACGCCGAGAC CAAGCGGGTC CACACCTCCT ATTCGCTGGC ATCGACGACG ACGGGGCGCC TGTCCTCCTC CGAGCCGAAC CTGCAGAACA TTCCGGTGCG CACCGCCGAA GGCCGCAAGA TCCGCACCGC CTTCATCTCG ACGCCGGGCC ACAAGCTGAT TTCCGCCGAC TACAGCCAGA TCGAATTGCG CGTGCTGGCC CATGTGGCTG AGATCCCGCA GCTGACCAAG GCCTTCGAGG ATGGCGTCGA CATTCATGCG ATGACCGCGT CGGAAATGTT CGGCGTGCCG GTCGAAGGCA TGCCGGGCGA AGTTCGTCGC CGCGCCAAGG CGATCAATTT CGGCATCATC TACGGCATTT CGGCCTTCGG CCTTGCCAAC CAGCTGTCGA TCGAGCGCTC GGAAGCCGGC GACTACATCA AGAAGTATTT CGAGCGTTTC CCGGGCATCC GCGACTATAT GGAAAGCCGC AAGGCGATGG CGCGCGACAA GGGTTATGTC GAGACGATCT TCGGGCGGCG TATCAACTAC CCCGAAATCC GCTCTTCCAA CCCGTCCGTG CGCGCCTTCA ACGAGCGTGC GGCGATCAAC GCGCCGATCC AGGGCTCGGC CGCCGACGTC ATCCGCCGGG CGATGATCCG GATGGAGCCG GCGCTGGCCG AAGTCGGTCT TGGCGATCGC GTCCGCATGC TGCTGCAGGT GCACGACGAA CTCATCTTCG AAGTCGAGGA TGAGGATGTC GAGAAGGCAA TGCCCATCAT CGTCTCGGTC ATGGAAAACG CCACCATGCC GGCGCTGGAA ATGCGCGTGC CGCTGCGCGT CGACGCGCGT GCCGCCAGCA ATTGGGACGA GGCGCATTGA
|
Protein sequence | MKKGDHLFLV DGSGFIFRAF HALPPLTRKT DGLPIGAVSG FCNMLWKLLR DARNTDVGVT PTHLAVIFDY SAKTFRKDLY DAYKANRSAP PEELIPQFGL IREATRAFNL PCIETEGFEA DDIIATYARQ AEASGADVTI VSSDKDLMQL VSPNVHMYDS MKDKQIGIPD VIEKWGVPPE KMIDLQAMTG DSVDNVPGIP GIGPKTAAQL LEEYGDLDTL LDRATEIKQV KRRETILANI DMARLSRDLV RLRTDVPLDL DLDALVLEPQ NGPKLIGFLK TMEFTTLTRR VAEVCDCDAS AIEPAIVNVE WGKAAHGPDL DAAEPAPVAG GIPEVSGESA PVPPRAKAKA SVEGAFSPAD LAKARAEAFA TLPFDHTAYV TIRDLATLDR WIADARATGL VAFDTETTSL DAMQAELVGF SLAIADNTAD PTGTKIRAAY IPLAHKNGVG DLLGGGLADN QISMRDALPR LKSLLEDAAV LKVAQNLKYD YLLMQRYGIE TRSFDDTMLI SYVLDAGTGA HGMDPLSEKF LGHTPIPYKD VAGSGKANVT FDLVDIDRAT HYAAEDADVT LRLWLVLKPR LAAAGLTSVY ERLERPLLPV LARMEARGIT VDRQILSRLS GELAQGAARL EDEIYVLAGE RFNIGSPKQL GDILFGKMGL AGGSKTKTGQ WSTSAQVLED LAAAGFELPR KIVDWRQLTK LKSTYTDALP GYVHAETKRV HTSYSLASTT TGRLSSSEPN LQNIPVRTAE GRKIRTAFIS TPGHKLISAD YSQIELRVLA HVAEIPQLTK AFEDGVDIHA MTASEMFGVP VEGMPGEVRR RAKAINFGII YGISAFGLAN QLSIERSEAG DYIKKYFERF PGIRDYMESR KAMARDKGYV ETIFGRRINY PEIRSSNPSV RAFNERAAIN APIQGSAADV IRRAMIRMEP ALAEVGLGDR VRMLLQVHDE LIFEVEDEDV EKAMPIIVSV MENATMPALE MRVPLRVDAR AASNWDEAH
|
| |