Gene Rleg2_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4103 
Symbol 
ID6982875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4280807 
End bp4283806 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content64% 
IMG OID643398833 
ProductDNA polymerase I 
Protein accessionYP_002283591 
Protein GI209551674 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG GCGATCACCT CTTCCTAGTC GATGGTTCCG GGTTCATCTT CCGGGCGTTT 
CATGCACTTC CGCCGCTGAC CCGCAAGACC GACGGCCTGC CGATCGGCGC CGTTTCCGGT
TTCTGCAACA TGCTGTGGAA GCTGTTGCGC GATGCGCGCA ACACCGATGT CGGGGTGACG
CCGACGCATC TTGCCGTCAT CTTCGACTAT TCCGCCAAGA CCTTTCGCAA GGATCTCTAC
GACGCCTATA AGGCGAACCG CTCCGCCCCG CCGGAAGAGC TCATCCCGCA ATTCGGCCTG
ATCCGCGAGG CGACCCGCGC CTTCAATCTG CCCTGCATCG AGACCGAAGG TTTTGAGGCC
GACGATATCA TCGCCACCTA TGCCCGTCAG GCCGAGGCAT CGGGCGCCGA TGTCACCATC
GTCTCTTCCG ACAAGGACCT GATGCAGCTC GTCAGCCCGA ATGTCCACAT GTATGACAGC
ATGAAGGACA AGCAGATCGG CATTCCCGAT GTCATCGAGA AATGGGGCGT GCCGCCGGAA
AAGATGATCG ACCTGCAGGC GATGACCGGC GATTCCGTCG ACAATGTTCC CGGCATTCCC
GGCATCGGCC CGAAGACCGC CGCCCAGCTG CTCGAGGAAT ACGGCGATCT CGATACGCTG
CTCGACCGCG CCACCGAGAT CAAGCAGGTC AAGCGCCGCG AAACCATTCT CGCCAATATC
GATATGGCCC GGCTCTCGCG CGACCTCGTG CGGCTGCGCA CCGATGTGCC GCTCGACCTC
GATCTCGACG CGCTGGTGCT GGAGCCGCAG AACGGCCCGA AGCTGATCGG CTTCCTCAAG
ACGATGGAAT TCACGACGCT GACCCGCCGC GTCGCCGAGG TCTGCGATTG CGATGCGAGT
GCCATCGAAC CGGCGATCGT CAATGTCGAA TGGGGCAAGG CTGCCCATGG TCCCGATCTC
GATGCGGCCG AACCTGCCCC GGTTGCCGGC GGCATCCCCG AGGTTTCCGG CGAATCGGCG
CCGGTGCCGC CGCGCGCGAA GGCCAAGGCT TCCGTCGAAG GTGCCTTTTC GCCCGCCGAT
CTGGCCAAGG CGCGGGCCGA AGCCTTTGCG ACGTTGCCCT TCGATCATAC GGCCTATGTC
ACGATCCGCG ATCTCGCCAC GCTCGACCGG TGGATCGCCG ATGCGCGCGC CACGGGCCTC
GTTGCTTTCG ACACCGAGAC CACGTCGCTG GATGCGATGC AGGCCGAGCT TGTCGGCTTC
TCGCTGGCGA TTGCCGACAA TACTGCCGAT CCCACCGGCA CGAAGATCCG CGCCGCCTAT
ATACCGCTCG CCCATAAGAA CGGCGTCGGC GATCTGCTCG GCGGCGGCCT TGCCGACAAC
CAGATCTCCA TGCGCGATGC CCTGCCGCGG TTGAAGTCCC TGCTGGAGGA CGCTGCGGTT
CTCAAGGTCG CGCAGAACCT CAAATACGAC TACCTGCTGA TGCAGCGCTA CGGCATCGAG
ACCAGGAGTT TCGACGACAC GATGCTGATC TCCTACGTGC TCGACGCCGG CACCGGCGCC
CACGGCATGG ACCCGCTCTC GGAAAAATTC CTCGGCCACA CGCCGATCCC CTACAAGGAT
GTCGCAGGCA GCGGCAAGGC GAACGTCACC TTCGATCTGG TCGATATCGA CCGCGCCACC
CATTATGCGG CCGAAGATGC CGACGTGACG CTGCGGCTCT GGCTGGTGCT GAAGCCGCGG
CTGGCGGCAG CCGGGCTGAC CAGCGTCTAT GAACGGCTGG AACGGCCGCT GCTGCCGGTG
CTGGCGCGCA TGGAAGCGCG CGGCATCACG GTCGACCGGC AGATCCTGTC GCGCCTGTCC
GGCGAGCTCG CCCAAGGTGC TGCGCGCCTG GAAGACGAGA TCTATGTGCT CGCCGGCGAG
CGGTTCAATA TCGGCTCGCC GAAGCAGCTG GGCGATATCC TGTTCGGCAA GATGGGCCTT
GCCGGCGGCA GCAAGACGAA AACCGGGCAA TGGTCCACCT CGGCGCAGGT GCTCGAGGAT
CTGGCCGCCG CCGGTTTCGA GCTGCCGCGC AAGATCGTCG ACTGGCGCCA GCTCACCAAG
CTGAAATCCA CCTATACCGA CGCGCTTCCC GGCTATGTCC ACGCCGAGAC CAAGCGGGTC
CACACCTCCT ATTCGCTGGC ATCGACGACG ACGGGGCGCC TGTCCTCCTC CGAGCCGAAC
CTGCAGAACA TTCCGGTGCG CACCGCCGAA GGCCGCAAGA TCCGCACCGC CTTCATCTCG
ACGCCGGGCC ACAAGCTGAT TTCCGCCGAC TACAGCCAGA TCGAATTGCG CGTGCTGGCC
CATGTGGCTG AGATCCCGCA GCTGACCAAG GCCTTCGAGG ATGGCGTCGA CATTCATGCG
ATGACCGCGT CGGAAATGTT CGGCGTGCCG GTCGAAGGCA TGCCGGGCGA AGTTCGTCGC
CGCGCCAAGG CGATCAATTT CGGCATCATC TACGGCATTT CGGCCTTCGG CCTTGCCAAC
CAGCTGTCGA TCGAGCGCTC GGAAGCCGGC GACTACATCA AGAAGTATTT CGAGCGTTTC
CCGGGCATCC GCGACTATAT GGAAAGCCGC AAGGCGATGG CGCGCGACAA GGGTTATGTC
GAGACGATCT TCGGGCGGCG TATCAACTAC CCCGAAATCC GCTCTTCCAA CCCGTCCGTG
CGCGCCTTCA ACGAGCGTGC GGCGATCAAC GCGCCGATCC AGGGCTCGGC CGCCGACGTC
ATCCGCCGGG CGATGATCCG GATGGAGCCG GCGCTGGCCG AAGTCGGTCT TGGCGATCGC
GTCCGCATGC TGCTGCAGGT GCACGACGAA CTCATCTTCG AAGTCGAGGA TGAGGATGTC
GAGAAGGCAA TGCCCATCAT CGTCTCGGTC ATGGAAAACG CCACCATGCC GGCGCTGGAA
ATGCGCGTGC CGCTGCGCGT CGACGCGCGT GCCGCCAGCA ATTGGGACGA GGCGCATTGA
 
Protein sequence
MKKGDHLFLV DGSGFIFRAF HALPPLTRKT DGLPIGAVSG FCNMLWKLLR DARNTDVGVT 
PTHLAVIFDY SAKTFRKDLY DAYKANRSAP PEELIPQFGL IREATRAFNL PCIETEGFEA
DDIIATYARQ AEASGADVTI VSSDKDLMQL VSPNVHMYDS MKDKQIGIPD VIEKWGVPPE
KMIDLQAMTG DSVDNVPGIP GIGPKTAAQL LEEYGDLDTL LDRATEIKQV KRRETILANI
DMARLSRDLV RLRTDVPLDL DLDALVLEPQ NGPKLIGFLK TMEFTTLTRR VAEVCDCDAS
AIEPAIVNVE WGKAAHGPDL DAAEPAPVAG GIPEVSGESA PVPPRAKAKA SVEGAFSPAD
LAKARAEAFA TLPFDHTAYV TIRDLATLDR WIADARATGL VAFDTETTSL DAMQAELVGF
SLAIADNTAD PTGTKIRAAY IPLAHKNGVG DLLGGGLADN QISMRDALPR LKSLLEDAAV
LKVAQNLKYD YLLMQRYGIE TRSFDDTMLI SYVLDAGTGA HGMDPLSEKF LGHTPIPYKD
VAGSGKANVT FDLVDIDRAT HYAAEDADVT LRLWLVLKPR LAAAGLTSVY ERLERPLLPV
LARMEARGIT VDRQILSRLS GELAQGAARL EDEIYVLAGE RFNIGSPKQL GDILFGKMGL
AGGSKTKTGQ WSTSAQVLED LAAAGFELPR KIVDWRQLTK LKSTYTDALP GYVHAETKRV
HTSYSLASTT TGRLSSSEPN LQNIPVRTAE GRKIRTAFIS TPGHKLISAD YSQIELRVLA
HVAEIPQLTK AFEDGVDIHA MTASEMFGVP VEGMPGEVRR RAKAINFGII YGISAFGLAN
QLSIERSEAG DYIKKYFERF PGIRDYMESR KAMARDKGYV ETIFGRRINY PEIRSSNPSV
RAFNERAAIN APIQGSAADV IRRAMIRMEP ALAEVGLGDR VRMLLQVHDE LIFEVEDEDV
EKAMPIIVSV MENATMPALE MRVPLRVDAR AASNWDEAH