Gene Rleg_4423 details


Gene Information

Locus tag: Rleg_4423
Symbol:
ID: 8015192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4553938
End bp: 4556988
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 62%
IMG OID: 644826998
Product: DNA polymerase I
Protein accession: YP_002978200
Protein GI: 241207104
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
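The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS includes its stop codon. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon, which is not counted as an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


gene_length, protein_length = expected_lengths(4553938, 4556988)
print(gene_length, protein_length)  # 3051 1016
```

Under these assumptions the result agrees with the Gene Length (3051 bp) and Protein Length (1016 aa) fields listed above.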


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAACT CCATATGGAC CTCGTCCCAT GCCCGCGCTA TCCATGCACG CATGAAAAAA 
GGCGATCACC TCTTCCTAGT CGATGGTTCC GGATTCATCT TCCGGGCGTT TCATGCATTG
CCGCCGCTGA CCCGCAAGAC CGACGGCCTG CCGATCGGCG CCGTGTCCGG TTTCTGCAAC
ATGCTGTGGA AACTGCTGAG GGATGCGCGC AATACCGATG TCGGCGTCAC GCCGACACAT
CTTGCCGTCA TCTTCGATTA TTCCGCCAAG ACGTTTCGCA AGGATCTCTA CGACGCTTAC
AAGGCGAACC GCTCCGCCCC GCCTGAAGAG CTCATCCCGC AATTCGGCCT TATAAGAGAG
GCGACCCGCG CCTTCAATCT GCCCTGCATC GAGACCCAAG GCTTCGAGGC CGACGACATC
ATCGCCACCT ATGCCCGCCA GGCCGAAGCG ACCGGCGCCG ATGTCACCAT TGTCTCCTCC
GACAAGGATC TGATGCAGCT CGTCAGCCCC AATGTCCATA TGTATGACAG CATGAAGGAC
AAGCAGATCG GCATTCCCGA TGTCATCGAG AAATGGGGCG TGCCGCCGGA AAAGATGATC
GACCTGCAGG CGATGACCGG CGATTCAGTC GACAATGTTC CCGGCATTCC CGGCATCGGT
CCGAAAACCG CCGCCCAGCT GCTCGAGGAA TACGGCGATC TCGATACGCT GCTCGAACGC
GCCACCGAGA TCAAGCAGGT CAAGCGCCGT GAGACGATCC TCGCCAATAT CGACATGGCC
AGGCTCTCGC GCGACCTCGT GCGGTTGCGC ATAGACGTGC CGCTCGATCT CGATCTCGAC
GCGCTGGTGC TGGAACCGCA GAACGGTCCG AAGCTGATCG GCTTCCTCAA GACGATGGAA
TTCACCACGC TGACGCGCCG CGTCGCCGAA GCCTGCAATT GCGATGCCGG CGCCATCGAA
CCGGCGATCG TCCGTGTCGA ATGGGGTGAG ACGGCCCGCG GCCCGGATCT CGATGCGGCC
GCGCCCGAGC CTGTTGCCGG CGGCATCCCC GACGTTTCCG GCGAATCCGT GCCGGTGCCG
CCGCGTGCAA AGGCGAAGAC CGCGGTCGAA GGCGCCTTTT CGCCCGCCGA TCTTGCCAAG
GCGCGGGCCG AGGTCTTTGC GACGCTGCCC TTCGATCATT CGGCCTATGT CACGATCCGC
GACCTGGCGA CACTCGACCG ATGGATTGCC GATGCACGCG TCACCGGCCT CGTTGCTTTC
GATACCGAGA CCACGTCGCT GGATGCGATG CAGGCCGAAC TTGTCGGCTT TTCGCTGGCG
ATCGCCGACA ATACCGCCGA TCCCACCGGC ACGAAGATCC GTGCCGCCTA TGTGCCGCTC
GTCCACAAGA ACGGCGTCGG CGATCTGCTC GGCGGCGGCC TTGCCGAAAA CCAGATCCCG
ATGCGCGATG CTCTGCCACG ACTGAAGGCA TTGCTGGAGG ACGAAGCGGT TCTCAAGGTC
GCCCAGAACC TGAAATACGA CTACCTGCTG TTGAAGCGCT ACGGCATCGA GACCAGGAGT
TTCGACGACA CGATGCTGAT CTCCTACGTG CTCGATGCCG GCACCGGCGC GCATGGCATG
GACCCGCTCT CGGAAAAATT CCTCGGCCAT ACCCCGATTC CCTACAAGGA CGTGGCCGGC
AGCGGCAAGG CGAACGTCAC CTTCGATCTG GTCGATATCG ACCGCGCCAC CCACTATGCC
GCCGAAGATG CCGAGGTGAC GTTGCGCCTC TGGCTGGTGC TGAAGCCCCG GCTGGCGGCG
GCGGGATTGA CCAGCGTCTA TGAACGGCTG GAGCGGCCGC TATTGCCGGT GCTGGCGCGC
ATGGAAGCGC GCGGCATCAC CGTCGACCGG CAGATCCTGT CGCGCCTCTC CGGCGAGCTG
GCCCAGAGTG CAGCAAGGCT GGAGGACGAG ATCTACGTGC TGGCCGGCGA GCGTTTCAAT
ATCGGTTCGC CGAAGCAGCT GGGCGATATC CTGTTCGGCA AGATGGGCCT TTCCGGCGGC
AGCAAGACGA AGACCGGCCA ATGGTCCACC TCCGCCCAGG TGCTCGAGGA TCTGGCCGCC
GCCGGTTTCG AATTGCCGCG CAAGATCGTC GACTGGCGCC AGGTCACCAA GCTGAAATCC
ACCTATACCG ACGCGCTTCC GGGTTACGTT CACCCCGAGA CAAAGCGGGT CCACACCTCC
TACTCGCTGG CATCGACGAC CACGGGACGC CTGTCATCGT CCGAGCCGAA CTTGCAGAAT
ATTCCGGTGC GCACCGCAGA AGGCCGCAAG ATCCGCACCG CCTTCATCTC GACGCCCCGC
CACAAGCTGA TCTCCGCCGA CTACAGCCAG ATCGAACTGC GCGTGCTTGC CCATGTGGCC
GAAATCCCGC AGCTGACCAA GGCCTTCGAA GATGGCGTCG ACATCCATGC CATGACGGCG
TCGGAAATGT TCGGCGTGCC GGTGGAAGGC ATGCCGGGCG AGGTGCGCCG CCGCGCCAAG
GCGATCAATT TCGGCATCAT CTACGGCATC TCGGCCTTCG GGCTTGCCAA TCAGCTTTCG
ATCGAGCGTT CGGAAGCCGG CGACTACATC AAGAAGTATT TCGAGCGTTT CCCCGGCATC
CGCGATTATA TGGAAAGCCG AAAGGCCATG GCGCGCGACA AGGGTTATGT CGAAACGATC
TTCGGTCGCC GCATCAACTA TCCCGAAATC CGCTCTTCCA ATCCATCCGT GCGTGCCTTT
AACGAGCGTG CGGCGATCAA CGCGCCGATC CAGGGCTCGG CTGCCGACGT CATCCGCCGG
GCGATGATCA AGATAGAGCC GGCGCTTGTT GAAGTCGGCC TTGCCGATCG CGTCCGCATG
CTGCTGCAGG TGCATGACGA ACTCATCTTC GAGGTCGAGG ACGAGGATGT CGAAAAGGCG
ATGCCGGTCA TCGTCTCGGT CATGGAAAAC GCCACCATGC CGGCGCTGGA AATGCGCGTG
CCGCTGAGGG TCGATGCCCG CGCCGCCACC AATTGGGACG AGGCGCACTA A
 
Protein sequence
MPNSIWTSSH ARAIHARMKK GDHLFLVDGS GFIFRAFHAL PPLTRKTDGL PIGAVSGFCN 
MLWKLLRDAR NTDVGVTPTH LAVIFDYSAK TFRKDLYDAY KANRSAPPEE LIPQFGLIRE
ATRAFNLPCI ETQGFEADDI IATYARQAEA TGADVTIVSS DKDLMQLVSP NVHMYDSMKD
KQIGIPDVIE KWGVPPEKMI DLQAMTGDSV DNVPGIPGIG PKTAAQLLEE YGDLDTLLER
ATEIKQVKRR ETILANIDMA RLSRDLVRLR IDVPLDLDLD ALVLEPQNGP KLIGFLKTME
FTTLTRRVAE ACNCDAGAIE PAIVRVEWGE TARGPDLDAA APEPVAGGIP DVSGESVPVP
PRAKAKTAVE GAFSPADLAK ARAEVFATLP FDHSAYVTIR DLATLDRWIA DARVTGLVAF
DTETTSLDAM QAELVGFSLA IADNTADPTG TKIRAAYVPL VHKNGVGDLL GGGLAENQIP
MRDALPRLKA LLEDEAVLKV AQNLKYDYLL LKRYGIETRS FDDTMLISYV LDAGTGAHGM
DPLSEKFLGH TPIPYKDVAG SGKANVTFDL VDIDRATHYA AEDAEVTLRL WLVLKPRLAA
AGLTSVYERL ERPLLPVLAR MEARGITVDR QILSRLSGEL AQSAARLEDE IYVLAGERFN
IGSPKQLGDI LFGKMGLSGG SKTKTGQWST SAQVLEDLAA AGFELPRKIV DWRQVTKLKS
TYTDALPGYV HPETKRVHTS YSLASTTTGR LSSSEPNLQN IPVRTAEGRK IRTAFISTPR
HKLISADYSQ IELRVLAHVA EIPQLTKAFE DGVDIHAMTA SEMFGVPVEG MPGEVRRRAK
AINFGIIYGI SAFGLANQLS IERSEAGDYI KKYFERFPGI RDYMESRKAM ARDKGYVETI
FGRRINYPEI RSSNPSVRAF NERAAINAPI QGSAADVIRR AMIKIEPALV EVGLADRVRM
LLQVHDELIF EVEDEDVEKA MPVIVSVMEN ATMPALEMRV PLRVDARAAT NWDEAH
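
The sequences above are printed in blocks of 10 characters, 60 per line, so they need the whitespace removed before any analysis. A minimal sketch of that step, with a simple GC-fraction helper, is shown here. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first line of the gene sequence, so its GC value is not comparable with the whole-gene GC content field.

```python
def clean_sequence(text):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as displayed (6 blocks of 10).
first_line = "ATGCCGAACT CCATATGGAC CTCGTCCCAT GCCCGCGCTA TCCATGCACG CATGAAAAAA"
dna = clean_sequence(first_line)
print(len(dna))                    # 60
print(round(gc_fraction(dna), 2))  # GC fraction of this 60-base line only
```

Applying `clean_sequence` to the full gene and protein blocks and taking `len()` of each result gives a quick check against the Gene Length and Protein Length fields.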