Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4423 |
Symbol | |
ID | 8015192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4553938 |
End bp | 4556988 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826998 |
Product | DNA polymerase I |
Protein accession | YP_002978200 |
Protein GI | 241207104 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAACT CCATATGGAC CTCGTCCCAT GCCCGCGCTA TCCATGCACG CATGAAAAAA GGCGATCACC TCTTCCTAGT CGATGGTTCC GGATTCATCT TCCGGGCGTT TCATGCATTG CCGCCGCTGA CCCGCAAGAC CGACGGCCTG CCGATCGGCG CCGTGTCCGG TTTCTGCAAC ATGCTGTGGA AACTGCTGAG GGATGCGCGC AATACCGATG TCGGCGTCAC GCCGACACAT CTTGCCGTCA TCTTCGATTA TTCCGCCAAG ACGTTTCGCA AGGATCTCTA CGACGCTTAC AAGGCGAACC GCTCCGCCCC GCCTGAAGAG CTCATCCCGC AATTCGGCCT TATAAGAGAG GCGACCCGCG CCTTCAATCT GCCCTGCATC GAGACCCAAG GCTTCGAGGC CGACGACATC ATCGCCACCT ATGCCCGCCA GGCCGAAGCG ACCGGCGCCG ATGTCACCAT TGTCTCCTCC GACAAGGATC TGATGCAGCT CGTCAGCCCC AATGTCCATA TGTATGACAG CATGAAGGAC AAGCAGATCG GCATTCCCGA TGTCATCGAG AAATGGGGCG TGCCGCCGGA AAAGATGATC GACCTGCAGG CGATGACCGG CGATTCAGTC GACAATGTTC CCGGCATTCC CGGCATCGGT CCGAAAACCG CCGCCCAGCT GCTCGAGGAA TACGGCGATC TCGATACGCT GCTCGAACGC GCCACCGAGA TCAAGCAGGT CAAGCGCCGT GAGACGATCC TCGCCAATAT CGACATGGCC AGGCTCTCGC GCGACCTCGT GCGGTTGCGC ATAGACGTGC CGCTCGATCT CGATCTCGAC GCGCTGGTGC TGGAACCGCA GAACGGTCCG AAGCTGATCG GCTTCCTCAA GACGATGGAA TTCACCACGC TGACGCGCCG CGTCGCCGAA GCCTGCAATT GCGATGCCGG CGCCATCGAA CCGGCGATCG TCCGTGTCGA ATGGGGTGAG ACGGCCCGCG GCCCGGATCT CGATGCGGCC GCGCCCGAGC CTGTTGCCGG CGGCATCCCC GACGTTTCCG GCGAATCCGT GCCGGTGCCG CCGCGTGCAA AGGCGAAGAC CGCGGTCGAA GGCGCCTTTT CGCCCGCCGA TCTTGCCAAG GCGCGGGCCG AGGTCTTTGC GACGCTGCCC TTCGATCATT CGGCCTATGT CACGATCCGC GACCTGGCGA CACTCGACCG ATGGATTGCC GATGCACGCG TCACCGGCCT CGTTGCTTTC GATACCGAGA CCACGTCGCT GGATGCGATG CAGGCCGAAC TTGTCGGCTT TTCGCTGGCG ATCGCCGACA ATACCGCCGA TCCCACCGGC ACGAAGATCC GTGCCGCCTA TGTGCCGCTC GTCCACAAGA ACGGCGTCGG CGATCTGCTC GGCGGCGGCC TTGCCGAAAA CCAGATCCCG ATGCGCGATG CTCTGCCACG ACTGAAGGCA TTGCTGGAGG ACGAAGCGGT TCTCAAGGTC GCCCAGAACC TGAAATACGA CTACCTGCTG TTGAAGCGCT ACGGCATCGA GACCAGGAGT TTCGACGACA CGATGCTGAT CTCCTACGTG CTCGATGCCG GCACCGGCGC GCATGGCATG GACCCGCTCT CGGAAAAATT CCTCGGCCAT ACCCCGATTC CCTACAAGGA CGTGGCCGGC AGCGGCAAGG CGAACGTCAC CTTCGATCTG GTCGATATCG ACCGCGCCAC CCACTATGCC GCCGAAGATG CCGAGGTGAC GTTGCGCCTC TGGCTGGTGC TGAAGCCCCG GCTGGCGGCG GCGGGATTGA CCAGCGTCTA TGAACGGCTG GAGCGGCCGC TATTGCCGGT GCTGGCGCGC ATGGAAGCGC GCGGCATCAC CGTCGACCGG CAGATCCTGT CGCGCCTCTC CGGCGAGCTG GCCCAGAGTG CAGCAAGGCT GGAGGACGAG ATCTACGTGC TGGCCGGCGA GCGTTTCAAT ATCGGTTCGC CGAAGCAGCT GGGCGATATC CTGTTCGGCA AGATGGGCCT TTCCGGCGGC AGCAAGACGA AGACCGGCCA ATGGTCCACC TCCGCCCAGG TGCTCGAGGA TCTGGCCGCC GCCGGTTTCG AATTGCCGCG CAAGATCGTC GACTGGCGCC AGGTCACCAA GCTGAAATCC ACCTATACCG ACGCGCTTCC GGGTTACGTT CACCCCGAGA CAAAGCGGGT CCACACCTCC TACTCGCTGG CATCGACGAC CACGGGACGC CTGTCATCGT CCGAGCCGAA CTTGCAGAAT ATTCCGGTGC GCACCGCAGA AGGCCGCAAG ATCCGCACCG CCTTCATCTC GACGCCCCGC CACAAGCTGA TCTCCGCCGA CTACAGCCAG ATCGAACTGC GCGTGCTTGC CCATGTGGCC GAAATCCCGC AGCTGACCAA GGCCTTCGAA GATGGCGTCG ACATCCATGC CATGACGGCG TCGGAAATGT TCGGCGTGCC GGTGGAAGGC ATGCCGGGCG AGGTGCGCCG CCGCGCCAAG GCGATCAATT TCGGCATCAT CTACGGCATC TCGGCCTTCG GGCTTGCCAA TCAGCTTTCG ATCGAGCGTT CGGAAGCCGG CGACTACATC AAGAAGTATT TCGAGCGTTT CCCCGGCATC CGCGATTATA TGGAAAGCCG AAAGGCCATG GCGCGCGACA AGGGTTATGT CGAAACGATC TTCGGTCGCC GCATCAACTA TCCCGAAATC CGCTCTTCCA ATCCATCCGT GCGTGCCTTT AACGAGCGTG CGGCGATCAA CGCGCCGATC CAGGGCTCGG CTGCCGACGT CATCCGCCGG GCGATGATCA AGATAGAGCC GGCGCTTGTT GAAGTCGGCC TTGCCGATCG CGTCCGCATG CTGCTGCAGG TGCATGACGA ACTCATCTTC GAGGTCGAGG ACGAGGATGT CGAAAAGGCG ATGCCGGTCA TCGTCTCGGT CATGGAAAAC GCCACCATGC CGGCGCTGGA AATGCGCGTG CCGCTGAGGG TCGATGCCCG CGCCGCCACC AATTGGGACG AGGCGCACTA A
|
Protein sequence | MPNSIWTSSH ARAIHARMKK GDHLFLVDGS GFIFRAFHAL PPLTRKTDGL PIGAVSGFCN MLWKLLRDAR NTDVGVTPTH LAVIFDYSAK TFRKDLYDAY KANRSAPPEE LIPQFGLIRE ATRAFNLPCI ETQGFEADDI IATYARQAEA TGADVTIVSS DKDLMQLVSP NVHMYDSMKD KQIGIPDVIE KWGVPPEKMI DLQAMTGDSV DNVPGIPGIG PKTAAQLLEE YGDLDTLLER ATEIKQVKRR ETILANIDMA RLSRDLVRLR IDVPLDLDLD ALVLEPQNGP KLIGFLKTME FTTLTRRVAE ACNCDAGAIE PAIVRVEWGE TARGPDLDAA APEPVAGGIP DVSGESVPVP PRAKAKTAVE GAFSPADLAK ARAEVFATLP FDHSAYVTIR DLATLDRWIA DARVTGLVAF DTETTSLDAM QAELVGFSLA IADNTADPTG TKIRAAYVPL VHKNGVGDLL GGGLAENQIP MRDALPRLKA LLEDEAVLKV AQNLKYDYLL LKRYGIETRS FDDTMLISYV LDAGTGAHGM DPLSEKFLGH TPIPYKDVAG SGKANVTFDL VDIDRATHYA AEDAEVTLRL WLVLKPRLAA AGLTSVYERL ERPLLPVLAR MEARGITVDR QILSRLSGEL AQSAARLEDE IYVLAGERFN IGSPKQLGDI LFGKMGLSGG SKTKTGQWST SAQVLEDLAA AGFELPRKIV DWRQVTKLKS TYTDALPGYV HPETKRVHTS YSLASTTTGR LSSSEPNLQN IPVRTAEGRK IRTAFISTPR HKLISADYSQ IELRVLAHVA EIPQLTKAFE DGVDIHAMTA SEMFGVPVEG MPGEVRRRAK AINFGIIYGI SAFGLANQLS IERSEAGDYI KKYFERFPGI RDYMESRKAM ARDKGYVETI FGRRINYPEI RSSNPSVRAF NERAAINAPI QGSAADVIRR AMIKIEPALV EVGLADRVRM LLQVHDELIF EVEDEDVEKA MPVIVSVMEN ATMPALEMRV PLRVDARAAT NWDEAH
|
| |