Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3381 |
Symbol | |
ID | 5324265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3581962 |
End bp | 3584976 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640792332 |
Product | DNA polymerase I |
Protein accession | YP_001329037 |
Protein GI | 150398570 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.30865 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACG GTGATCATCT CTTCCTCGTC GACGGCTCGG GCTTCATCTT CCGGGCGTTC CACGCCATCC CCCCGCTCAA CCGCAAGTCG GACGGACTTC CGGTGAATGC GGTGGCAGGT TTCTGCAACA TGCTCTGGAA GCTGTTGACC GACGCGCGCG ATACCTCCGT GGGCGTGACG CCGACGCATT TCGCGGTGAT CTTCGATTAT TCCTCCAAGA CTTTCCGCAA CGGGCTCTAC GACCAGTACA AGGCGAATAG GACCGCCCCG CCGGAAGACC TGATCCCACA GTTCGGGCTG ATCCGCCATG CCACCCGCGC ATTCAATCTG CCCTGCATCG AGAAGGAAGG CTACGAGGCG GACGACCTCA TTGCGACCTA TGCGCGGCTT GCCGAAGAGG CAGGCGCCGA CGTGACCATC GTCTCGTCGG ACAAGGACCT CATGCAGCTC GTCACGCCGA AGGTGTCGAT GTATGACAGC ATGAAGGACA AGCAGATCAC CGTTCCGGAC GTAATCGAAA AATGGGGCGT GCCGCCGGAA AAGATGATCG ATCTCCAGGC GATGACCGGC GATTCGACCG ACAACGTGCC TGGAATCCCA GGGATAGGAC CAAAGACCGC GGCCCAACTC CTCGAGGAGT ATGGCGATCT CGACACGCTG CTTGCACGGG CTGGCGAGAT CAAACAGCAG AAGCGGCGCG AGTCCATCAT CGCCAATGCC AATCTGGCGC GCCTTTCGCG TGAGCTCGTG ACGCTGAAGA AAGACACGCC GCTCGACGTG CCGCTCGACG ACTTCATGCT CGATTCTCAG GATGGCCCGA AGCTCATCGC CTTTCTGAAG GCGATGGAAT TCACAACGCT CACGCGCCGC GTCGCGGCGG CGACGGACAC GGACGCCGAA TCGATAGAGC CCGCTCATGT TCCGGTTGAA TGGGGAACCG AGGCGCGTGG ACCGGATCTC GACGTCGGAG AGGCTGGCGG ACCTCCGCCG TCGCCGCAAT CCTCAAGCGC GGCGCCGCCG CGCGGAAATG CGGCAAGGGC TGCGGTTTCA TTCCTCTCCC CCGGCCAGGA TGCCGATGCG ACCGGCGCAA CGCCGGCCGA CCTTGCCGAG GCGCGTGCCG CCTATTTCGC CAGCGCTCCT TTCGACCATT CCGCCTATAT CACCATACGC GATCTCGCAT CGCTCGAACG CTGGATCGCC GACGCGCGCG AGGCCGGGCT TGTCGGTTTC GGCACCCAGG CGACATCGTC CGACGCCATG CGCGCCGATC TCGTCGGCTT TTCGCTGGCG ATAGCCGATT ACGCGAACGA CCCGTCGGGC TCGAGGATCC GGGCTGCCTA TGTGCCGCTC GCGCACAAGA ACGGCACCGG CGATCTGCTG GGAGGCGGAC ACATCGACAA CCAGATCCCG GCTCGCGAGG CGCTGAGCCG GCTAAAGGAG TTGCTGGAGG ACCCGTCCAT CCTCAAAGTC GCACAGAACC TGAAATACGG CTATCTGGTG ATGAAGCGCC ATGGGGTTAC CATGCAGGGC TTCGACGACA CGATGCTGAT ATCCTATGTG CTCGATGCCG GCAACGGAGC CCACGGCATG GAGTCGCTCG CCGAGCGATG GCTTGGCCAC ACGCCGATCG CCTCCAAGGG CATCACCGGC AGTGGCAGGT CGTCGCTCAC CATCGACTTC GTCGATATCG ACAAGGCAGC GGCCTACTCG GCGGAAGATG CGGATATTGC GCTGAGGCTT TGGCATGTGC TGAAGCCCCG CCTTACTGCC AGGGGTCTGA CGCGTGTCTA CGAGCGGCTG GAGCGGCCGC TGATTTCGGT GCTTGCCGGA ATGGAGGAAC GCGGCATCCC CGTCGACAGG CAGATCCTAT CGCGCCTCTC CGGCGAACTG GCGCAGGGAG CGGCGGCGCT GGAAGACGAG ATATATCGCC TTGCCGGCGA AACCTTCACC ATCGGCTCGC CGAAGCAGCT TGGCGACATT CTCTTCGGCA AGTTGGGCCT TCCGGGCGGT TCCAAAACCA AGACCGGGCA ATGGTCGACC TCCGCGCAGG TTCTCGAGGA CCTTGCGGCC GCTGGACACG ACCTGCCCCG CAAGATCGTC GACTGGCGGC AGCTGACCAA GCTCAAATCG ACCTATACCG ACGCTCTCCC CGGCTTCGTG CACCCGGAAA CCAAGCGCGT TCACACCTGT TTTGCACTGG CCGCGACGAC AACCGGACGG CTTTCCTCGT CCGACCCGAA CCTGCAGAAC ATTCCGATAC GGACTGGCGA GGGTCGCAAG ATCCGCACTG CCTTCGTGGC CACGCCGGGA CACAAGCTGG TCTCGGCGGA CTACAGCCAG ATCGAACTCC GGGTGCTTGC CCATGTCGCC GACATCCCGC AGCTGCGCCA GGCGTTTGCG GATGGCGTCG ACATCCATGC GATGACCGCT TCCGAAATGT TCGGCGTGCC GGTCGACGGC ATGCCTGGCG AAATCCGCCG CCGCGCCAAG GCGATCAACT TCGGCATCAT CTACGGTATT TCCGCCTTCG GCCTTGCCAA CCAGCTTTCC ATCGAGCGGT CCGAGGCCGG GGACTACATC AAGCGATATT TCGAGCGCTT TCCCGGCATT CGCGACTATA TGGAAAACAC CAAGACCTTT GCCCGCGAGA ACGGCTATGT CGAAACGATC TTCGGTCGCC GTGCCCATTA CCCGGACATT CGTTCGTCCA ACCCGTCGAT GCGCGCCTTC AACGAACGCG CCTCGATCAA CGCCCCGATC CAGGGATCGG CCGCCGACAT CATCCGTCGC GCAATGGTCA AGATGGAGCC CGCGCTCGAA GCCGCCAAGC TATCCGCGCG AATGCTCCTT CAGGTTCACG ACGAACTGAT CTTCGAGGTG GAGGAGGGCG AGATCGAACG GACTATTCCC GTCATCGTGT CCGTGATGGA GAATGCCGCA ATGCCCGCCC TCGATATGAG AGTGCCGTTG AAGGTCGATG CGCGGGCGGC GCACAATTGG GACGAGGCGC ATTAA
|
Protein sequence | MKNGDHLFLV DGSGFIFRAF HAIPPLNRKS DGLPVNAVAG FCNMLWKLLT DARDTSVGVT PTHFAVIFDY SSKTFRNGLY DQYKANRTAP PEDLIPQFGL IRHATRAFNL PCIEKEGYEA DDLIATYARL AEEAGADVTI VSSDKDLMQL VTPKVSMYDS MKDKQITVPD VIEKWGVPPE KMIDLQAMTG DSTDNVPGIP GIGPKTAAQL LEEYGDLDTL LARAGEIKQQ KRRESIIANA NLARLSRELV TLKKDTPLDV PLDDFMLDSQ DGPKLIAFLK AMEFTTLTRR VAAATDTDAE SIEPAHVPVE WGTEARGPDL DVGEAGGPPP SPQSSSAAPP RGNAARAAVS FLSPGQDADA TGATPADLAE ARAAYFASAP FDHSAYITIR DLASLERWIA DAREAGLVGF GTQATSSDAM RADLVGFSLA IADYANDPSG SRIRAAYVPL AHKNGTGDLL GGGHIDNQIP AREALSRLKE LLEDPSILKV AQNLKYGYLV MKRHGVTMQG FDDTMLISYV LDAGNGAHGM ESLAERWLGH TPIASKGITG SGRSSLTIDF VDIDKAAAYS AEDADIALRL WHVLKPRLTA RGLTRVYERL ERPLISVLAG MEERGIPVDR QILSRLSGEL AQGAAALEDE IYRLAGETFT IGSPKQLGDI LFGKLGLPGG SKTKTGQWST SAQVLEDLAA AGHDLPRKIV DWRQLTKLKS TYTDALPGFV HPETKRVHTC FALAATTTGR LSSSDPNLQN IPIRTGEGRK IRTAFVATPG HKLVSADYSQ IELRVLAHVA DIPQLRQAFA DGVDIHAMTA SEMFGVPVDG MPGEIRRRAK AINFGIIYGI SAFGLANQLS IERSEAGDYI KRYFERFPGI RDYMENTKTF ARENGYVETI FGRRAHYPDI RSSNPSMRAF NERASINAPI QGSAADIIRR AMVKMEPALE AAKLSARMLL QVHDELIFEV EEGEIERTIP VIVSVMENAA MPALDMRVPL KVDARAAHNW DEAH
|
| |