Gene Smed_3381 details


Gene Information

Locus tag: Smed_3381
Symbol:
ID: 5324265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3581962
End bp: 3584976
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 62%
IMG OID: 640792332
Product: DNA polymerase I
Protein accession: YP_001329037
Protein GI: 150398570
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
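
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 3581962
end_bp = 3584976

# Assumption: coordinates are inclusive, so the length is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (3015 bp) and Protein Length (1004 aa).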


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.30865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACG GTGATCATCT CTTCCTCGTC GACGGCTCGG GCTTCATCTT CCGGGCGTTC 
CACGCCATCC CCCCGCTCAA CCGCAAGTCG GACGGACTTC CGGTGAATGC GGTGGCAGGT
TTCTGCAACA TGCTCTGGAA GCTGTTGACC GACGCGCGCG ATACCTCCGT GGGCGTGACG
CCGACGCATT TCGCGGTGAT CTTCGATTAT TCCTCCAAGA CTTTCCGCAA CGGGCTCTAC
GACCAGTACA AGGCGAATAG GACCGCCCCG CCGGAAGACC TGATCCCACA GTTCGGGCTG
ATCCGCCATG CCACCCGCGC ATTCAATCTG CCCTGCATCG AGAAGGAAGG CTACGAGGCG
GACGACCTCA TTGCGACCTA TGCGCGGCTT GCCGAAGAGG CAGGCGCCGA CGTGACCATC
GTCTCGTCGG ACAAGGACCT CATGCAGCTC GTCACGCCGA AGGTGTCGAT GTATGACAGC
ATGAAGGACA AGCAGATCAC CGTTCCGGAC GTAATCGAAA AATGGGGCGT GCCGCCGGAA
AAGATGATCG ATCTCCAGGC GATGACCGGC GATTCGACCG ACAACGTGCC TGGAATCCCA
GGGATAGGAC CAAAGACCGC GGCCCAACTC CTCGAGGAGT ATGGCGATCT CGACACGCTG
CTTGCACGGG CTGGCGAGAT CAAACAGCAG AAGCGGCGCG AGTCCATCAT CGCCAATGCC
AATCTGGCGC GCCTTTCGCG TGAGCTCGTG ACGCTGAAGA AAGACACGCC GCTCGACGTG
CCGCTCGACG ACTTCATGCT CGATTCTCAG GATGGCCCGA AGCTCATCGC CTTTCTGAAG
GCGATGGAAT TCACAACGCT CACGCGCCGC GTCGCGGCGG CGACGGACAC GGACGCCGAA
TCGATAGAGC CCGCTCATGT TCCGGTTGAA TGGGGAACCG AGGCGCGTGG ACCGGATCTC
GACGTCGGAG AGGCTGGCGG ACCTCCGCCG TCGCCGCAAT CCTCAAGCGC GGCGCCGCCG
CGCGGAAATG CGGCAAGGGC TGCGGTTTCA TTCCTCTCCC CCGGCCAGGA TGCCGATGCG
ACCGGCGCAA CGCCGGCCGA CCTTGCCGAG GCGCGTGCCG CCTATTTCGC CAGCGCTCCT
TTCGACCATT CCGCCTATAT CACCATACGC GATCTCGCAT CGCTCGAACG CTGGATCGCC
GACGCGCGCG AGGCCGGGCT TGTCGGTTTC GGCACCCAGG CGACATCGTC CGACGCCATG
CGCGCCGATC TCGTCGGCTT TTCGCTGGCG ATAGCCGATT ACGCGAACGA CCCGTCGGGC
TCGAGGATCC GGGCTGCCTA TGTGCCGCTC GCGCACAAGA ACGGCACCGG CGATCTGCTG
GGAGGCGGAC ACATCGACAA CCAGATCCCG GCTCGCGAGG CGCTGAGCCG GCTAAAGGAG
TTGCTGGAGG ACCCGTCCAT CCTCAAAGTC GCACAGAACC TGAAATACGG CTATCTGGTG
ATGAAGCGCC ATGGGGTTAC CATGCAGGGC TTCGACGACA CGATGCTGAT ATCCTATGTG
CTCGATGCCG GCAACGGAGC CCACGGCATG GAGTCGCTCG CCGAGCGATG GCTTGGCCAC
ACGCCGATCG CCTCCAAGGG CATCACCGGC AGTGGCAGGT CGTCGCTCAC CATCGACTTC
GTCGATATCG ACAAGGCAGC GGCCTACTCG GCGGAAGATG CGGATATTGC GCTGAGGCTT
TGGCATGTGC TGAAGCCCCG CCTTACTGCC AGGGGTCTGA CGCGTGTCTA CGAGCGGCTG
GAGCGGCCGC TGATTTCGGT GCTTGCCGGA ATGGAGGAAC GCGGCATCCC CGTCGACAGG
CAGATCCTAT CGCGCCTCTC CGGCGAACTG GCGCAGGGAG CGGCGGCGCT GGAAGACGAG
ATATATCGCC TTGCCGGCGA AACCTTCACC ATCGGCTCGC CGAAGCAGCT TGGCGACATT
CTCTTCGGCA AGTTGGGCCT TCCGGGCGGT TCCAAAACCA AGACCGGGCA ATGGTCGACC
TCCGCGCAGG TTCTCGAGGA CCTTGCGGCC GCTGGACACG ACCTGCCCCG CAAGATCGTC
GACTGGCGGC AGCTGACCAA GCTCAAATCG ACCTATACCG ACGCTCTCCC CGGCTTCGTG
CACCCGGAAA CCAAGCGCGT TCACACCTGT TTTGCACTGG CCGCGACGAC AACCGGACGG
CTTTCCTCGT CCGACCCGAA CCTGCAGAAC ATTCCGATAC GGACTGGCGA GGGTCGCAAG
ATCCGCACTG CCTTCGTGGC CACGCCGGGA CACAAGCTGG TCTCGGCGGA CTACAGCCAG
ATCGAACTCC GGGTGCTTGC CCATGTCGCC GACATCCCGC AGCTGCGCCA GGCGTTTGCG
GATGGCGTCG ACATCCATGC GATGACCGCT TCCGAAATGT TCGGCGTGCC GGTCGACGGC
ATGCCTGGCG AAATCCGCCG CCGCGCCAAG GCGATCAACT TCGGCATCAT CTACGGTATT
TCCGCCTTCG GCCTTGCCAA CCAGCTTTCC ATCGAGCGGT CCGAGGCCGG GGACTACATC
AAGCGATATT TCGAGCGCTT TCCCGGCATT CGCGACTATA TGGAAAACAC CAAGACCTTT
GCCCGCGAGA ACGGCTATGT CGAAACGATC TTCGGTCGCC GTGCCCATTA CCCGGACATT
CGTTCGTCCA ACCCGTCGAT GCGCGCCTTC AACGAACGCG CCTCGATCAA CGCCCCGATC
CAGGGATCGG CCGCCGACAT CATCCGTCGC GCAATGGTCA AGATGGAGCC CGCGCTCGAA
GCCGCCAAGC TATCCGCGCG AATGCTCCTT CAGGTTCACG ACGAACTGAT CTTCGAGGTG
GAGGAGGGCG AGATCGAACG GACTATTCCC GTCATCGTGT CCGTGATGGA GAATGCCGCA
ATGCCCGCCC TCGATATGAG AGTGCCGTTG AAGGTCGATG CGCGGGCGGC GCACAATTGG
GACGAGGCGC ATTAA
 
Protein sequence
MKNGDHLFLV DGSGFIFRAF HAIPPLNRKS DGLPVNAVAG FCNMLWKLLT DARDTSVGVT 
PTHFAVIFDY SSKTFRNGLY DQYKANRTAP PEDLIPQFGL IRHATRAFNL PCIEKEGYEA
DDLIATYARL AEEAGADVTI VSSDKDLMQL VTPKVSMYDS MKDKQITVPD VIEKWGVPPE
KMIDLQAMTG DSTDNVPGIP GIGPKTAAQL LEEYGDLDTL LARAGEIKQQ KRRESIIANA
NLARLSRELV TLKKDTPLDV PLDDFMLDSQ DGPKLIAFLK AMEFTTLTRR VAAATDTDAE
SIEPAHVPVE WGTEARGPDL DVGEAGGPPP SPQSSSAAPP RGNAARAAVS FLSPGQDADA
TGATPADLAE ARAAYFASAP FDHSAYITIR DLASLERWIA DAREAGLVGF GTQATSSDAM
RADLVGFSLA IADYANDPSG SRIRAAYVPL AHKNGTGDLL GGGHIDNQIP AREALSRLKE
LLEDPSILKV AQNLKYGYLV MKRHGVTMQG FDDTMLISYV LDAGNGAHGM ESLAERWLGH
TPIASKGITG SGRSSLTIDF VDIDKAAAYS AEDADIALRL WHVLKPRLTA RGLTRVYERL
ERPLISVLAG MEERGIPVDR QILSRLSGEL AQGAAALEDE IYRLAGETFT IGSPKQLGDI
LFGKLGLPGG SKTKTGQWST SAQVLEDLAA AGHDLPRKIV DWRQLTKLKS TYTDALPGFV
HPETKRVHTC FALAATTTGR LSSSDPNLQN IPIRTGEGRK IRTAFVATPG HKLVSADYSQ
IELRVLAHVA DIPQLRQAFA DGVDIHAMTA SEMFGVPVDG MPGEIRRRAK AINFGIIYGI
SAFGLANQLS IERSEAGDYI KRYFERFPGI RDYMENTKTF ARENGYVETI FGRRAHYPDI
RSSNPSMRAF NERASINAPI QGSAADIIRR AMVKMEPALE AAKLSARMLL QVHDELIFEV
EEGEIERTIP VIVSVMENAA MPALDMRVPL KVDARAAHNW DEAH
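
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 60 bases and the last 15 bases of the gene sequence. It assumes the sequence is given on the coding strand and in frame from its first base. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here. The function name `translate` and the constant names are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon table built in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    return "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3)
    )

# First line (60 bases) and last line (15 bases) of the gene sequence.
head = "ATGAAAAACG GTGATCATCT CTTCCTCGTC GACGGCTCGG GCTTCATCTT CCGGGCGTTC"
tail = "GACGAGGCGC ATTAA"

print(translate(head))
print(translate(tail))
```

The translated head corresponds to the first 20 residues of the protein sequence, and the tail ends in a stop codon after the final residues DEAH.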