Gene Smed_3021 details

Gene Information

Locus tag: Smed_3021
Symbol:
ID: 5323899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3169083
End bp: 3172433
Gene Length: 3351 bp
Protein Length: 1116 aa
Translation table: 11
GC content: 62%
IMG OID: 640791971
Product: DNA polymerase III, alpha subunit
Protein accession: YP_001328683
Protein GI: 150398216
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
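
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(3169083, 3172433)
print(length)                  # 3351
print(protein_length(length))  # 1116
```

Both results match the Gene Length (3351 bp) and Protein Length (1116 aa) fields of the record.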


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGG ATACTGCCTT CTACGAACTC GGTGCGTGCA CGAATTTCTC TTTTCTCGAA 
GGGGCGGCGC CGGCCGAGGA AATGGTCGTC TTCGCCAAGA AGGCGAGGCT TGCCGGTCTC
GGTATTGCCG ACCGCAACAG CGTTGCCGGT GTGGTGAGAG CCCATGCCAA GGCGAAAATG
GAGAACTATC CTTTCCAGCC GGGCGCCCGT CTGGTTTTTG CCGACGGTAC TCCGGACGTG
CTCGCCTATC CGAGGAACCG GCGGGGGTGG GGGCATCTCT GCCGCCTGCT CAGCGCCGGT
AACCTGCGTT CGAAGAAGGG CGATTGCACG CTCCATCTCG CCGATCTTCT CGAATGGCAG
GAGGAGCTTC TCCTGATCGT CATGCCGGAT AGGGTACGGC CGGAACCGGA GAGCCTCAAG
CCACTGCTCG GGAAGCTGCA GGAACATGCC GGCAATCGGC TCTATCTCGG CCTGGCGCCG
CGCTATGACG GTTTCGACCG GCATGATTTC GCCGTGCTGG CCACGGTCGC CCGAAAAGCC
GGCATCGGGT TACTCGCCAC CAATGACGCG CTTTATCACG ATCCGGATTA CAGACCGCTT
GCAGATGTCG TGACCGCGAT ACGGGAACAT GTGCCGGTCG CCGGCGCCGG CTTCCTGCTG
CAGAAGAATG CCGAAAGGCA TCTGAAGAGT CCGAGGGAAA TGGCGCGGCT TTTCAGCGAC
TATCCCGAGG CGATTGCCAA TACGCAGAAA TTTTTCAGGC ATCTCGCCTT CAGCCTCGAC
GAGCTCAGGC ATCAATATCC GGATGAAAAC GCCGGGGGTG AGACGCCGGC CGAAAGCTTG
CGGAGGCTCG TCTCGGAAGG TGCCGCCGAG CGCTATCCCG AAGGCGTGCC GGAAAAGGTG
CAGCGTCAGA TCGAGTACGA ACTCGAGCTT ATCAACGACA AGAAATACGA GCCTTACTTC
CTGACCGTCC ACAAGCTTGT GAAATTTGCC CGAAGCGAGA AGATTCTCTG CCAGGGGCGG
GGGTCGGCGG CCAATTCCTC GGTCTGTTTC TGTCTCGGCA TCACCGATGT CGATCCGCAG
AAATTCACGC TGCTGTTCGA CCGCTTCCTG TCGAAGGATC GCGACGAGCC GCCCGATATC
GACGTCGATT TCGAGCATGA GCGGCGCGAA GAGGTGATCC AGTATATCTA CAGGACCTAC
GGCAAGGAGC ATGCGGGCCT TGCGGCTGCG GTGATCAGCT ATCGCTCACG GTCCGCCGGG
CGCGAAGTCG CCAAGGCCTT CGGGTTTTCC GAGGATGTAC AGTCGGCCCT CGTCAGCTCC
ATCTGGGGCT GGGGGAACTC GCCCTTCACA GAGGAACAGG CCAGGGGAGC GGGACTCGAC
GCGGCGGACC CGTCGACGCG GCGCGTCCTT GCCTATGCCA GCCTGCTCAT GAACTATCCG
CGCCATCTCT CGCAGCATGT GGGCGGTTTC GTGATTACCC GCGACAGGCT CGACGAAGTC
GTGCCGATCA TGAATACGGC CATGCCGGAC CGCTACATGA TCGAGTGGGA CAAGGACGAT
CTCGACGAGT TGAAGATCCT CAAGGTCGAT GTGCTGGCGC TCGGCATGCT GACCTGCCTT
GCCAAGGGCT TCAAGCTGCT GGAGGCGCAT TACGGCGAGC CGATAACGCT GGCCGAAATC
TATCAGGATC ACCAGCCGGC CGTTTACGAC ATGATCTGCC GTGCCGATAC GGTCGGCGTC
TTCCAGATCG AAAGCCGGGC GCAGATGAGC ATGCTGCCGC GCCTGCAGCC ACGTGAGATG
TACGATCTGG TGATCGAGGT TGCGATCGTC CGGCCCGGAC CCATCCAGGG CAATATGGTG
CATCCCTATC TCAAACGCCG CGAGGGACAG CGCAAGGGCG AAAAGGTCAA ATATCCGAGC
CCCGAATTGA AGGCGGTCCT GGAAAGGACG CTGGGCGTGC CGCTGTTCCA GGAACAGGCG
ATGCAGATCG CGATCACCGC CGCCGGTTTC TCGCCCAGCG AGGCTGACCG CCTGCGCCGT
GCCATGGCAA CCTTCAAGCG GACCGGCACG ATCCACACCT TCGAGCGGAA GATGGTCGAG
GGCATGGTCG CGAATGGTTA CGAGAGGGAA TTTGCCGAGC GCTGCTTCAA CCAGATCAAG
GGATTCGGCG AATATGGCTT CCCCGAAAGC CATGCCGCCT CCTTCGCCTC TCTCGTCTAT
GCCTCGGCCT GGCTCAAGAC CTATTATCCG GACATTTTCT GCGCCGCGCT CCTGAACGCG
CAGCCCATGG GTTTCTATGC GCCGGCGCAA TTGGTGCGCG ATGCCCGGGA ACACGGAGTG
AAAGTGCTGC CGGTCGACAT CAACCACTCG GATTGGGATG CCTTGCTCGA AGGCGAAGGC
CAGTTCCGAA AGGAGTCTGT CGACCCGCGC CATGCCGATA TGCGGGAGGT GATCAAAACC
CGGAAGGCCG TGCGGCTGGG CTTTCGGCTG GTCAAGGGCC TTAAGCAAGC GGATATGGGG
GCGCTCGTCG CGTGCCGGGG AGAGGGGTAT CGCTCCGTTC ATGACCTATG GTTTCGCTCC
GGCCTTTCCC GCTCCGTTCT GGAGCGCCTT GCGGATGCCG ATGCTTTCCG ATCGCTAGGT
CTCGACCGGC GCGCGGCCCT CTGGGCGGTG AAAGCCCTCG ACGAGCAATC GGCGGTGGAG
CGGTTGCCGC TTTTCGAGGG GGCGGGTTCT CTCGATCTGC GGGCCGAGCC CAAGGTTGCT
CTTCCCGAAA TGCCGGCAGG CGAACAGGTC ATTCATGATT ATCGCACCCT GACGCTCTCC
CTGAAGGCAC ACCCGGTTTC TTTCATGCGG GAGGATTTCT CGCGAACGGG GATCCTGCGC
AGCCGGGATC TGGCGGCAAC TGCAACGGGG AAATGGGTAA CTGTGGCGGG TCTGGTGCTG
GTCAGGCAGA GGCCGGGCTC CGCCAATGGA GTCATCTTCA TGACGATCGA GGACGAAACC
GGAATCGCCA ATATCATCGT CTGGGAAAAG ACGTTCCGGA AATACCGGCC TCAGGTCATG
GGATCGCGGC TCGTGAAAAT ACGTGGACGC TTGCAGAACC AGAGCGGCGT CATTCACGTG
GTCGCCGATC ATCTGGAGGA TATAACGCCG ATGCTCGGCC TCCTGCGCCG TGAGGCGCGG
CGTTTCGGCG CCAACGACCG CGCGGATGGA GCCTTGCGGC CGAGCGGGGA CGCTCGCGAA
AAGAGAAAGC TCCGGCAATT GCGCCTCGGC CTGCCCGGCG GGGCAGAGCC CGAGGGCGAG
GCAGCCGCTC AGGTGGCCGA GGCCATGCCG AAAGGGCGCA ATTTTCATTA G
 
Protein sequence
MSGDTAFYEL GACTNFSFLE GAAPAEEMVV FAKKARLAGL GIADRNSVAG VVRAHAKAKM 
ENYPFQPGAR LVFADGTPDV LAYPRNRRGW GHLCRLLSAG NLRSKKGDCT LHLADLLEWQ
EELLLIVMPD RVRPEPESLK PLLGKLQEHA GNRLYLGLAP RYDGFDRHDF AVLATVARKA
GIGLLATNDA LYHDPDYRPL ADVVTAIREH VPVAGAGFLL QKNAERHLKS PREMARLFSD
YPEAIANTQK FFRHLAFSLD ELRHQYPDEN AGGETPAESL RRLVSEGAAE RYPEGVPEKV
QRQIEYELEL INDKKYEPYF LTVHKLVKFA RSEKILCQGR GSAANSSVCF CLGITDVDPQ
KFTLLFDRFL SKDRDEPPDI DVDFEHERRE EVIQYIYRTY GKEHAGLAAA VISYRSRSAG
REVAKAFGFS EDVQSALVSS IWGWGNSPFT EEQARGAGLD AADPSTRRVL AYASLLMNYP
RHLSQHVGGF VITRDRLDEV VPIMNTAMPD RYMIEWDKDD LDELKILKVD VLALGMLTCL
AKGFKLLEAH YGEPITLAEI YQDHQPAVYD MICRADTVGV FQIESRAQMS MLPRLQPREM
YDLVIEVAIV RPGPIQGNMV HPYLKRREGQ RKGEKVKYPS PELKAVLERT LGVPLFQEQA
MQIAITAAGF SPSEADRLRR AMATFKRTGT IHTFERKMVE GMVANGYERE FAERCFNQIK
GFGEYGFPES HAASFASLVY ASAWLKTYYP DIFCAALLNA QPMGFYAPAQ LVRDAREHGV
KVLPVDINHS DWDALLEGEG QFRKESVDPR HADMREVIKT RKAVRLGFRL VKGLKQADMG
ALVACRGEGY RSVHDLWFRS GLSRSVLERL ADADAFRSLG LDRRAALWAV KALDEQSAVE
RLPLFEGAGS LDLRAEPKVA LPEMPAGEQV IHDYRTLTLS LKAHPVSFMR EDFSRTGILR
SRDLAATATG KWVTVAGLVL VRQRPGSANG VIFMTIEDET GIANIIVWEK TFRKYRPQVM
GSRLVKIRGR LQNQSGVIHV VADHLEDITP MLGLLRREAR RFGANDRADG ALRPSGDARE
KRKLRQLRLG LPGGAEPEGE AAAQVAEAMP KGRNFH
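
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the method the database itself uses, of how one might strip the block formatting, compute GC content, and translate the gene sequence. It assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in the permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = clean("ATGAGCGGGG ATACTGCCTT CTACGAACTC")
print(translate(first_blocks))  # MSGDTAFYEL
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_percent` can be compared against the GC content field of the record.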