Gene Information |
Locus tag | Smed_2631 |
Symbol | |
ID | 5323500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2731583 |
End bp | 2734180 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791575 |
Product | DNA ligase D |
Protein accession | YP_001328296 |
Protein GI | 150397829 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase; [COG3285] Predicted eukaryotic-type DNA primase |
TIGRFAM ID | [TIGR02776] DNA ligase D; [TIGR02777] DNA ligase D, 3'-phosphoesterase domain; [TIGR02778] DNA polymerase LigD, polymerase domain; [TIGR02779] DNA polymerase LigD, ligase domain |
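
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the record does not state either explicitly). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table for Smed_2631.
START_BP = 2731583
END_BP = 2734180
GENE_LENGTH_BP = 2598
PROTEIN_LENGTH_AA = 865


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus the stop codon, assuming a complete, unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Report whether the computed values agree with the fields in the record.
print("gene length consistent:", computed_length == GENE_LENGTH_BP)
print("protein length consistent:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span works out to 2598 bp, or 866 codons, one of which is the stop codon, which agrees with the listed protein length of 865 aa.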
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCTC GCAACGAAAC ACTTACCGAA TATAACAGGC GCCGTGACTT CACGCGGACG AAGGAGCCCA AGGGAACCGT CGCTCGTCGC AGCGGGAACA GGATGCGGTT CCTGGTCCAG AAGCACGCCG CAACCCGCCT GCATTACGAC TTTCGCCTGG AATGGGAGGG CGTGCTGAAG AGCTGGGCGG TGACGCGCGG CCCAAGCCTC AACCCCGAGG ACAAGCGCCT TGCCGTCCGC ACCGAAGACC ACCCGCTTGC TTATGGCGAC TTCGAAGGAA CCATCCCGAA GGGAGAATAT GGCGGCGGCA CCGTGATGCT CTGGGATACG GGCTGGTGGG AGCCGGAGGA TGACCCGTCC AAAGCCTTGA GGAAGGGCAA GCTCTCCTTC AAGCTGCATG GCAGCCGCAT GAAGGGAGGC TGGGCTCTGG TACGCATGCG CCCACGCGAA GGCGAGAAGC GCGAGAACTG GCTGCTCGTC AAGGAAACGG ATGAGATCGC CTCCAAGGAT GGCGAGAGCC TGATCAACGA AAACATCACC AGCGTCGTCA CCGGTCGCAC CATGGAAGAG ATCGCCGAAG GCAGGGGCGA AAAGCGCTCA CGCGTCTGGC ACTCGAACAA AAGCACGGCG GCCAATCTCA GGGCCGGCGC GGTCGCTGAA GACGGCAATG CACGCAAGCG CCCCACGCGA AAGCCCTCAG GCAAACTGCC TACCTTCCGG GCGCCTCAGC TGGCGACGCT GGTGACCCAG GTACCCGTGG GCGACGCGTG GCTGAACGAA GCCAAGTTCG ACGGCTACCG GCTTGTTTGC GCCCTGGGCG GCGGTACTGC CCGCTGCTAC ACGCGCAACG GACTCGATTG GACCGAAAAG TTCCCGGTCA TTGCCGCAGC CCTTGCCGAA CTCGACTGCC GGTCCGCCCT TATCGACGGC GAAGTCGTAG CACTGTCAGA AGGCGGGTCC ACTTTCTCCG CCCTGCAGAA AGCGCTTGCA ACAGGGGCCA GCACACGGCT TTACGCGTTC GATCTCATCG AGCTCGACGG CAAGGATTTG AGCAGAAAGC CGCTCGTGGA GCGCAAGGAA AAGCTCGAAG CGCTGCTCCA AACGCTGGGC ACGACCTCGA CCATCCAGTT CAGCGAGCAT GTTCGCGGGA ATGGCGAGCA CGTGCTTGCC GCCATATGCA AGGCTGGTCA GGAGGGTATA ATCGCCAAGG AGGCGAACGC CCCCTACCGG AGCGGGCGCA CCCGAAGCTG GCTCAAGGTG AAATGCACCA AACGCCAGGA ATTCGTCATC GGCGGCTACA GCCTGTCGGA CAAGAAGGGG CGCGCCTTCG CCTCGCTCAT CGTCGGGACT TTCGAAGGCG GGAAGCTGAT CTATCGTGGC GGGGTCGGAA CCGGTTTCAG CAAAAAGACG ATGGAGGACC TCGCCGCAGC CTTCGCCTCA CGCAAAAGAG AAACGTCGCC CTTCGACAGC ATACCGCGAG AGAGAATGCG ACATTCAGTT TGGCTGAAGC CGGACCTGGT GGCCGAGGTA GACTTCGCCG AATTCACGGC TGAAGGCCAT ATCCGCCACG GTTCGTTCGA GGGATTGCGC GAGGACAAGG AGGCAAAGGC CGTGAAACTG GAGACATCGA AGCCAGCGGA AGTGGAGTCC GGGACCACCA CCAAGGGTAA ATCCACCACA AAGGCGCGCA GGACATCGGC CGCACAAGGT GACGCCGACG TCCTCGGCAT CCGTATTTCG CATCCGGACC GCGTCCTGTT CAAAGGTCAG GGCATCACCA AGATCGATCT TGCCCGCTAT 
TATGCAATCG TCGCCGACAG GATGCTCCCC TTTGCCGCCG ACCATCCCGT CTCGCTGGTG CGCTGCCCGC AGGGCGGCGA GCGGCAGTGC TTCTTCCAGA AGCACGCAAG CGACGGCTTT CCCGATGCGA TCAGGGAAGT GCCCATCACC GAATCGTCGG GAGACACCGA AGACTACATG TATATCCACG ATGCCAAGGG GCTCGTCGCA GCAGTACAGA TGGGGACGCT CGAATTTCAC ATCTGGGGCG CAAGGACCGA TCGGCTGGAG AAACCCGATC GGCTGGTCTT CGATCTCGAC CCCGATCCAA GCGTCGACTT CGCAACCGTC AAAGCAGCGG CCGTAGAGCT TCGCGACGAG CTCGCCGGGA TCGGCCTGAA AACCGTGCCG ATGGTGACGG GCGGCAAGGG AGTCCACGTG ATCGTGCCGC TCCGCCCCCA TGCCGAATGG GATGAAGTAA AAGGGTTCGC CAAGGCGCTC GCGCAATCTT TTGCCGAGCG CGATCCGGAT CATTTCGTCG CGACCATGTC CAAGGCCAAG CGCAAGGGGA GGATTTTCAT CGACTGGCTG CGCAACGACC GCGGCGCCAC GGCGATAGCC CCCTACTCCA CCCGTGCGCG CGCCGGCGGA CCGGTCGCCA CTCCGGTCGG CTGGGAGGAG CTTGAAAGCC TCGATGCCGC CAATGGGTTT CATATTCCCG AAATTCTCGC ACGCATCGAG GCCGGAACCG ATCCCTGGCG GGACATCGGC AAGATCAGTC AGTCGCTGAC GAAGAAGATC CTGAACTCGG TCGCGTGA
|
Protein sequence | MAARNETLTE YNRRRDFTRT KEPKGTVARR SGNRMRFLVQ KHAATRLHYD FRLEWEGVLK SWAVTRGPSL NPEDKRLAVR TEDHPLAYGD FEGTIPKGEY GGGTVMLWDT GWWEPEDDPS KALRKGKLSF KLHGSRMKGG WALVRMRPRE GEKRENWLLV KETDEIASKD GESLINENIT SVVTGRTMEE IAEGRGEKRS RVWHSNKSTA ANLRAGAVAE DGNARKRPTR KPSGKLPTFR APQLATLVTQ VPVGDAWLNE AKFDGYRLVC ALGGGTARCY TRNGLDWTEK FPVIAAALAE LDCRSALIDG EVVALSEGGS TFSALQKALA TGASTRLYAF DLIELDGKDL SRKPLVERKE KLEALLQTLG TTSTIQFSEH VRGNGEHVLA AICKAGQEGI IAKEANAPYR SGRTRSWLKV KCTKRQEFVI GGYSLSDKKG RAFASLIVGT FEGGKLIYRG GVGTGFSKKT MEDLAAAFAS RKRETSPFDS IPRERMRHSV WLKPDLVAEV DFAEFTAEGH IRHGSFEGLR EDKEAKAVKL ETSKPAEVES GTTTKGKSTT KARRTSAAQG DADVLGIRIS HPDRVLFKGQ GITKIDLARY YAIVADRMLP FAADHPVSLV RCPQGGERQC FFQKHASDGF PDAIREVPIT ESSGDTEDYM YIHDAKGLVA AVQMGTLEFH IWGARTDRLE KPDRLVFDLD PDPSVDFATV KAAAVELRDE LAGIGLKTVP MVTGGKGVHV IVPLRPHAEW DEVKGFAKAL AQSFAERDPD HFVATMSKAK RKGRIFIDWL RNDRGATAIA PYSTRARAGG PVATPVGWEE LESLDAANGF HIPEILARIE AGTDPWRDIG KISQSLTKKI LNSVA
|
| |