Gene Information |
Locus tag | Smed_4042 |
Symbol | |
ID | 5318609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 503521 |
End bp | 505338 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775850 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001312783 |
Protein GI | 150376187 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
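The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene length counts the terminal stop codon while the protein length does not; the record states neither. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 503521
end_bp = 505338
gene_length_bp = 1818
protein_length_aa = 605


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold: 505338 - 503521 + 1 = 1818 bp, and 1818 / 3 - 1 = 605 aa. The strand field does not affect either length.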
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAATG CTTCCGACAT TCTCATAGAG ACGCTTATCG AATGGAAGGT CGAAGTGGTC TTCGGCCTGC CGGGAGACGG CATCAACGGT ATCATGGAGG CGCTGAGGCG GCGGCAGGAC CGCATCCGCT TCGTCTCCGT GCGCCACGAG CAGTCCGCTG CCTTCATGGC GTGCGCCTAT GCCAAGTTCA CGGGCAGGCT CGGCGTTTGT CTTGCCACTT CGGGTCCGGG CGGAACGAAC CTCCTGACCG GCCTCTACGA TGCGAAGCTC GATCAGATGC CGGTGCTGGC GATCACCGGC ATGCAGTATC ATGACCTGAT AGAGACTTTT TCGCAGCAGG ACGTCGATCT CACCCGCGTC TTCGAAAACG TCGCCGTCTA CAATGCTCAG GTGAACGATG CCGCCCACAT GGAAAATCTC GCAAACCTCG CCTGCCGGTC GGCCCTCTCG AAGCGCGGGG TCGCGCATCT TTCGATCGCC AACGACGTGC AAGAGCGTAT GGCCGGCGGT GGGCGGTCCC GCCGCAACCG CGAAGGGCAC ATGCCGAGCC GCTTTTTCGA AGGCAGGCTG GTGCCGCGCG AAGACGATAT TCTGCGCGCG GCGGACCTTC TGAATGCGGG CAGGAAGGTT GCGATTCTTG CCGGTCGCGG CGCGCTCGAG GCAAAGGGGC TTTTGCGCGA GACGGCGGAC CTGTTGGGCG CGCCGGTGGC AAAAGCCCTG CTCGGAAAGG CCGTGCTGCC CGACGACGAC CCGTTCACCA CCGGAGGCAT CGGAATTCTG GGAACGGTAC CGTCGCAGGA GATCATGCAG CAATGCGACA CGCTGCTGAT CGTCGGTTCG ACCTTCCCCT ATATCGAATA CTATCCGAAG CCCCACGCAG CCACCGGAAT CCAGATCGAT CATGACCCGC AGCGCATCGG GCTTCGCTAT CCGGTCGAAG TCGGGCTTGT CGGCGCGGCG GGAGAGACCC TTCGCATGCT CAATGAACGG CTTCAGCGCA AAGCAGACCG GACGTTTCTC GAACAGGCGC AAGAGAAAAC GCGAAACTGG CGGCGTGAAT TGCGTGCCAT GGAGGGCGAC AGGAGTTCTC CCCTGAAGCC GCAGGCCGCG GTCGGCGCGT TTGGCAGGCG CATCGCGGCA AACGGCATCG TCGTCACAGA TTCCGGACAA AACACCGAAC TTGCGGCCCG CCATGTGGAT CTCGGTGCCG ACCATATGTT TGCGGTATCG GGCGCACTCG CATCCATGGC CTCGGGTCTC CCTTACGCGA TCGCGGCTGG CATCGCCATG CCCGCCCGTC CCATCTACGC GGTGGTGGGC GATGGCGGCT TCGCCATGCA GCTCGGAGAA TTCGCCACGG CCGTCCGCTA TGAGATTCCT TTGAAGCTCC TGGTGATCCG TAACGACATG CTGAACCAGA TCGCCTGGGA ACAGATGATG TTCCTGGGCA ATCCGCAATT CGCCTGCGAG CTGCCGCCGA TCGACTTCGC CGCTGCGGCA GAAGCCATGG GCGGCCGAGG CTTCACGATC CGGTCCTTCG ACGAAATAGA CGGGGTTCTG GATCAGGCAT TCGCAGCCGA AGGACCGGTC GTCATTCAGG CGCTGGTCGA TCGCTACGAG CCGCTGATGC CGCCCAAAAT GCCCGAGGAC TACGCCCGCA ATTTCCGTGC CGCGCTTCCG CAAACGCCGG GACATGAGAA AATCGAAGAA AACCTCAGGC ATTCCTCCGC AGGCAGGAAA GTCACCGAGG AGGAGCCGCA GGCACCACAT GAAGCGGCGC CGGACGCCAA TACCGGTGAC 
CTGCCAGGGA TACCGTAG
|
Protein sequence | MPNASDILIE TLIEWKVEVV FGLPGDGING IMEALRRRQD RIRFVSVRHE QSAAFMACAY AKFTGRLGVC LATSGPGGTN LLTGLYDAKL DQMPVLAITG MQYHDLIETF SQQDVDLTRV FENVAVYNAQ VNDAAHMENL ANLACRSALS KRGVAHLSIA NDVQERMAGG GRSRRNREGH MPSRFFEGRL VPREDDILRA ADLLNAGRKV AILAGRGALE AKGLLRETAD LLGAPVAKAL LGKAVLPDDD PFTTGGIGIL GTVPSQEIMQ QCDTLLIVGS TFPYIEYYPK PHAATGIQID HDPQRIGLRY PVEVGLVGAA GETLRMLNER LQRKADRTFL EQAQEKTRNW RRELRAMEGD RSSPLKPQAA VGAFGRRIAA NGIVVTDSGQ NTELAARHVD LGADHMFAVS GALASMASGL PYAIAAGIAM PARPIYAVVG DGGFAMQLGE FATAVRYEIP LKLLVIRNDM LNQIAWEQMM FLGNPQFACE LPPIDFAAAA EAMGGRGFTI RSFDEIDGVL DQAFAAEGPV VIQALVDRYE PLMPPKMPED YARNFRAALP QTPGHEKIEE NLRHSSAGRK VTEEEPQAPH EAAPDANTGD LPGIP