Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2121 |
Symbol | |
ID | 5322981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2184895 |
End bp | 2186541 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640791059 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_001327789 |
Protein GI | 150397322 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.299662 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGG GCGGACAACT GGTGGTGGAG GCGCTTGTCG CCAATGGGGT GAAGCGCATC TCCTGCGTTC CGGGCGAAAG CTATCTGGCC GTGCTCGATG CGCTCTACGA CTCGGACGTC GAAGTCGTCG TCTGCCGGCA GGAGGGCGGC GCGGCCATGA TGGCGGATGC TTGGGGACGG CTGACGGGCG AACCGGGCAT CTGCATGGTC ACCCGCGGCC CGGGCGCAAC CAATGCCTCC GCCGGGCTCC ATGTGGCACG GCAGGATTCG GTTCCCATGA TCCTCTTCAT CGGCCAGGTG CAGCGGGAGG CGCGCGAGCG GGAGGCTTTC CAGGAGATCG AATACCGTCG CGCCTTCACC GAGGTCGCAA AATGGGTGGG CGAGATAGAC GACCCGGCAC GCATCCCCGA ATTCGTCACT CGCGCCTTCG CCGTCGCGAC GTCCGGCCGC CCCGGCCCAG TGGTCCTGAC CTTGCCGGAG GACATGCTGA CACAGAGTGC CGAGGCCCCG GCCGCACGCG CCTACCAACC GGTGGAAAGC CACCCTGGAA CCGGCCAAAT CGCGCGCCTC GCGGAACTTC TCTCTTCAGC GAAAAGGCCG ATCGCCATTC TCGGCGGCAC GCGCTGGTCT GCCGAAGCCG CGGCCGGGCT CAAGAGCTTC GCCGAACGCT GGCACCTCCC GGTCGGCTGC TCCTTCCGTC GGCAGATGCT TTTCGACCAC CTGCACCCGA ATTATGCGGG CGATGTCGGC ATCGGCATCA ATCCGGCACT GGCAGGAGAG ATCAGGGAAG CCGATCTCAT CCTCCTCATC GGCGGCCGGT TCTCGGAAAT GCCCTCCTCC GGCTACACGC TGATCGACGT CCCCTACCCC AGGCAGACCC TGGTGCATGT CCATCCGGAT CCGGGGGAAC TCGGCCGGGT CTACCGTCCG GACCTCGCGA TTGCCGCGAG CCCGCAGGAC TTCGTCGCGG CGCTCTCCGG CCTGACGCCT GCGGCCGAAC CCAGCTGGTC CGCCCGCACA AGGGAAATGC ATGCTGCCTA TCTCAAATGG TCGACGCCGC CGGAAAAGGG ACCGGGCGAC GTTCAGATGG GCCCGATCGT CAACTGGCTG GAAGCGAATA CGGGGCCGGA GACGATTTTT ACCAACGGCG CGGGCAACTA CGCGACCTGG CTCCATCGCT TCCACCGCTT CCGCCGTTAC GGCACGCAGG CTGCTCCCGC TTCCGGTTCG ATGGGCTACG GCCTGCCGGC CGCTGTCGCA GCCAAGCATC TGCATCCCGA CCGCGAGGTG ATCTGCTTTG CCGGAGACGG CTGTTTCCTC ATGCACGGCC AGGAATTCGC GACGGCCGTC CGCTACGGAC TCCCCCTTAT CGTGCTCGTC ATCAACAACA ACATCTACGG CACGATCCGC ATGCACCAGG AGCGCGAATA TCCCGGCCGC GTCAGTGCCA CGGATCTGAC GAACCCGGAT TTTGCCGCAC TTGCCCGCGC CTATGGCGGA CATGGCGAAA CGGTGGCGCG CACCGAGGAG TTTGCGGACG CTTTCCTGCG TGCGCGTGAA AGCGGCAAGC CCGCCATTAT CGAGGTCAAG CTCGATCCGG AAGCGATCAC GCCGACGCGG ACGCTGAGCG AGATCCGCAA AGGATAG
|
Protein sequence | MKTGGQLVVE ALVANGVKRI SCVPGESYLA VLDALYDSDV EVVVCRQEGG AAMMADAWGR LTGEPGICMV TRGPGATNAS AGLHVARQDS VPMILFIGQV QREAREREAF QEIEYRRAFT EVAKWVGEID DPARIPEFVT RAFAVATSGR PGPVVLTLPE DMLTQSAEAP AARAYQPVES HPGTGQIARL AELLSSAKRP IAILGGTRWS AEAAAGLKSF AERWHLPVGC SFRRQMLFDH LHPNYAGDVG IGINPALAGE IREADLILLI GGRFSEMPSS GYTLIDVPYP RQTLVHVHPD PGELGRVYRP DLAIAASPQD FVAALSGLTP AAEPSWSART REMHAAYLKW STPPEKGPGD VQMGPIVNWL EANTGPETIF TNGAGNYATW LHRFHRFRRY GTQAAPASGS MGYGLPAAVA AKHLHPDREV ICFAGDGCFL MHGQEFATAV RYGLPLIVLV INNNIYGTIR MHQEREYPGR VSATDLTNPD FAALARAYGG HGETVARTEE FADAFLRARE SGKPAIIEVK LDPEAITPTR TLSEIRKG
|
| |