Gene Smed_4042 details


Gene Information

Locus tag: Smed_4042
Symbol:
ID: 5318609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 503521
End bp: 505338
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 62%
IMG OID: 640775850
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001312783
Protein GI: 150376187
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
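
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 503521
END_BP = 505338
GENE_LENGTH_BP = 1818
PROTEIN_LENGTH_AA = 605


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed to be present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the four fields agree: 505338 - 503521 + 1 = 1818 bp, and 1818 / 3 - 1 = 605 aa.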


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAATG CTTCCGACAT TCTCATAGAG ACGCTTATCG AATGGAAGGT CGAAGTGGTC 
TTCGGCCTGC CGGGAGACGG CATCAACGGT ATCATGGAGG CGCTGAGGCG GCGGCAGGAC
CGCATCCGCT TCGTCTCCGT GCGCCACGAG CAGTCCGCTG CCTTCATGGC GTGCGCCTAT
GCCAAGTTCA CGGGCAGGCT CGGCGTTTGT CTTGCCACTT CGGGTCCGGG CGGAACGAAC
CTCCTGACCG GCCTCTACGA TGCGAAGCTC GATCAGATGC CGGTGCTGGC GATCACCGGC
ATGCAGTATC ATGACCTGAT AGAGACTTTT TCGCAGCAGG ACGTCGATCT CACCCGCGTC
TTCGAAAACG TCGCCGTCTA CAATGCTCAG GTGAACGATG CCGCCCACAT GGAAAATCTC
GCAAACCTCG CCTGCCGGTC GGCCCTCTCG AAGCGCGGGG TCGCGCATCT TTCGATCGCC
AACGACGTGC AAGAGCGTAT GGCCGGCGGT GGGCGGTCCC GCCGCAACCG CGAAGGGCAC
ATGCCGAGCC GCTTTTTCGA AGGCAGGCTG GTGCCGCGCG AAGACGATAT TCTGCGCGCG
GCGGACCTTC TGAATGCGGG CAGGAAGGTT GCGATTCTTG CCGGTCGCGG CGCGCTCGAG
GCAAAGGGGC TTTTGCGCGA GACGGCGGAC CTGTTGGGCG CGCCGGTGGC AAAAGCCCTG
CTCGGAAAGG CCGTGCTGCC CGACGACGAC CCGTTCACCA CCGGAGGCAT CGGAATTCTG
GGAACGGTAC CGTCGCAGGA GATCATGCAG CAATGCGACA CGCTGCTGAT CGTCGGTTCG
ACCTTCCCCT ATATCGAATA CTATCCGAAG CCCCACGCAG CCACCGGAAT CCAGATCGAT
CATGACCCGC AGCGCATCGG GCTTCGCTAT CCGGTCGAAG TCGGGCTTGT CGGCGCGGCG
GGAGAGACCC TTCGCATGCT CAATGAACGG CTTCAGCGCA AAGCAGACCG GACGTTTCTC
GAACAGGCGC AAGAGAAAAC GCGAAACTGG CGGCGTGAAT TGCGTGCCAT GGAGGGCGAC
AGGAGTTCTC CCCTGAAGCC GCAGGCCGCG GTCGGCGCGT TTGGCAGGCG CATCGCGGCA
AACGGCATCG TCGTCACAGA TTCCGGACAA AACACCGAAC TTGCGGCCCG CCATGTGGAT
CTCGGTGCCG ACCATATGTT TGCGGTATCG GGCGCACTCG CATCCATGGC CTCGGGTCTC
CCTTACGCGA TCGCGGCTGG CATCGCCATG CCCGCCCGTC CCATCTACGC GGTGGTGGGC
GATGGCGGCT TCGCCATGCA GCTCGGAGAA TTCGCCACGG CCGTCCGCTA TGAGATTCCT
TTGAAGCTCC TGGTGATCCG TAACGACATG CTGAACCAGA TCGCCTGGGA ACAGATGATG
TTCCTGGGCA ATCCGCAATT CGCCTGCGAG CTGCCGCCGA TCGACTTCGC CGCTGCGGCA
GAAGCCATGG GCGGCCGAGG CTTCACGATC CGGTCCTTCG ACGAAATAGA CGGGGTTCTG
GATCAGGCAT TCGCAGCCGA AGGACCGGTC GTCATTCAGG CGCTGGTCGA TCGCTACGAG
CCGCTGATGC CGCCCAAAAT GCCCGAGGAC TACGCCCGCA ATTTCCGTGC CGCGCTTCCG
CAAACGCCGG GACATGAGAA AATCGAAGAA AACCTCAGGC ATTCCTCCGC AGGCAGGAAA
GTCACCGAGG AGGAGCCGCA GGCACCACAT GAAGCGGCGC CGGACGCCAA TACCGGTGAC
CTGCCAGGGA TACCGTAG
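
The GC content field of this record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks shown here; `gc_fraction` is a hypothetical helper name, and the demonstration uses only the first block of the sequence rather than all 1818 bases.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a small demonstration.
first_block = "ATGCCGAATG"
print(gc_fraction(first_block))  # 0.5
```

Passing the full gene sequence to the same function and rounding to a whole percentage should give a value comparable to the GC content field.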
 
Protein sequence
MPNASDILIE TLIEWKVEVV FGLPGDGING IMEALRRRQD RIRFVSVRHE QSAAFMACAY 
AKFTGRLGVC LATSGPGGTN LLTGLYDAKL DQMPVLAITG MQYHDLIETF SQQDVDLTRV
FENVAVYNAQ VNDAAHMENL ANLACRSALS KRGVAHLSIA NDVQERMAGG GRSRRNREGH
MPSRFFEGRL VPREDDILRA ADLLNAGRKV AILAGRGALE AKGLLRETAD LLGAPVAKAL
LGKAVLPDDD PFTTGGIGIL GTVPSQEIMQ QCDTLLIVGS TFPYIEYYPK PHAATGIQID
HDPQRIGLRY PVEVGLVGAA GETLRMLNER LQRKADRTFL EQAQEKTRNW RRELRAMEGD
RSSPLKPQAA VGAFGRRIAA NGIVVTDSGQ NTELAARHVD LGADHMFAVS GALASMASGL
PYAIAAGIAM PARPIYAVVG DGGFAMQLGE FATAVRYEIP LKLLVIRNDM LNQIAWEQMM
FLGNPQFACE LPPIDFAAAA EAMGGRGFTI RSFDEIDGVL DQAFAAEGPV VIQALVDRYE
PLMPPKMPED YARNFRAALP QTPGHEKIEE NLRHSSAGRK VTEEEPQAPH EAAPDANTGD
LPGIP
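
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one simple way to check this, not the database's own procedure. It uses the standard codon assignments, which translation table 11 shares for codons after the start codon. Table 11 also permits alternative start codons that are read as M; they are not handled here, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence from this record.
print(translate("ATGCCGAATG CTTCCGACAT TCTCATAGAG"))  # MPNASDILIE
print(translate("CTGCCAGGGA TACCGTAG"))               # LPGIP
```

Both fragments match the start and end of the protein sequence listed above. The final fragment is in frame because the preceding 1800 bases are a whole number of codons.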