Gene Smed_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2121 
Symbol 
ID5322981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2184895 
End bp2186541 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content66% 
IMG OID640791059 
Productthiamine pyrophosphate protein 
Protein accessionYP_001327789 
Protein GI150397322 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.299662 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGG GCGGACAACT GGTGGTGGAG GCGCTTGTCG CCAATGGGGT GAAGCGCATC 
TCCTGCGTTC CGGGCGAAAG CTATCTGGCC GTGCTCGATG CGCTCTACGA CTCGGACGTC
GAAGTCGTCG TCTGCCGGCA GGAGGGCGGC GCGGCCATGA TGGCGGATGC TTGGGGACGG
CTGACGGGCG AACCGGGCAT CTGCATGGTC ACCCGCGGCC CGGGCGCAAC CAATGCCTCC
GCCGGGCTCC ATGTGGCACG GCAGGATTCG GTTCCCATGA TCCTCTTCAT CGGCCAGGTG
CAGCGGGAGG CGCGCGAGCG GGAGGCTTTC CAGGAGATCG AATACCGTCG CGCCTTCACC
GAGGTCGCAA AATGGGTGGG CGAGATAGAC GACCCGGCAC GCATCCCCGA ATTCGTCACT
CGCGCCTTCG CCGTCGCGAC GTCCGGCCGC CCCGGCCCAG TGGTCCTGAC CTTGCCGGAG
GACATGCTGA CACAGAGTGC CGAGGCCCCG GCCGCACGCG CCTACCAACC GGTGGAAAGC
CACCCTGGAA CCGGCCAAAT CGCGCGCCTC GCGGAACTTC TCTCTTCAGC GAAAAGGCCG
ATCGCCATTC TCGGCGGCAC GCGCTGGTCT GCCGAAGCCG CGGCCGGGCT CAAGAGCTTC
GCCGAACGCT GGCACCTCCC GGTCGGCTGC TCCTTCCGTC GGCAGATGCT TTTCGACCAC
CTGCACCCGA ATTATGCGGG CGATGTCGGC ATCGGCATCA ATCCGGCACT GGCAGGAGAG
ATCAGGGAAG CCGATCTCAT CCTCCTCATC GGCGGCCGGT TCTCGGAAAT GCCCTCCTCC
GGCTACACGC TGATCGACGT CCCCTACCCC AGGCAGACCC TGGTGCATGT CCATCCGGAT
CCGGGGGAAC TCGGCCGGGT CTACCGTCCG GACCTCGCGA TTGCCGCGAG CCCGCAGGAC
TTCGTCGCGG CGCTCTCCGG CCTGACGCCT GCGGCCGAAC CCAGCTGGTC CGCCCGCACA
AGGGAAATGC ATGCTGCCTA TCTCAAATGG TCGACGCCGC CGGAAAAGGG ACCGGGCGAC
GTTCAGATGG GCCCGATCGT CAACTGGCTG GAAGCGAATA CGGGGCCGGA GACGATTTTT
ACCAACGGCG CGGGCAACTA CGCGACCTGG CTCCATCGCT TCCACCGCTT CCGCCGTTAC
GGCACGCAGG CTGCTCCCGC TTCCGGTTCG ATGGGCTACG GCCTGCCGGC CGCTGTCGCA
GCCAAGCATC TGCATCCCGA CCGCGAGGTG ATCTGCTTTG CCGGAGACGG CTGTTTCCTC
ATGCACGGCC AGGAATTCGC GACGGCCGTC CGCTACGGAC TCCCCCTTAT CGTGCTCGTC
ATCAACAACA ACATCTACGG CACGATCCGC ATGCACCAGG AGCGCGAATA TCCCGGCCGC
GTCAGTGCCA CGGATCTGAC GAACCCGGAT TTTGCCGCAC TTGCCCGCGC CTATGGCGGA
CATGGCGAAA CGGTGGCGCG CACCGAGGAG TTTGCGGACG CTTTCCTGCG TGCGCGTGAA
AGCGGCAAGC CCGCCATTAT CGAGGTCAAG CTCGATCCGG AAGCGATCAC GCCGACGCGG
ACGCTGAGCG AGATCCGCAA AGGATAG
 
Protein sequence
MKTGGQLVVE ALVANGVKRI SCVPGESYLA VLDALYDSDV EVVVCRQEGG AAMMADAWGR 
LTGEPGICMV TRGPGATNAS AGLHVARQDS VPMILFIGQV QREAREREAF QEIEYRRAFT
EVAKWVGEID DPARIPEFVT RAFAVATSGR PGPVVLTLPE DMLTQSAEAP AARAYQPVES
HPGTGQIARL AELLSSAKRP IAILGGTRWS AEAAAGLKSF AERWHLPVGC SFRRQMLFDH
LHPNYAGDVG IGINPALAGE IREADLILLI GGRFSEMPSS GYTLIDVPYP RQTLVHVHPD
PGELGRVYRP DLAIAASPQD FVAALSGLTP AAEPSWSART REMHAAYLKW STPPEKGPGD
VQMGPIVNWL EANTGPETIF TNGAGNYATW LHRFHRFRRY GTQAAPASGS MGYGLPAAVA
AKHLHPDREV ICFAGDGCFL MHGQEFATAV RYGLPLIVLV INNNIYGTIR MHQEREYPGR
VSATDLTNPD FAALARAYGG HGETVARTEE FADAFLRARE SGKPAIIEVK LDPEAITPTR
TLSEIRKG