Gene Smed_6051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6051 
Symbol 
ID5320353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp997658 
End bp998839 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content54% 
IMG OID640777706 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001314638 
Protein GI150378043 
COG category[G] Carbohydrate transport and metabolism
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.221657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.609485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACGATAG CAGAAGGCCC GAGCCGTCGC CTAGCGGCCG TCGCGGGCTG CTGCCATAGA 
CGCCGGTGCC TCTCGATGAG AGGCACCGGT CCGGCAAGTC TCAACCGATT TTGGAGTCAG
CCAATGACAG TTGCATTTCG CAGTAAGCCA GTGACAGTTG AGGACCCGGC ATATAGAGGT
CTGGCTATCG GCACTTATGC GCCTGCAGTC ACACCGGCGC CTGCCTTCGC GCAGACCGAC
CTGAAAATTC CCTATGTGAC CGAGCTTTTG AGCAAGGCCA TCCTGCCGGA CGACGATGCA
CAATTTGCCG GGGTGTTTGA TGGCAAATCG TCATCCCCCT ACGTCCAGTC TTTGGTCGAA
AGCTCCGATT TCGTATTGGC CCTAGGCGTC TGGTTGACGG ACATCAATGA TTTGGGCTGG
CCCATCGATC TCGATAAAAC AGCCTTCGCT TCCTGGAATA CACTGAAATA TGGCACAAGT
TTCATCGCAC AGGTTTCGCT CGCGGACCGC GTGGACGGCT TGATTGATAA AAGGGTAACG
TGCAAAGACC GAATTCTTCC AGCGAATACC GTCCAGCAAG CGCCCGTCAT GAATCCGGCA
GACGAACTTA CGTATCAGGT CTTCTACGAT TTCATCCAGA GCCAGGTCGA CGAAGATACC
ATCGTCGGCG CCGACGCGAG CTTAAATTAT TTCGGGAGCA TGCTTCTCAA AGTGGGTGCT
CGCCGCGGTT TCATCGTTCA ATCATCCTAT TCGGCGATCG GCTATATTGG GCCGGCCGCG
ACAGGGTTAT CTCTGGCGAA GCAAGATGGC CAGAGACTGA TGGTCTTATC GGGTGATGGC
GGGTTTCGGA TGACCGTTCA ATGCCTGTCG ACACAAACCC GTTTCAATCT CAATCCGATC
ATCTTCGTGA CAGACAACGG TGTCTATGGC GTGGAGCAAT GGCTTGCCGA TGCACCGGTT
TTCCATGGTA GCAAACCGTT CTATAATTCA TGCATCCTGC ACCGATGGAA TTACAGCAAG
TTGGCAGAAG TCTTTGGCTG CCAAGGATGG AAGGTGCACA CCTATGGCGA ACTGGTGGCA
GCTATAAATG GCGCCAAAGA CAACCTAAAC AGTCCGTTCA TCATCCAGGT TGTCGTGCCA
CAGCGGTCGA TCCCAGACAA TGCGAACTGG AAAGCAAACT AA
 
Protein sequence
MTIAEGPSRR LAAVAGCCHR RRCLSMRGTG PASLNRFWSQ PMTVAFRSKP VTVEDPAYRG 
LAIGTYAPAV TPAPAFAQTD LKIPYVTELL SKAILPDDDA QFAGVFDGKS SSPYVQSLVE
SSDFVLALGV WLTDINDLGW PIDLDKTAFA SWNTLKYGTS FIAQVSLADR VDGLIDKRVT
CKDRILPANT VQQAPVMNPA DELTYQVFYD FIQSQVDEDT IVGADASLNY FGSMLLKVGA
RRGFIVQSSY SAIGYIGPAA TGLSLAKQDG QRLMVLSGDG GFRMTVQCLS TQTRFNLNPI
IFVTDNGVYG VEQWLADAPV FHGSKPFYNS CILHRWNYSK LAEVFGCQGW KVHTYGELVA
AINGAKDNLN SPFIIQVVVP QRSIPDNANW KAN