Gene Smed_5992 details


Gene Information

Locus tag: Smed_5992
Symbol:
ID: 5320294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 949743
End bp: 951398
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 53%
IMG OID: 640777668
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001314600
Protein GI: 150378005
COG category: [G] Carbohydrate transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes
TIGRFAM ID:
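
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record; the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 949743
end_bp = 951398

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1656 bp and protein length of 551 aa, and the gene length is a whole number of codons.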


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0792937
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGAGA AATACACCGT GGGCCAATAT CTAGTCGACA GACTTCATGA GTTGGGTCTG 
CGGCACCTGT TTTCCATAGC GGGCGATTAT TCGATCGAAT GGGTGAACAG TTATGTAGAG
AAAAGCGGTA TCCAGGTGAT AGAAGAGGTG AATGAACTGA ATGCCGGCTA TGCGGCTGAC
GGCTATGCGA GGCTGAAAGG AATTGGTGCA CTGTGCGTTA CCTATTCCGC AGGCTCGCTT
TGCGCAACGA ATGCGATCGC CGGATCTTAC GTCGAGAAGG TGCCGGTTGT TCTGATCAAC
GGTGCGCCAA GCATCAAGAA AACGCTCACG TTCGAACAAA CCGGCTATAG TTCGCATCAC
TTCATTAGTG GACGGGAAAC GGATCGTCAG GTGTTCGAAT ACATTACGGC CGCTGCAGTC
CGCATCGACA GCCCTGATCT CGCACCAATG CTCATAGATT ATGCGCTGAC GCAGTGCATC
ACGGAAAGGC GTCCGGTCTA TATCGAGTTG CTAGAGGATA TGGTGGACCT AGGATGCACA
GTTCCGTCAG GTGCACTAAA AGCGGCCCCG GTTATATCCG ACGAAGTTAG TCTAAATCAG
TCGATCGCGC AAATCAGCGA AAGACTGCAA AATGCTACCA GACCCCTGAT CTGGATCGGT
GTCGAGGTGG ACCGGTTTGG CCTTCACGAC CAGGCGGAGC GGCTTATCCA GGACCTGAAA
ATTCCCTACG TGACCGAGCT TTTGAGCAAG GCCATCCTGC CGGAAGACGA TGCACAATTT
GCCGGGGTGT TTGACGGCAA ATCGTCATCC CCCTACGTCC AGTCTTTGGT CGAAAGCTCC
GATTTCGTAT TGGCCCTAGG TGTCTGGTTG ACGGACATCA ATGATTTGGG CTGGCCCATC
GATCTCAATA AAACAGCCTT CGCTTCCTGG AATACACTGA AATATGGCAC AAGTTTCATC
GCACAGGTTT CGCTCGCGGA CCTCGTGGAC GGCTTGATTG ATAAAAGGGT AACGTGCAAA
GACCGAATTC TTCCAGCGAA TACCGTCCAG CAAGCGCCCG TCATGAATCC GGCAGACGAA
CTTACGTATC AGGGCTTCTA CGATTTCATC CAGAGCCAGG TCGACGAAGA TACCATCGTC
GGCGCCGACG CGAGCTTAAA TTATTTCGGG AGCATGCTTC TCAAAGTGGG TGCTCGCCGC
GGTTTCATCG TTCAATCATC CTATTCGGCG ATCGGCTATA TTGGGCCGGC CGCGACAGGG
TTATCTCTGG CGAAGCAAGA TGGCCAGAGA CTGATGGTCC TATCGGGGGA TGGCGGGTTT
CAAATGACCG CTCAATGCCT GTCCACACAA ACCCGTTTCA ATCTCAATCC GATCATCTTC
GTGACAGACA ACGGTGTCTA TGGCGTGGAG CAATGGCTTG CCGATGCTTC GGTTTTCCAT
GGTAGCAAGC CGTTCTATAA TTCATGCATC CTGCACCGAT GGAATTACAG CAAGTTGGCA
GAAGTCTTTG GCTGCCAAGG CTGGAAGGTG CACACCTATG GCGAACTGGA GGAAGCTATA
AATGGCGCCA AGGACAACCT AAACAGTCCG TCCATCATCC AGGTTGTCGT GCCACAGCGG
TCGATCCCCG ACAATGCGAA CTGGAAAGCA AACTAG
 
Protein sequence
MREKYTVGQY LVDRLHELGL RHLFSIAGDY SIEWVNSYVE KSGIQVIEEV NELNAGYAAD 
GYARLKGIGA LCVTYSAGSL CATNAIAGSY VEKVPVVLIN GAPSIKKTLT FEQTGYSSHH
FISGRETDRQ VFEYITAAAV RIDSPDLAPM LIDYALTQCI TERRPVYIEL LEDMVDLGCT
VPSGALKAAP VISDEVSLNQ SIAQISERLQ NATRPLIWIG VEVDRFGLHD QAERLIQDLK
IPYVTELLSK AILPEDDAQF AGVFDGKSSS PYVQSLVESS DFVLALGVWL TDINDLGWPI
DLNKTAFASW NTLKYGTSFI AQVSLADLVD GLIDKRVTCK DRILPANTVQ QAPVMNPADE
LTYQGFYDFI QSQVDEDTIV GADASLNYFG SMLLKVGARR GFIVQSSYSA IGYIGPAATG
LSLAKQDGQR LMVLSGDGGF QMTAQCLSTQ TRFNLNPIIF VTDNGVYGVE QWLADASVFH
GSKPFYNSCI LHRWNYSKLA EVFGCQGWKV HTYGELEEAI NGAKDNLNSP SIIQVVVPQR
SIPDNANWKA N
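
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It is not the tool used to produce this record; the function names are hypothetical. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). Only the first 60-base line of the gene sequence is embedded; the full sequence can be passed in the same way.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record above.
first_block = "ATGCGCGAGA AATACACCGT GGGCCAATAT CTAGTCGACA GACTTCATGA GTTGGGTCTG"
print(translate(first_block))  # MREKYTVGQYLVDRLHELGL
```

The printed residues match the first 20 residues of the protein sequence in the record. Calling `gc_content` on the full gene sequence gives the fraction to compare against the listed GC content.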