Gene Smed_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0223 
Symbol 
ID5321055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp245863 
End bp247962 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content62% 
IMG OID640789158 
Productshort chain dehydrogenase 
Protein accessionYP_001325917 
Protein GI150395450 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.133149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.171909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCA AGCAACAGGG CGCGCGCCTT GCCAATCTTT GGGACGACGG CAAGGCGGCA 
GGAATGACCG AGCCTGAAAG GCTCCTCTAT CGCTCGAACC TGCTCGGATC TGACAAGAGG
ATCACCAATT ACGGCGGCGG CAACACCTCG GCCAAGGTCA TGGAAAAGGA TCCGTTGACG
GGCGAGATGG TCGAGGTCCT CTGGGTCAAG GGCTCCGGCG GCGACGTCGG CACGATCAAG
ATGGACGGCT TCGCCACCCT CTATATGGAC AAGCTGCGGG CCCTCGAGGG CATCTATCGC
GGGGTCGAAT TCGAAGACGA GATGTTCGGG TATCTGCCGC ATTGCACCTT CAATCTCAAC
CCGCGCGCCG CTTCGATCGA CACGCCGCTG CATGCCTATG TTCCGAAGGT GCATGTCGAC
CACATGCATC CGGATGCGAT CATTGCCATC GCCGCCTCGA AGAACAGCAA GGAACTGACG
AGCAAGATAT TCGGAACTGA GATCGGCTGG CTGCCGTGGA AACGGCCCGG CTACGAACTC
GGTCTCTGGC TTGAAAGATT CTGCCGTGAG AACCCGGAAG CGCGCGGCGT CGTGCTCGAA
AGCCACGGCC TCTTCACCTG GGGCGACACG GCCAAGCAGG CCTACGAGAC GACGATCGAA
ATCATCAATC GCGCCATTGC CTGGTTCGAG ACGGAAAACA CGGGTCCGGC CTTCGGTGGA
CCGGCGAAAC CGGCGCTCGC CGGCGGTGAA CGCGTCGCCA TCGCCAGGAA ACTGATGCCT
GTCATCCGCG GTCTCATCAG CGGAGCGGAA AGCAAGGTCG GGCACTTCGA CGACAGCGAG
GCCGTGCTCG ATTTCGTAAC CTCTACAAAT CTGGAGCCGT TGGCAGCACT CGGCACCAGT
TGCCCCGATC ACTTCCTGCG CACCAAGATC CGCCCGCTGA TCGTCGACTT CGACCCGGCC
CAGCCGGACC TCGAAAAGAC GCTTGCGGGC CTGCCCGAGG CGATCGCGAC CTATCGCGCC
GATTACGCGG CCTATTACGA CCGCTGCAAG CGCGCCGACA GCCCGGCGAT CCGTGATCCG
AACGCCGTCG TCTATCTCCT GCCGGGCGTC GGCATGATCA CCTTCGCCAA GGACAAGGCG
ACGGCGCGCA TCTCCGCGGA GTTCTACGTC AACGCCATCA ATGTCATGCG CGGCGCGTCC
GGCGTCTCGA CCTATGTCGG CCTGCCGGAG CAGGAAGCTT TCGACATCGA ATACTGGCTG
CTCGAAGAAG CCAAGCTTCA GCGCATGCCG AAGCCGAAGA GCCTTGCCGG GCGCATCGCT
CTCGTCACCG GCGGCGCCGG CGGCATAGGC AAGGCGACAG CCAACCGGCT GATGCAGGAA
GGCGCCTGCG TCGTGCTTGC AGATATCGAC GAACGGGCGC TCGAGGCTGC GCAGAACGAG
CTTTCGAACC GCTATGGCAA GGATTTCGTC CGCTCCGTCA ACATGAACGT CACCGATGAG
GCGGCCGTCG CGTCGGGTTT CGGCGACGCG CTCCTGGCCT TCGGCGGTCT CGATATTCTC
GTCTCCAATG CGGGTCTCGC GACGTCGGCT GCTGTCGAGG ACACCACGCT GGCGCTCTGG
AACAAGAATA TGGACATTCT CGCGACGGGC TATTTCCTGG TCTCGCGCGA GGCTTTCCGC
ATCTTCCGCA ACCAGAAGAC CGGCGGAAAT GTCGTCTTCG TCGCTTCGAA GAACGGGCTT
GCGGCCTCGC CCGGAGCTTC TGCCTATTGC ACTGCCAAAG CGGCCGAAAT CCACCTCGCC
CGCTGCCTGG CGCTCGAGGG GGCTTCGGCG CAGATTCGCG TCAACGTGGT CAATCCCGAC
GCGGTACTGC GCGGTTCCAA GATCTGGACC GGCGAATGGA AGGAACAGCG CGCCGCCGCC
TACAATATGG ACGTCGATGA ACTGGAGGCG CATTACCGCG AGCGCTCCAT GCTGAAGCTC
AGCGTATTCC CGGAAGATAT CGCCGAGGCA ATCTACTTCC TTGCTTCCGA CATGTCGGCG
AAGTCCACCG GTAACATCGT CAATGTCGAC GCGGGCAATG CCCAGTCGTT CACGCGCTGA
 
Protein sequence
MLGKQQGARL ANLWDDGKAA GMTEPERLLY RSNLLGSDKR ITNYGGGNTS AKVMEKDPLT 
GEMVEVLWVK GSGGDVGTIK MDGFATLYMD KLRALEGIYR GVEFEDEMFG YLPHCTFNLN
PRAASIDTPL HAYVPKVHVD HMHPDAIIAI AASKNSKELT SKIFGTEIGW LPWKRPGYEL
GLWLERFCRE NPEARGVVLE SHGLFTWGDT AKQAYETTIE IINRAIAWFE TENTGPAFGG
PAKPALAGGE RVAIARKLMP VIRGLISGAE SKVGHFDDSE AVLDFVTSTN LEPLAALGTS
CPDHFLRTKI RPLIVDFDPA QPDLEKTLAG LPEAIATYRA DYAAYYDRCK RADSPAIRDP
NAVVYLLPGV GMITFAKDKA TARISAEFYV NAINVMRGAS GVSTYVGLPE QEAFDIEYWL
LEEAKLQRMP KPKSLAGRIA LVTGGAGGIG KATANRLMQE GACVVLADID ERALEAAQNE
LSNRYGKDFV RSVNMNVTDE AAVASGFGDA LLAFGGLDIL VSNAGLATSA AVEDTTLALW
NKNMDILATG YFLVSREAFR IFRNQKTGGN VVFVASKNGL AASPGASAYC TAKAAEIHLA
RCLALEGASA QIRVNVVNPD AVLRGSKIWT GEWKEQRAAA YNMDVDELEA HYRERSMLKL
SVFPEDIAEA IYFLASDMSA KSTGNIVNVD AGNAQSFTR