Gene Smed_0849 details

Gene Information

Locus tag: Smed_0849
Symbol:
ID: 5321687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 906704
End bp: 907738
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 61%
IMG OID: 640789786
Product: alcohol dehydrogenase
Protein accession: YP_001326539
Protein GI: 150396072
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
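
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 1035 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(906704, 907738)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```

Both results agree with the Gene Length and Protein Length fields of this record.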


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0438742
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGAG CACTCGTCCT AGAGCAAATC CGGAAACTCT CGCTGCGCGA TATCGACCTG 
CCGCAGGAAG TCGGACCCCA CGACGTCCGC ATCAAGATTC ACACGGTCGG GATATGCGGG
TCGGACGTGC ATTATTACAC GCACGGCCGG ATAGGGCCTT TTGTGGTGAA TGCGCCGATG
GTACTCGGGC ATGAGGCGGC TGGCGTCGTC GTCGAAACCG GTAAGGACGT GACGCATCTC
AAGGCAGGGG ATCGCGTGTG CATGGAGCCC GGAATTCCGG ACGCGAATTC GCGCGCCAGT
CGTCTTGGCC TTTACAATAT CGACCCGGCT GTGACGTTTT GGGCGACGCC TCCTGTCCAT
GGCGTCCTGA CTCCGCACGT CGTCCACTCG GCGAATTACA CCTATAAGCT GCCGGACAAA
GTCAGTTTCG CAGAAGGGGC GATGGTGGAG CCGTTTGCCG TCGGCATGCA GGCGGCGCAA
AAGGCGAAGA TTGCTCCCGG CGATACTGCC GTGGTCACCG GCGCCGGGCC GATCGGCATC
ATGGTGGCGA TCGCGGCGCT CGCCGGAGGG TGCGCGCGGG TGATTGTTGC CGATTTCGCG
CAACCGAAGC TAGACATTGC GGCGCAATAC CAGGGCATCC TGCCGATCAA CATCGGCAAA
CGCGACCTCG CGGAGGAAGT GAAGCAGCTC ACCGAGGGCT GGGGCGCCGA TGTGGTGTTC
GAATGCTCAG GTTCGCCGAA GGCATGGGAG ACATTGCTCG ATCTTCCCCG GCCAGGCGGT
GCCGTCGTTG CTGTGGGACT CCCGGTCGAA CCGGTTGGTC TGGATATATC CACCGCATCG
ACGAAGGAAA TCCGGTTTGA GACGGTATTT CGCTATGCCC ATCAATATGA CCGCGCAATC
GCTTTGATGG GATCTGGGCG CGTCGACCTG AAGCCGCTCA TCACCGAGAC GTTTCCGTTC
GAAGAAAGTG TCGCGGCTTT CGATCGCGCG GCGGAGGGTA GGCCGGGTGA TGTGAAGCTG
CAGATCACGC TGTAG
 
Protein sequence
MPRALVLEQI RKLSLRDIDL PQEVGPHDVR IKIHTVGICG SDVHYYTHGR IGPFVVNAPM 
VLGHEAAGVV VETGKDVTHL KAGDRVCMEP GIPDANSRAS RLGLYNIDPA VTFWATPPVH
GVLTPHVVHS ANYTYKLPDK VSFAEGAMVE PFAVGMQAAQ KAKIAPGDTA VVTGAGPIGI
MVAIAALAGG CARVIVADFA QPKLDIAAQY QGILPINIGK RDLAEEVKQL TEGWGADVVF
ECSGSPKAWE TLLDLPRPGG AVVAVGLPVE PVGLDISTAS TKEIRFETVF RYAHQYDRAI
ALMGSGRVDL KPLITETFPF EESVAAFDRA AEGRPGDVKL QITL
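
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the pipeline that produced this record: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 nt and the final row of the gene sequence are used; the final row is in frame because the 1020 nt preceding it are a multiple of 3.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing 1-2 bases are ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30_nt = "ATGCCCAGAG CACTCGTCCT AGAGCAAATC"
final_row = "CAGATCACGC TGTAG"

print(translate(first_30_nt))  # MPRALVLEQI
print(translate(final_row))    # QITL
```

The two outputs match the first ten and last four residues of the protein sequence above.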