Gene Rleg_2094 details

Gene Information

Locus tag: Rleg_2094
Symbol:
ID: 8013118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2085176
End bp: 2086144
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 62%
IMG OID: 644824681
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_002975911
Protein GI: 241204815
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
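
The start, end, gene length, and protein length listed above are mutually consistent under two assumptions that the record itself does not state: coordinates are 1-based and inclusive, and the gene length includes one trailing stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2085176, 2086144)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 969 322
```

The same check applies to any unspliced CDS record; a mismatch usually points to a different coordinate convention or a partial gene.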


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.26328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.398139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGGAC TTTTTGGCGC GACATCGACC ACCGATGAGG TGCTCGCCGG CGTCGATCTC 
AAGGGCAAGC GTGTTCTGGT GACGGGTGTT TCGGCCGGCC TCGGCGTGGA AACCGCGCGT
GTGTTGGCAG CCCATGGAGC GCAGGTGACG GGTACGGCAC GTGACCTTGC AAAGGCGAGG
GCGGCAACGG AAGTCGTGCG TGCCGGTGCG GCCAATGGCG GCAGCCTTGA TATCGTCGAG
CTTGATCTTG CCTCTCTGGC AAGCGTGCGC GCCTGTGCCG ATGCGCTCAT TTCGGATGGC
CGGCCCTTCG ATGTCGTCAT TGCCAATGCC GGCGTGATGG CCGCTCCCTT CGGCCGCACC
GCCGATGGCT TCGAAACGCA GTTCGGCACC AACCATCTCG GTCATTTCGT GCTGGTCAAC
AGCATCGCAC CGCTCGTCAA ATCGGGCGGC CGAGTGGTGA TCGTCGCATC CTCGGGCCAT
CGCATGGCAC CTTTCAGCCT CGATGACCTC AATTTCGAGA GCAAGACCTA TGAGCCCTGG
GCGGCCTATG CCCAGTCGAA AACCGCAAAT ATCCTGTTCG CGGTGGAACT CGACCGGCGC
CTCAAGGAGC GCGGCATCCG TGCAACGGCA CTGCATCCCG GCGGCATCCA GACCGAGCTC
GACCGTCATC TCGACCCTGA CATGATTGAA GGCATGATAA CGCAGATCAA CGCAGCACTC
TCCGCCGAGG GCAAGCCGCC TTTCCAGTGG AAGACGATTC CTCAGGGTGC GGCTACCTCC
GTCTGGGCAG GTTTCGTCGC CCCTGCAGAC GCGGTCGGTG GCAGATATTG CGAGAATTGC
CACGTCTCCG AAGTGACGGA TGCGGAGATC AGCCCGATTT CCGAAGGCGT GCGTACCTAC
GCGCTCGATC CCGAGACGGC CAGGGGATTG TGGACGAAAA GCGAGCATAT GGTCGGCGAG
CGCTTCTAG
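
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before it can be used. The sketch below shows one way to do that and to recompute a GC fraction in Python. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the whole-gene GC content reported in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as sample input.
first_line = "ATGAGTGGAC TTTTTGGCGC GACATCGACC ACCGATGAGG TGCTCGCCGG CGTCGATCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full 969-base sequence through the same two functions gives a value to compare against the GC content field.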
 
Protein sequence
MSGLFGATST TDEVLAGVDL KGKRVLVTGV SAGLGVETAR VLAAHGAQVT GTARDLAKAR 
AATEVVRAGA ANGGSLDIVE LDLASLASVR ACADALISDG RPFDVVIANA GVMAAPFGRT
ADGFETQFGT NHLGHFVLVN SIAPLVKSGG RVVIVASSGH RMAPFSLDDL NFESKTYEPW
AAYAQSKTAN ILFAVELDRR LKERGIRATA LHPGGIQTEL DRHLDPDMIE GMITQINAAL
SAEGKPPFQW KTIPQGAATS VWAGFVAPAD AVGGRYCENC HVSEVTDAEI SPISEGVRTY
ALDPETARGL WTKSEHMVGE RF
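
The protein sequence can be cross-checked against the gene sequence by translating it. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons; because this gene begins with ATG, no special start-codon handling is needed here. The following Python sketch builds the codon table from the conventional TCAG ordering and translates up to the first stop codon. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (same amino acid
# assignments as the standard code, which table 11 shares).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with non-ACGT characters become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAGTGGAC TTTTTGGCGC GACATCGACC"))  # MSGLFGATST
print(translate("CGCTTCTAG"))  # RF
```

The two outputs match the first ten and last two residues of the protein sequence. For a gene that starts with an alternative start codon such as GTG or TTG, the first residue would need to be set to M explicitly.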