Gene Rleg_1053 details

Gene Information

Locus tag: Rleg_1053
Symbol:
ID: 8012182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1028698
End bp: 1029657
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 64%
IMG OID: 644823636
Product: oxidoreductase
Protein accession: YP_002974887
Protein GI: 241203791
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
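
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = span_length(1028698, 1029657)
aa_len = protein_length(gene_len)
print(gene_len, aa_len)  # 960 319
```

Both results agree with the Gene Length and Protein Length fields listed above.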


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.176111
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACA AACAAGTCCC CATCCGCTCC GGATTTGGAG CCCACACGAC GGCCGGCGAG 
GTTTTGGCCG GTCTCGATCT TTCCGGCAAG CGCGCCATCG TCACCGGCGG CCATTCCGGC
CTCGGGCTCG AGACCACGCG CGCTCTGGCG GGCGCCGGCG CGAAGGTGAC CATCGGCGCA
AGGAGCATCG AGGCGGCGCG TAGCGCGGTC GCCGGTATCG ATGGCGTAGA GATTGATCGG
CTCGACCTTT CCGACCTCGA AAGCGTTCGC GCCTTTGCCG AGCGGTTCGT CGCATCTGGC
CGCAGCATCG ACATTTTGAT CAACAGCGCC GGCATCATGG CCTGCCCGGA AACGCGTGTC
GGCGACGGAT GGGAGGCACA GTTCGCGACC AATCATCTCG GCCATTTCGC CTTGGTCAAC
CGCCTCTGGC CGGCGATCTC GCGCGGCACT CGCATCGTTT CGGTTTCCTC CGGTGGCCAT
GGCAACTCGG CCATACGATG GGAGGATGTG CATTTCGAGA CCGGTTACGA CAAATGGCAG
GCCTACGGCC AGTCGAAGAC CGCCAACGCA CTTTTCGCCG TGCATCTGGA CAGGCTCGGG
CGCGACACCG GCATCCGCGC CTTCTCGCTG CACCCGGGCA AGATTTTTAC CCCCTTGCAG
CGCCATCTCG CAAAGGAGGA AATGGTCAGT GCCGGCTGGA TCGATGCAGA CGGCAATCCG
ATTGATCCGA CGTTCAAGAC ACCAGCCCAG GGGGCAGCGA CGCAGGTTTG GGCGGCGACC
TCGCCACAAC TCGAAGGTAT GGGAGGCCTC TATTGCGAGG ACTGCGATAT CGCCATCCGC
GCAACGGTTG GAGAACCCGG CGGCGTCAGC GACCATGCAG CCGATCCCGA GGAGGCGGCA
CGCCTGTGGA TCTTGTCGGC AAGGCTGACC GGCATTGACG CTTTCGCGGC GTACGCCTGA
 
Protein sequence
MSDKQVPIRS GFGAHTTAGE VLAGLDLSGK RAIVTGGHSG LGLETTRALA GAGAKVTIGA 
RSIEAARSAV AGIDGVEIDR LDLSDLESVR AFAERFVASG RSIDILINSA GIMACPETRV
GDGWEAQFAT NHLGHFALVN RLWPAISRGT RIVSVSSGGH GNSAIRWEDV HFETGYDKWQ
AYGQSKTANA LFAVHLDRLG RDTGIRAFSL HPGKIFTPLQ RHLAKEEMVS AGWIDADGNP
IDPTFKTPAQ GAATQVWAAT SPQLEGMGGL YCEDCDIAIR ATVGEPGGVS DHAADPEEAA
RLWILSARLT GIDAFAAYA
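
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a rough Python sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first 60 bases of the gene are included here for brevity.

```python
# Standard codon assignments, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove all whitespace (block separators, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_60 = "ATGAGTGACA AACAAGTCCC CATCCGCTCC GGATTTGGAG CCCACACGAC GGCCGGCGAG"
print(translate(first_60))  # MSDKQVPIRSGFGAHTTAGE
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein Length fields.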