Gene Rleg_5158 details

Gene Information

Locus tag: Rleg_5158
Symbol:
ID: 8007054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 561334
End bp: 562329
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 64%
IMG OID: 644822068
Product: aldo/keto reductase
Protein accession: YP_002973328
Protein GI: 241113493
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
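
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 561334
END_BP = 562329
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 996
print(expected_protein_length(GENE_LENGTH_BP))  # 331
```

Both results match the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.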


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAC AGCAACTGAC ACGTGAGATC GGCAGATCGG GCGTTTCCGC CTCGGCGGTG 
GGGCTCGGCA CCTGGGCGAT CGGCGGCTGG ATGTGGGGTG GGACCGATGA AGCCGAATCC
ATCGCCGCAA TCCAGGCATC GCTCGATGCC GGCGTGACGC TGATCGACAC GGCGCCGGCC
TATGGGCTCG GACGTTCCGA GGAGATCGTC GGCAAGGCAC TGACCGGGCG CCGCGACAAG
GCTGTCATCG CTACCAAATG CGGCCTTGTC TGGCATACGC AGAAGGGTCG TCATTTCTTC
GACCAGGACG GCAAGCCGGT CCACCGTTAT CTCGGCCGCG ACGCGATCCT GCACGAGGTC
GAGGAAAGTC TTAGGCGGCT CGGAACCGAC TATATCGACC TCTACATCAC TCATTGGCAG
GACCCGACGA CCCCGATCGA GGAAACGATG CGGGCGCTGC AAGACCTGCG CTCATCGGGC
AAGATCCGGG CAATCGGCGC AAGCAATGTC AGCCCCGACG ACCTCAATGG CTATATCGCT
GCCGGTGGTC TCGATGCGAT CCAGGAGCGG TTCAGCATGA TCGACCGGGA AATCGAGGCG
GAACTTCTGC CGCTGACAAA GGCCAACGGC ATTGCGACGC TGAGCTATTC GTCGCTGGCG
CTGGGGTTGC TGTCCGGGAC CATCGGTCCT GACCGCGTGT TTTCCGGCGA CGACCAGCGC
AAGGGCAATC CGCGCTTTTC AGTCGGCAAC CGCCGGAAGG CAACGGCGCT GGCCGACGCC
ATCCGGCCGG TCGCCGAAAA ACACGGCGCC AGCATCGCCC AGATCGTGAT TGCCTGGACG
CTGGCACAGC CTGGCATCAC TTTTGCACTT TGCGGGGCGC GCAATCCGGC ACAAGCGCTC
GATAATGCGC GGGCCGGGAC CATCCGGCTG AATGCGGCCG AGCTTGCGGC CATCGATACG
GCCATAGCGG CGAAACTGAC TGACATGGAC AGGTAG
 
Protein sequence
MSEQQLTREI GRSGVSASAV GLGTWAIGGW MWGGTDEAES IAAIQASLDA GVTLIDTAPA 
YGLGRSEEIV GKALTGRRDK AVIATKCGLV WHTQKGRHFF DQDGKPVHRY LGRDAILHEV
EESLRRLGTD YIDLYITHWQ DPTTPIEETM RALQDLRSSG KIRAIGASNV SPDDLNGYIA
AGGLDAIQER FSMIDREIEA ELLPLTKANG IATLSYSSLA LGLLSGTIGP DRVFSGDDQR
KGNPRFSVGN RRKATALADA IRPVAEKHGA SIAQIVIAWT LAQPGITFAL CGARNPAQAL
DNARAGTIRL NAAELAAIDT AIAAKLTDMD R
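
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as a GC content check. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, used as example input.
first_line = "ATGAGCGAAC AGCAACTGAC ACGTGAGATC GGCAGATCGG GCGTTTCCGC CTCGGCGGTG"
seq = clean_sequence(first_line)

print(len(seq), seq[:3])           # block count check and start codon
print(f"{gc_fraction(seq):.0%}")   # GC fraction of this one line only
```

To compare against the GC content field of the record, the full gene sequence would have to be passed in rather than a single line.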