Gene Rleg_0491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0491 
Symbol 
ID8011686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp513711 
End bp514706 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content64% 
IMG OID644823083 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_002974336 
Protein GI241203240 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.220192 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAC AATCAACGAT GAAAGCGCTG TTGGCTGAAG CACCGAATTC ACCATTGCGC 
ATCGCCGATA TTTCAAAACC GGTGCCCGGC GAAGGCCAGG TCCTTGTGCG CATCAAGGCA
AGCGGCGTCA ATCCGCTCGA CCTCAAGATC CGGGCCGGCA ATGCGGCCCA TGCCCGCCAT
CCGCTTCCCG CCATTGTCGG TATCGATATG GCCGGTGTTG TCGAAGAGGT CGGAACTGGC
GTCAGCGGTT TTCGCCGCGG CGACGCGGTC TACGGCATGA CCGGCGGGGT CGGCGGCATT
CAGGGATCGC TTGCCGAATT TGCAGCTGTT GACGCCGCGC TGCTGGCGCC AAAGCCGCAG
AATCTGTCGA TGCGCGAGGC AGCCGCCTTG CCGCTGATCT CGATCACCGC CTGGGAGGGG
CTGGTCGACC GCGCCGGGGT CAAGGCCGGC CAGAAGGTTC TGGTGATTGG CGGCGGCGGT
GTCGGCCATA TCGTCGCGCA GATCGCCAAG GCTGCCGGAG CCGAGGTCTA TGTGGTCGAC
GGCACGTCGA AAGCAGACTA TCTCGCCGGC CTTGGCGCAA CGCCGATCGA TCGTGACGCG
GAAACGGTGG AAACCTATGT CGCAAGACAC ACCGGCGGCA AGGGTTTCGA ACTGGTCTAC
GATACCGTCG GCGGCCAGGG GCTCGATACG GCATTCCGAG CCGTCAGCCA GTTCGGCCAC
GTCGTCAGCT GCCTTGGCTG GGGAAGCCAC GCGCTGGCAC CGCTCTCCTT CAAGGCCGCC
ACCTATTCAG GCGTCTTCAC GCTGCTGCCG CTGCTGACCG GCGAAGGACG CGCCCATCAT
GGTGAGATCA TGCGAGAGAT GACGAAGCTT GCCGAAGCCG GCAAGGTGAT GCCGAAGCTC
GACCCGCGCC GCTTCACCCT TGCCGATACA GATGAGGCGC ATCGGCTGAT CGAAAACCGG
CAGGCGGACG GGAAGCTCGT CGTCGAGATC GACTGA
 
Protein sequence
MSVQSTMKAL LAEAPNSPLR IADISKPVPG EGQVLVRIKA SGVNPLDLKI RAGNAAHARH 
PLPAIVGIDM AGVVEEVGTG VSGFRRGDAV YGMTGGVGGI QGSLAEFAAV DAALLAPKPQ
NLSMREAAAL PLISITAWEG LVDRAGVKAG QKVLVIGGGG VGHIVAQIAK AAGAEVYVVD
GTSKADYLAG LGATPIDRDA ETVETYVARH TGGKGFELVY DTVGGQGLDT AFRAVSQFGH
VVSCLGWGSH ALAPLSFKAA TYSGVFTLLP LLTGEGRAHH GEIMREMTKL AEAGKVMPKL
DPRRFTLADT DEAHRLIENR QADGKLVVEI D