Gene Rleg_4733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4733 
Symbol 
ID8007469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp100879 
End bp101847 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content63% 
IMG OID644821664 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002972924 
Protein GI241113089 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATATC AGGCGGTCGT GCGAAAATTC GGCCCGGCTC AGGATGTCGT CGAGCTCGAG 
CAGGCTGCGC TGCCGCCGCT GGCGCGCGAC CAGGTGAGGG TGCGCCTGCT GGCGCGGGCG
ATCAATCCGT CCGATATCAT CACCATCTCG GGAGCCTATA GCGGACGCAC GACATTGCCT
TTCGTTCCCG GCTTCGAAGC GTTCGGCGTG GTCGAACAAT GCGGTGAAGA GGTTCATGGG
CTTTCGCCGG GAACACGCGT GCTGCCGGTG CGTAGCGCCG GCGGCTGGCA GGAATTCAAG
GATACCGATC CCGGCTGGTG CCTGCGTGTT CCTGACGAGC TCACCGACTT CGAAGCTGCG
ACGAGCTACG TCAATCCGAT GACGGCCTGG TTGATGCTGC ATGCCAAGAT CGGGCTGAGG
CCGGGCATGC GCATCGCTAT CAATGCCGCC GCCTCTTCGA TCGGAGCGAT ATTGATCGGT
CTCGCCAACG CCGCAGGCGT GGAGCCGGTC GCCATCGTCC GTAGCGAGGG ATCGCTTGAG
CGCCTGCGCG GCCGGGTCGA GGCTATCATC ATCGATAGAG AGGAAAGCGA TCTGGTTGCC
GGGCTTGCCG GCCGACACGG GCTAGACGCG GTGCTCGATT GCGTCGGAGG AGCGCGCGCC
ACAATCCTCG CCGATGCGCT GCGGGCGGGC GGACGCTTCT TGCACTACGG TCTGCTCTCT
GGGCAGAGCA TCCCGAACTC ATTCTGGGCG ACCCATCCCG ATATTTCCTT TTCCTATGTT
CACCTCCGGG AATGGGTTCA TTCCGAAGCT ATGGACGACG TGCAGCACGC CTATTCCAAG
GTCGCGGCGC ACATCGTTTC GAAGGTCATC GAGACCGAGA TCCGGGAGGT GTTTCCTTTG
GAAAGTGTCC GGCAAGCCCT GCAGTCCGCT CTTCCCTTCC GAACGGGCGG CAAGGTTCTG
CTCGCCTGA
 
Protein sequence
MQYQAVVRKF GPAQDVVELE QAALPPLARD QVRVRLLARA INPSDIITIS GAYSGRTTLP 
FVPGFEAFGV VEQCGEEVHG LSPGTRVLPV RSAGGWQEFK DTDPGWCLRV PDELTDFEAA
TSYVNPMTAW LMLHAKIGLR PGMRIAINAA ASSIGAILIG LANAAGVEPV AIVRSEGSLE
RLRGRVEAII IDREESDLVA GLAGRHGLDA VLDCVGGARA TILADALRAG GRFLHYGLLS
GQSIPNSFWA THPDISFSYV HLREWVHSEA MDDVQHAYSK VAAHIVSKVI ETEIREVFPL
ESVRQALQSA LPFRTGGKVL LA