Gene Rleg2_6236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6236 
Symbol 
ID6983309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp178617 
End bp179597 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content60% 
IMG OID643399248 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002284004 
Protein GI209552088 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.547714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.621953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAG ATATGGGCTT GAACCGATTG GGAAATCATA TGAAGCGTAT TCAATATCAT 
CGCTACGGCG GCCCTGATGT TATGCAGATG GAGCGGTTCG AACCGCCCGC ATTGCGACCT
GACGATATCG CAGTGAAAGT GGTGTACGCG GCGATCAACC CGGTCGATTG GAAAGTCCGC
AGGGGCGATT TGAAACTCGT CACAGGCCGC AAATTTCCTC GGGCGATGGG GTGTGATTTT
TCGGGTGAGA TCCTTGCCGT GGGGTCCGGC GTCACGGCGT TCAAGCTGGG TGAGGCGGTA
TTTGGCGTCG CTCCTGTTAA AACCTGCGGC GCGCTCGCCG AAGTCGTCGT CGCCCCGCAG
ACGGTCGTAG GCCGAAAGCC GGAGAGCGTG ACATTCGAAG AAGCCGCATG TCTGGGTACT
CCCGGCGTCA CCGCCTGGAA CGCCCTGATC GACAAGGCGC ATCTGAAAGC CGGACAGCAT
GTGCTGATCA ATGGCTGCAC CGGCGCGGTG GGAGCCGCCG CCGTGCAGAT CGCCTTGCTT
CAAGGTGCAA TTGTATCGGG GACATGCAGC GCTGATGCCG CAACCCAAGC CAAAGCACTT
GGTGTCACGG AAGTCCTAGA TTATCGCAAA ACAAATCTGG CGACCTTGTC CCGCCGCTTC
GACGTCGTCT TCGATACTGC TATCACGATG CCGATCGCGA CCGGATTGTG CTTGCTCAGC
CGGGGAGGCG TGTTCCTCGA CCTTGAGCCC GGGCCAGCAA AGATCATCCG TTCTCTTTTC
GACCGCCGGT TGAAGCCGAT CATCTGCACG CCGCGCCCTG CCATCATGGC AGCGTTGGCG
GAAGCCGCGA GAGTGGGCCG TCTGTCTGTG CCGAGCGCCC AGATCGTCGA TTTCGATGCC
GCGATCGGCA AGATTGCGAA CCTAGAACAG GGTGTCGGCT CGCGAGGGAA GGCCGTTGTC
GCGATCGGTA CGGCCGTTTA G
 
Protein sequence
MTEDMGLNRL GNHMKRIQYH RYGGPDVMQM ERFEPPALRP DDIAVKVVYA AINPVDWKVR 
RGDLKLVTGR KFPRAMGCDF SGEILAVGSG VTAFKLGEAV FGVAPVKTCG ALAEVVVAPQ
TVVGRKPESV TFEEAACLGT PGVTAWNALI DKAHLKAGQH VLINGCTGAV GAAAVQIALL
QGAIVSGTCS ADAATQAKAL GVTEVLDYRK TNLATLSRRF DVVFDTAITM PIATGLCLLS
RGGVFLDLEP GPAKIIRSLF DRRLKPIICT PRPAIMAALA EAARVGRLSV PSAQIVDFDA
AIGKIANLEQ GVGSRGKAVV AIGTAV