Gene Rleg2_1620 details


Gene Information

Locus tag: Rleg2_1620
Symbol:
ID: 6980356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1646420
End bp: 1647409
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 61%
IMG OID: 643396345
Product: aldo/keto reductase
Protein accession: YP_002281136
Protein GI: 209549219
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID: [TIGR01293] voltage-dependent potassium channel beta subunit, animal
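
The listed gene and protein lengths follow from the start and end coordinates. The short Python sketch below re-derives them. It assumes the coordinates are 1-based and inclusive, which is consistent with the lengths in this record but is not stated on the page. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1646420
end_bp = 1647409

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# The CDS ends with a stop codon, which does not add a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 990 bp
print(protein_length, "aa")  # 329 aa
```

Both values agree with the Gene Length and Protein Length fields.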


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.884834
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.762073
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATATC GTCGTCTGGG AAAATCGGGT CTGCAAGTGA GCGAGTTCTC GTTCGGCTCA 
TGGGTGACAT TCGGTAAGCA GGTCAATGGC GGCGACGCCG TCGACCTCAT GAAGCTTGCC
TATGACAACG GGGTGAACTT CTTCGACAAT GCCGAAGGAT ACGAAAGCGG CAAGTCCGAG
ATCGTCATGG GCGAGGCGCT GAAGTCGCTT GGCTGGAGCC GCGACAGCTT CGTCGTCTCG
AGCAAGGTCT TCTGGGGCGG CCAAAAGCCG ACGCAGCGCG GCCTGTCGCG CAAGCACGTG
ACCGATGCCT GCCATGCCGC GCTGAAGAGA CTTCAGGTCG ATTACCTCGA CCTCTATTTC
TGCCATCGCC CGGATATCGA CACGCCGATC GAGGAAACGG TCCGGGCGAT GCACGATCTC
GTCGCCCAGG GCAAGGTGCT CTACTGGGGA ACGTCGGAAT GGTCGGCGCA ACAATTGACG
GAAGCCTACG CCGTTGCCCG CGACCTGCGC ATCACGCCGC CGACGATGGA GCAGCCGCAG
TACAATATCT TCGAACGTCA GAAGGTCGAA TCCGACTATC TCCCGCTCTA CGACCTGATC
GGTCTCGGCA CCACGATCTG GTCGCCGCTC GCCTCGGGCG TCCTGACCGG CAAATATAAT
AACGGTGTGC CGGCTGACAG CCGGATGAAC TTGCCGGGCT ACGAATGGCT GAAGGAGAAG
TGGTCCAGCG ACGCCGGCCG CGCCCAGCTC AAGCAAGTGG GTGAACTTGC AAAGCTCGCC
GATGAGATCG GCCTGTCGAT CACCCATCTT GCCCTGTTGT GGTGCCTCGC CAATCGCAAC
GTCTCCACCG TCATTCTCGG CGCCTCGCGC GCCAGCCAGT TGCAGGACAA TCTCGCGGCC
CTTTCGCACA GGCAGAAGAT GACCCCTGAA GTGATGGGCC GGATCGACAC CATCGTTGGA
AACAAGCCGG AAGGCCCGCG TCGATTCTAA
 
Protein sequence
MEYRRLGKSG LQVSEFSFGS WVTFGKQVNG GDAVDLMKLA YDNGVNFFDN AEGYESGKSE 
IVMGEALKSL GWSRDSFVVS SKVFWGGQKP TQRGLSRKHV TDACHAALKR LQVDYLDLYF
CHRPDIDTPI EETVRAMHDL VAQGKVLYWG TSEWSAQQLT EAYAVARDLR ITPPTMEQPQ
YNIFERQKVE SDYLPLYDLI GLGTTIWSPL ASGVLTGKYN NGVPADSRMN LPGYEWLKEK
WSSDAGRAQL KQVGELAKLA DEIGLSITHL ALLWCLANRN VSTVILGASR ASQLQDNLAA
LSHRQKMTPE VMGRIDTIVG NKPEGPRRF
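
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The Python sketch below does this for the first and last rows of the gene sequence. Translation table 11 (bacterial) assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon layout is used here. The helper names (`clean`, `translate`, `gc_percent`) are illustrative and not part of any tool referenced by this record.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGGAATATC GTCGTCTGGG AAAATCGGGT CTGCAAGTGA GCGAGTTCTC GTTCGGCTCA"
last_row = "AACAAGCCGG AAGGCCCGCG TCGATTCTAA"

print(translate(first_row))  # MEYRRLGKSGLQVSEFSFGS
print(translate(last_row))   # NKPEGPRRF
```

The two outputs match the start and end of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a figure to compare with the GC content field.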