Gene Rleg2_1004 details

Gene Information

Locus tag: Rleg2_1004
Symbol:
ID: 6979723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1025909
End bp: 1026904
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 65%
IMG OID: 643395716
Product: aldo/keto reductase
Protein accession: YP_002280524
Protein GI: 209548607
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
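
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming 1-based inclusive coordinates (a common convention for such records, not stated on this page) and a single terminal stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(1025909, 1026904)
print(length, protein_length(length))  # 996 331
```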


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0837346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.566721
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACCA AGACCGCAAC ACCGACCACG ATCACGCTCT GGAACGGCCG CGAAATTCCG 
CGTCTCGGCA TGGGATGCTG GGCGATCGGC GGCCCCTTCT TTGCCGGCGA CACGCCGCTC
GGCTGGGGCG AAGTCGACGA CGATGAATCC GTCGAAGCGA TCGGCAGCGC CATCGACCTC
GGCATCCGCT TCTTCGATAC CGCCTCGAAT TACGGCGCCG GCCATTCCGA AGAAGTGCTC
GGCCGGGCGA TCGGCAATCG CGACGATATT ATTGTTGCCA CCAAATTCGG CTTTGCCACC
GATGCCGCCA CCAAGCAGGC TACCGGCGCC TTTGCCGATG AAGCCTTCAT CCGCCGTTCG
GTCGAGACCT CGCTGCGCCG CCTCAAGCGT GACCGCCTCG ATCTCCTACA GTTCCACATC
AATGATTTTC CGTTGGAACA GTCCGATGCC GTCTTCGACG TGCTGGAGGC GTTGCGCGTC
GAAGGCAAGA TCGACGCATT CGGCTGGAGC ACCGATTCTC CCGATCGCGC CGCCCGCCAT
GCCGGCCGCC AGGGCTATGT CTCGGTGCAG CATACGATGA ACGTCTTCGA GCCGGTGCCG
GAGATGATCG CAGTGATCGA AAGGCAGGAA CTGATCTCGA TCAATCGCGG TCCGCTGGCC
ATGGGGCTGC TGACCGGCAA GTTCACCGCC GACAAGGCGG TGGGCGCCAA GGATGTCCGC
GGCGCGGCCC TCGACTGGAT GGTCTACTTC AAGGACGGGC GCATGGCCCC GGAATTTGCC
GCAAGGCTCG ACGCCGTCCG CGATCTCCTG ACCTCGGGCG GCCGCACACT GACGCAAGGG
GCGCTCGCCT GGCTCTGGGC AAAGTCGCCG CGCACCCTCC CCATTCCAGG CTTCCGCACC
GTCGCCCAGG TGGAGGAAAA TGCCGGCGCA CTGGAAAAGG GACCGCTGCC GGCCGATGTC
ATGGCGGGGA TCGACGCCGC ACTCGGGCAT CAGTGA
 
Protein sequence
MLTKTATPTT ITLWNGREIP RLGMGCWAIG GPFFAGDTPL GWGEVDDDES VEAIGSAIDL 
GIRFFDTASN YGAGHSEEVL GRAIGNRDDI IVATKFGFAT DAATKQATGA FADEAFIRRS
VETSLRRLKR DRLDLLQFHI NDFPLEQSDA VFDVLEALRV EGKIDAFGWS TDSPDRAARH
AGRQGYVSVQ HTMNVFEPVP EMIAVIERQE LISINRGPLA MGLLTGKFTA DKAVGAKDVR
GAALDWMVYF KDGRMAPEFA ARLDAVRDLL TSGGRTLTQG ALAWLWAKSP RTLPIPGFRT
VAQVEENAGA LEKGPLPADV MAGIDAALGH Q
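
The relationship between the gene sequence and the protein sequence can be sketched in Python. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the sketch below builds the codon table from the NCBI amino-acid string for that code. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the code makes no attempt to handle alternative start codons or ambiguous bases.

```python
from itertools import product

# NCBI amino-acid string shared by translation tables 1 and 11,
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above give the first 10 residues.
print(translate("ATGCTGACCA AGACCGCAAC ACCGACCACG"))  # MLTKTATPTT
# The final three codons end in the TGA stop.
print(translate("CATCAGTGA"))  # HQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would be one way to compare against the protein sequence and GC content fields listed on this page.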