Gene Rleg_5540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5540 
Symbol 
ID8016431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp127080 
End bp128138 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content65% 
IMG OID644827707 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_002978907 
Protein GI241518279 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0827042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0000079765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGGA TTTTCACCTA TAGCGGCAGC CCCGCCCATA TCGTCTTCGG CGAAGGCAAG 
AGCGCCGCCG CCGGCGAGTG GGTGGAAAAG CTCGGATGCA CCAAGGCCCT CGTTCTCTCG
ACGCCGCAAC AGAAGGCCGA CGCCGAAGCG CTTGCCACGC GGCTGGGCTC TCTCGCTGTC
GGCGTCTTTG CTGGCGCCAC TATGCACACG CCAGTCGATG TCACCGAAGA GGCGATGGAG
GTGGTTTTCC AAACGCAGGC CGATTGCGTC GTTTCGCTCG GCGGCGGCTC GACCACCGGG
CTTGGCAAGG CGATCGCCTA TCGCACGGAT CTGCATCAGA TTGTCATTCC GACGACCTAT
GCCGGATCGG AAGTGACGCC GATCCTCGGC CAGACCGAGG CCGGGCGCAA GACGACCGTG
CGCCATGCGA GCATCCTGCC AGAGGTGGTG ATCTACGACC CGGCGCTGAC GCTTGGCCTG
CCGGTCGGCA TGAGCGTCAC CTCGGGCCTG AATGCCATGG CCCATGCGGT CGAGGCGCTC
TACGCGCAAG ACCGCAACCC GATTTCGACG CTGATGGCGG TCGAGGGGCT GCGGGCCTTC
AAGACCAGCC TGCCTGATAT CATCGCTAAT CCCCACGAGC CCGATGCCCG TGCCGATGCA
CTCTACGGCG CCTGGCTTTG CGGCACCGTG CTCGGCACGG TCGGTATGGC GCTGCACCAC
AAGATCTGCC ACACGCTGGG CGGCACTTTC GATACGCCGC ACGCCGACAC GCATGCGATC
ATGCTGCCGC ACACCGCCGC CTACAATGCC GCGGCCGTGC CGGAACTTCT GGCGCCGGTC
GCCGATATTT TCGGCGCCTC GGTCGGCGGC GGGCTTTGGG ATTTCGCGAG ACAAATCGGT
TCACCGCTGG CGCTGAAGGG TCTGGGCCTG AGCGTTGCCG ATCTCGATCG CGCAGCCGAG
ATCGCCACCG AAAATCCCTA CTGGAATCCA AGGCCGATCG ACCGGAAGTC CATTCGTGCC
CTGCTGCAGG ATGCCTGGGA GGGCAAGCGG CCGGCATAA
 
Protein sequence
MSRIFTYSGS PAHIVFGEGK SAAAGEWVEK LGCTKALVLS TPQQKADAEA LATRLGSLAV 
GVFAGATMHT PVDVTEEAME VVFQTQADCV VSLGGGSTTG LGKAIAYRTD LHQIVIPTTY
AGSEVTPILG QTEAGRKTTV RHASILPEVV IYDPALTLGL PVGMSVTSGL NAMAHAVEAL
YAQDRNPIST LMAVEGLRAF KTSLPDIIAN PHEPDARADA LYGAWLCGTV LGTVGMALHH
KICHTLGGTF DTPHADTHAI MLPHTAAYNA AAVPELLAPV ADIFGASVGG GLWDFARQIG
SPLALKGLGL SVADLDRAAE IATENPYWNP RPIDRKSIRA LLQDAWEGKR PA