Gene Rleg_5561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5561 
Symbol 
ID8016452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp147898 
End bp149067 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content60% 
IMG OID644827728 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002978928 
Protein GI241518300 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.210102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0249106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCAC TTACTTGGCA TGGCAAACAC GACATTCGAT GCGAGAGCGT TCCAGATCCT 
CAAGTCGAGG AAGGGCGCGA TGCCATCATC AAGGTGACGG CCTGCGCGAT CTGCGGGTCT
GATCTTCACC TCTTCAACGG AGTGATGCCG GACATGCATA ACGGCGACAT CATGGGCCAC
GAGACGATGG GCGAGGTCGT CGAAGTCGGC AAGGACAACA AAAAGCTCAA GGTCGGCGAC
CGTGTCGTGG TGCCGTTCAC CATCTCCTGC GGCGAATGCT TCTTCTGCCA GCGCGGCTTT
TATTCCGGAT GCGAACGCAG CAACCCCGAC CCGGCGAAGG TCAAGAAGAT GTGGGGGAAT
TCACCCGCGG GCCTGTTCGG CTACACACAT CTTCTTGGCG GCTACAGTGG AGGTCAGGCT
GAGTACCTGC GCGTGCCTTA CGCGGACGTC GGCCCGATCA AGGTGCCGGA CGGACTGACG
GATGAGCAGG TGCTGTTTCT CTCCGACATC TTTCCGACGG GGTACATGGC CGCGGACTTC
TGCGATATCC AGCCCGGCGA CACGATTGCG ATCTGGGGCT GCGGCCCCGT CGGACAGATG
GCGATCAAGT CGGCCTTCAT CCTCGGCGCC GAGCGCGTCA TCGCTATCGA TACCGTGCCG
GAGCGGCTTG CTCTTGCCGA AGCGTCGGGA GCCACCACCC TCGATTTCAT GGACGAAGAC
ATCTACGACA AGCTCATGGA GCTGACCAAC GGGCGCGGCG CAGACGCGTG CATCGATGCG
GTCGGCACCG AAGCCGATCC GTCGGCGAGC TGGGACTCGC GCCTTGACCG CATCAAGGTC
GCCACGTTCA TGGGAACCGA TCGCCCCCAC GTTCTTCGCC AGGCAATCCA TTGCTGCCGC
AACTTCGGTA CGGTGTCGAT CGTCGGCGTC TATGGTGGTT TCCTCGACAA GATCCCTATG
GGTTCGGCGA TCAATCGCGG CCTGACGTTC CGAATGGCGC AGACCCCGGT GCAGCACTAC
CTGCCGCTGC TCATGGAACG CATTCAAAAC GGCGAAATCG ATCCGTCGTT CATTATCACG
CATCGTGCGA CCCTCGACGA AGGTCCCGAG CTCTACAAGA CATTCCGCGA CAAGAAGGAT
GGCTGCATAA AGGTTGTTCT TAAGCCGTAA
 
Protein sequence
MKALTWHGKH DIRCESVPDP QVEEGRDAII KVTACAICGS DLHLFNGVMP DMHNGDIMGH 
ETMGEVVEVG KDNKKLKVGD RVVVPFTISC GECFFCQRGF YSGCERSNPD PAKVKKMWGN
SPAGLFGYTH LLGGYSGGQA EYLRVPYADV GPIKVPDGLT DEQVLFLSDI FPTGYMAADF
CDIQPGDTIA IWGCGPVGQM AIKSAFILGA ERVIAIDTVP ERLALAEASG ATTLDFMDED
IYDKLMELTN GRGADACIDA VGTEADPSAS WDSRLDRIKV ATFMGTDRPH VLRQAIHCCR
NFGTVSIVGV YGGFLDKIPM GSAINRGLTF RMAQTPVQHY LPLLMERIQN GEIDPSFIIT
HRATLDEGPE LYKTFRDKKD GCIKVVLKP