Gene Rleg_5578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5578 
Symbol 
ID8016469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp159057 
End bp160043 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content65% 
IMG OID644827744 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002978944 
Protein GI241518316 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.271425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGT TGGCAAGAGC AGCCGAGGGT GCGACCCGGG CGGCGATCAT CACCGGTCCC 
GGAGACGTGT CCGTCGAAGA GAGACCTCTT CCTCAGCCGA GGCCGGGGCA GGTCAGGATC
AGGCTCGAGG GGTCGGGCGT GTGCGCTTCC AATCTTGGCC CTTGGGCCGG CCCGGAATGG
ATGAGTTTTC CGACGGAGCC GGGCGGTCTC GGGCACGAGG GCTGGGGCAG GATCGATGCG
CTCGGGGACG GGGTTTCTGG CCTCGCCCTG GGAGACCGCG TCGCCGCGCT CACCTATCAC
GCCTACGCCA CCCACGACAT CGCCGATGAA GACATGGTCG TGTCGCTGCC GCGGGCGTTC
GATGGCCAAC CCTTTCCGGG CGAGCCGCTC GGCTGCGCGA TGAACATATT CCGGCGCAGC
CGCGTCGAGG CGGGTCAGAC CGTCGCCATC ATCGGGATCG GATTTCTCGG CGCCTTGCTA
ACGCAGCTTG TCAGCGGCGC CGGCGCGCGC GTCATTGCCA TCTCGCGCCG TCCGTTTTCA
CTTGAAATCG CCAAGCGCAT GGGCGCCGCC GAGACCGTGT CGATGGACGA CCACTGGCGG
ATCATCGAAG CTGTGAGAGA ATTGACCAAC GGTGCTTTCT GCCACCGCGT CATAGAGGCG
GTCGGCAAGC AGTGGCCGCT CGATCTCGCC GGCGAGCTGA CCGCCGAGCG CGGCCGGCTG
GTGGTTGCTG GATACCATCA AGACGGGCCG AGGCAAGTCA ACATGCAGCT CTGGAACTGG
CGTGGCATCG ACGTCATCAA TGCTCACGAA CGCGATCCCA GGATTTATGT CAGTGGCATG
CGCGAGGCGA TCGCAGCAAT GATCTCCGGC AGGCTCGATC CGTCATCGCT CTATACGCAC
GTCTATCCGC TCGAAGGCCT TGGCGAGGCG CTCGACGCCA CCCGCGATAG GCCCGATGGT
TTCCTCAAGG CGATGGTGAC ATGCTGA
 
Protein sequence
MNMLARAAEG ATRAAIITGP GDVSVEERPL PQPRPGQVRI RLEGSGVCAS NLGPWAGPEW 
MSFPTEPGGL GHEGWGRIDA LGDGVSGLAL GDRVAALTYH AYATHDIADE DMVVSLPRAF
DGQPFPGEPL GCAMNIFRRS RVEAGQTVAI IGIGFLGALL TQLVSGAGAR VIAISRRPFS
LEIAKRMGAA ETVSMDDHWR IIEAVRELTN GAFCHRVIEA VGKQWPLDLA GELTAERGRL
VVAGYHQDGP RQVNMQLWNW RGIDVINAHE RDPRIYVSGM REAIAAMISG RLDPSSLYTH
VYPLEGLGEA LDATRDRPDG FLKAMVTC