Gene Rleg2_4487 details

Gene Information

Locus tag: Rleg2_4487
Symbol:
ID: 6977581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 120092
End bp: 121282
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 64%
IMG OID: 643393665
Product: formaldehyde dehydrogenase, glutathione-independent
Protein accession: YP_002278483
Protein GI: 209546565
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR02819] formaldehyde dehydrogenase, glutathione-independent
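
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions agree with the values in this record). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but does not code for a residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(120092, 121282)
print(length, protein_length_aa(length))  # prints "1191 396"
```

With the values from this record, 1191 bp corresponds to 397 codons, one of which is the stop codon, leaving 396 residues.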


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.321496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA ATAGAGGCGT CGTTTATCTC CGGCCGGGCA AGGTCGAAGT TCGCGACATC 
GACGACCCGA AGCTGGAAGC GCCCGACGGC CGCCGCATCG AGCACGGCGT CATCCTGAAG
GTGATCTCCA CCAATATCTG CGGCTCCGAC CAGCACATGG TCCGCGGCCG TACGACGGCA
ATGCCAGGCT TGGTGCTCGG TCATGAAATC ACCGGCGAGA TCATCGAGAA AGGCGTCGAC
GTCGAGATGC TCGATATCGG CGATATCGTC TCGGTCCCGT TCAATGTCGC CTGCGGCCGT
TGCCGCTGCT GCAAGTCCCA GGATACCGGC GTCTGCCTGA CGGTCAATCC GGCCCGCGCC
GGCGGCGCTT ACGGTTATGT CGACATGGGC GGCTGGATCG GCGGACAGGC GCGCTACGTC
ACCATTCCCT ATGCCGATTT CAACCTGCTC AAAATCCCCG ATCGGGACAA GGCAATGGCG
AAGATCCGCG ATCTCACCAT GCTCTCCGAT ATTCTGCCCA CCGGCTTCCA TGGCGCGGTG
CGCGCAGGCG TTGGGGTCGG ATCGACCGTC TACGTCGCAG GCGCCGGCCC TGTCGGCCTT
GCGGCTGCCG CATCGGCGCG CATCCTCGGC GCCGCTGTCG TCATGATCGG CGACTTCAAC
AAGGATCGCC TGGCACATGC GGCAAGGGTC GGTTTCGAAC CGATCGACCT CTCCAAGAGC
GACCGCCTCG GCGACATGAT CGCCCAGGTC GTCGGCAGCA ATGAAGTGGA CAGCGCCATC
GACGCGGTCG GCTTCGAAGC GCGCGGCCAT TCGGGCGGCG AACAGCCGGC GATCGTGCTC
AATCAGATGA TGGAGATCAC CCGCGCCGCA GGCTCGATCG GTATTCCCGG CCTCTACGTC
ACCGAGGATC CGGGCGCCGT CGACAATGCC GCCAAACAAG GCAATCTGTC GCTCCGCTTC
GGCCTCGGCT GGGCCAAGGC GCAGTCTTTC CACACCGGCC AGACGCCGGT GCTCAAGTAC
AATCGGCAGC TCATGCAGGC GATCCTGCAC GACCGGCTGC CGATTGCCGA CATCGTCAAT
GCCAAGGTCA TTCCGCTCGA CGAGGCCGCC GCCGGATATG AGAGCTTCGA CCACGGAGCG
GCGACGAAAT TCGTCCTCGA TCCGCATGGG GACGTCGCGA AAGCCGCCTA G
 
Protein sequence
MSKNRGVVYL RPGKVEVRDI DDPKLEAPDG RRIEHGVILK VISTNICGSD QHMVRGRTTA 
MPGLVLGHEI TGEIIEKGVD VEMLDIGDIV SVPFNVACGR CRCCKSQDTG VCLTVNPARA
GGAYGYVDMG GWIGGQARYV TIPYADFNLL KIPDRDKAMA KIRDLTMLSD ILPTGFHGAV
RAGVGVGSTV YVAGAGPVGL AAAASARILG AAVVMIGDFN KDRLAHAARV GFEPIDLSKS
DRLGDMIAQV VGSNEVDSAI DAVGFEARGH SGGEQPAIVL NQMMEITRAA GSIGIPGLYV
TEDPGAVDNA AKQGNLSLRF GLGWAKAQSF HTGQTPVLKY NRQLMQAILH DRLPIADIVN
AKVIPLDEAA AGYESFDHGA ATKFVLDPHG DVAKAA
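
The gene and protein sequences above can be cross-checked against each other, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions: the sequence is pasted as shown, in space-separated blocks of ten; the codon table is the standard one, which to the best of my knowledge matches translation table 11 for elongation codons (table 11 differs in its permitted start codons, which this sketch does not handle); and all function names are hypothetical.

```python
# Standard codon table in TCAG order; '*' marks a stop codon.
CODON_BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * (bases.count("G") + bases.count("C")) / len(bases)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    bases = clean(seq)
    protein = []
    for i in range(0, len(bases) - 2, 3):
        first, second, third = bases[i:i + 3]
        # str.index raises ValueError for any base other than T, C, A, G.
        index = (16 * CODON_BASES.index(first)
                 + 4 * CODON_BASES.index(second)
                 + CODON_BASES.index(third))
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGCAAGA ATAGAGGCGT CGTTTATCTC"))  # prints "MSKNRGVVYL"
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing its spaces) is one way to confirm that the two listings agree; `gc_percent` on the same input gives a value to compare with the GC content field.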