Gene Rleg_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0442 
Symbol 
ID8011642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp457852 
End bp459312 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content61% 
IMG OID644823036 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002974290 
Protein GI241203194 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC TTCGCATTCT CAATTGGATC AACGGCCAGG CCAGCCACGC CTCGAGTGAA 
GGATGGCTTG AGAAATTCAA TCCGCACAGT GGCGAACTCC TTTATCACGT GGCTGACTCC
TCGCAGGATG ATGTTGAGCA AGCAATAACG GCAGCGCGTT CGGCGTTCCC AGCCTGGGCG
GAGCTTACAC CCGTAAAGCG CGGCCAGATT CTGATGGATA TCGTCGCCCT GATGAAGCGA
CGTTCCGATG AGCTGGCGGA ATGCATTGCG CTTGAAACCG GCAAACCTCC CCAGGACGCC
AAAGGCGAGA CGGGCGGAGC GATCATGCAG GCGGAATATT TCGCCGGCGA GGGTATGCGC
CTATACGCCC GGTCGCTCAC CTCAGGCACG CCCGGCAAAT ACAGCCACAC AGTGCGCCAA
CCTCGTGGCG TAGCCGGTCT GATCGTGCCG GCAAATACGC CGATCGCCAA TATCGCATGG
AAGACTTTTC CTGCGCTTAT TTGCGGCAAC ACGGTGGTTC TGAAAGCTGC CGAGGACTCT
CCACGCATAG CCCAACTCTT TGCCGAGCTG ACCAAGGAGG CGGGATTGCC CGACGGCGTA
TTCAACGTCG TACATGGGCG TGGCGAGCCG GCTGGCTCGA CGTTGGTCAC AGACGAGCGG
GTCGACATTA TCAGCTTCAC GGGCTCGACC GGAGTAGGCC GCAGGATTGC GGAAGTCGCT
GGAAAGCGTC TCGCACGTAT TTCTCTCGAA CTGGGCGGCA AGAACCCCTT CGTCGTCTGT
GATGACGCCG ATCTCGATCA GGCGGTGCAC TGGGCGGCGC TGTCGGCCTT CAGCAATGCC
GGCCAGCGCT GCGCCGCAGG TAGCCGCATG CTGGTGTTTA AATCGGTCTA CGAGGAGTTT
CGGGACCGAC TGACCGCAAA GGCCAGAAGC CTCAAGCTAG GTGTTGCCGC CGGATGCGAT
CTCGGGCCGC TCGTCAGCCT CCGCCAACAG CAGTCCGTGC TTTCCGCCAT CGAACGCGCA
AAAGAACAAG GCGGCCAGGT GCTTTGCGGG GGGCGCACAC CGGACGCACC GGAGTTGGCC
GGAGGCTATT ATGTCGAGCC TACAGTTATC GATGGCCTTG CCACCACGTC GGATCTCAGT
TGCAAGGAAG TCTTCGGTCC GGTGACGACA CTCCATCCCG TCGGCAGCAT GACCGAGGCG
CTGGATGTAG CAAACGCCAC CGAATACGGA TTGACCGCTG CTGTGCATAC CCGCAACGTC
GATCGCGCGA TGTGGTTCGC CCAAAGGGTC AAAGCCGGCG TCGCCAATGT CAACATGGGT
ACGTATGGCA GCGAGCCGCA CATGCCGTTC GGCGGCTTCG GGTCGTCCGG GAATGGCACG
CGCGAGCCTG GAGTCGAGGC GCTCGATGTG TATTCGGAAC TGAAAAACAT CTCCTTCCTT
GTCCGCCCGG GGATGCTTTG A
 
Protein sequence
MTTLRILNWI NGQASHASSE GWLEKFNPHS GELLYHVADS SQDDVEQAIT AARSAFPAWA 
ELTPVKRGQI LMDIVALMKR RSDELAECIA LETGKPPQDA KGETGGAIMQ AEYFAGEGMR
LYARSLTSGT PGKYSHTVRQ PRGVAGLIVP ANTPIANIAW KTFPALICGN TVVLKAAEDS
PRIAQLFAEL TKEAGLPDGV FNVVHGRGEP AGSTLVTDER VDIISFTGST GVGRRIAEVA
GKRLARISLE LGGKNPFVVC DDADLDQAVH WAALSAFSNA GQRCAAGSRM LVFKSVYEEF
RDRLTAKARS LKLGVAAGCD LGPLVSLRQQ QSVLSAIERA KEQGGQVLCG GRTPDAPELA
GGYYVEPTVI DGLATTSDLS CKEVFGPVTT LHPVGSMTEA LDVANATEYG LTAAVHTRNV
DRAMWFAQRV KAGVANVNMG TYGSEPHMPF GGFGSSGNGT REPGVEALDV YSELKNISFL
VRPGML