Gene Rleg_4113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4113 
Symbol 
ID8014911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4190999 
End bp4192537 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content65% 
IMG OID644826683 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002977893 
Protein GI241206797 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.189618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG CCGTCCTCGA TCTCGCCACC GAAACCGCCA AGCTGCTTGC CGAACTCGGC 
GTCGATGCCG GCCGCTATCA CGGCGGCACG CTTTCCGTCA CCTCGCCGGT CACCGGCAAG
GAAATCGGCA AACTCAGGGA ACATTCCGTT TCCGAGACGA AGGCGGCGAT CGAAGCGGCG
CATCAGGCCT TCCTCGAATG GCGTGCCGTG CCGGCGCCGA AGCGCGGCGA ACTGGTCCGC
CTGCTGGGCG AGGAACTGCG CGCCTCCAAG GCGGCGCTCG GCCGTCTCGT TTCGATCGAG
GTTGGCAAGA TCACTTCGGA AGGTCTGGGC GAAGTGCAGG AGATGATCGA CATCTGCGAT
TTCGCGGTCG GCCTTTCCCG TCAGCTTTAC GGCCTGACGA TCGCCACTGA GCGATCCGAG
CACCGGATGA TGGAAAGCTG GCATCCGCTC GGCGTGATCG GCATCATCTC CGCCTTCAAC
TTCCCTGTTG CCGTCTGGTC GTGGAATGCC GCACTTGCGA TGGTCTGCGG CAATTCCACC
GTCTGGAAGC CCTCGGAAAA GACGCCGTTG ACGGCGCTTG CCGTGCAGGC GCTGTTCGAA
AAGGCGCTGA AGCGTTTCGT CGCCGAGGGC GGTAGCGCAC CGGCCAATCT GTCGACGCTG
ATCATCGGCG GCCGCGAGGT CGGCGAAGTG CTGGTCGATC ATCCGAAGAT CCCGCTGGTT
TCCGCCACCG GCTCGACGGC CATGGGCCGC GCTGTCGGTC CGCGCCTGTC GCAGCGTTTT
GCCCGCGCCA TTCTCGAACT CGGCGGCAAC AATGCGGCGA TCGTCTGCCC GTCGGCCGAT
CTCGACCTGA CGCTGCGCGG CGTTGCCTTC TCCGCCATGG GTACGGCCGG CCAGCGCTGC
ACGACGCTGC GCCGTCTCTT CGTCCACGAG AGCGTCTACG ATCAGCTGGT GCCTAGACTG
CAGAAGGCCT ACGGCTCCGT CACCATCGGC AATCCGCTCG AAACCGGCAC GCTTGTAGGA
CCGTTGATCG ACGGCCAGGC TTTTAAAAAC ATGCAGGCAG CGCTTGGCGA GGCGAAGTCG
GCCGGCGGCA CGGTGACCGG AGGCGACCGC GTCGAAAGCG GTTCGACCGA GGCTTTCTAC
GTTCGCCCGG CGCTCGTGGA AATGCCTGAT CAGACCGGAC CGGTGGAGCA CGAGACCTTC
GCGCCGATCC TCTATGTGAT GAAATACAGC GATTTCGACG AGGTGCTGGC GCTGCACAAT
GCCGTGCCGC AGGGGCTGTC GTCGTCGATC TTCACCAACG ACATGCGCGA GGCCGAAACC
TTCGTCTCCG CCCGCGGTTC GGATTGCGGC ATCGCCAACG TCAACCTCGG GCCATCGGGC
GCCGAGATCG GCGGCGCCTT TGGCGGCGAG AAGGAGACCG GCGGCGGCCG TGAATCCGGC
TCGGATGCCT GGAAGGCCTA TATGCGCCGC TCCACCAACA CGATCAATTA CGGCAGGACG
CTGCCGCTGG CGCAGGGCGT CAAGTTCGAC GTCGAATAA
 
Protein sequence
MTIAVLDLAT ETAKLLAELG VDAGRYHGGT LSVTSPVTGK EIGKLREHSV SETKAAIEAA 
HQAFLEWRAV PAPKRGELVR LLGEELRASK AALGRLVSIE VGKITSEGLG EVQEMIDICD
FAVGLSRQLY GLTIATERSE HRMMESWHPL GVIGIISAFN FPVAVWSWNA ALAMVCGNST
VWKPSEKTPL TALAVQALFE KALKRFVAEG GSAPANLSTL IIGGREVGEV LVDHPKIPLV
SATGSTAMGR AVGPRLSQRF ARAILELGGN NAAIVCPSAD LDLTLRGVAF SAMGTAGQRC
TTLRRLFVHE SVYDQLVPRL QKAYGSVTIG NPLETGTLVG PLIDGQAFKN MQAALGEAKS
AGGTVTGGDR VESGSTEAFY VRPALVEMPD QTGPVEHETF APILYVMKYS DFDEVLALHN
AVPQGLSSSI FTNDMREAET FVSARGSDCG IANVNLGPSG AEIGGAFGGE KETGGGRESG
SDAWKAYMRR STNTINYGRT LPLAQGVKFD VE