Gene Rleg_4835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4835 
Symbol 
ID8007223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp210037 
End bp211554 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content62% 
IMG OID644821765 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_002973025 
Protein GI241113190 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0460635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGAAC CCTTGACCGC CTCCGAATAC AAGGCGATCG CGGCCGGCCT TCAGTTTCCA 
GCGAATGCCT TCGTCGACGG CGCATTTCGT CCGGCCAATT CCGGGCAGAC ATTCACCTCG
ACGAATCCCG CGACGGGCGA GGTTCTCGCC GAGATCGCCG CATGCGACGC CACCGATGTC
GACGCCGCCG TCGCCAAGGC AAAGCAAGCC TTCGACGACG GCCGCTGGCG GCTGCGTTCG
CCAGGTGAAC GCAAGGCGGT GCTCCTCAAG CTCGCCAGGC TGCTCGAGGA CAATCGTCAC
GAGCTCGCCG TCATGGAGAG CCTCGATAGC GGCAAGCCGG TCGGCGAATG CCAGACGGTC
GATGTTCCCG ATACCATTCA CACGATCCGC TGGCACGCCG AACTGATCGA CAAGCTTTAT
GACAACACCG CGCCTGTCGG CGCCAACGCA CTGACGATGA TCGTGCGTGA GCCGGTCGGC
GTCGTCGGAT GCGTGCTTCC GTGGAATTTT CCGCTTTTGA TGCTGGCCTG GAAGATCGGC
CCGGCGCTTG CTGCCGGCTG CTCGGTGATC GTCAAGCCTG CACAGGAGAC GACGCTCACC
GCGTTGCGCG TCGCCGAGCT TGCCCATGAA GCGGGCATTC CAGCCGGCGT GTTCAATGTC
GTGACCGGCG GCGGCAAAGA GGTCGGCGAG CCGATCGGCA TGCACATGGA TGTCGATATG
GTGGCCTTCA CCGGATCGAC GCCCACCGGG CGCCGCTTCC TGCGCTATGC AGCGGACTCG
AACCTCAAGC GCGTCGTGCT CGAATGCGGC GGCAAGAACC CCGCCGTCGT TCTCGACGAT
GCCGAAGACC TGGACCTCGT TGCCGAGCAG GTCGTCAATG GCGCCTTCTG GAACATGGGC
GAGAACTGCT CGGCCACGTC GCGTCTGATC GTTCATTCCA AAGTCAAGGA GGAGCTGCTG
AAGCGCATCG GCGCCTATAT GCGCGAATGG AAGACGGGCG ATCCGCTCGA CCCTGCAAAC
CGCATCGGCG CGCTTGTCAG CAAGGCCCAT TTCGAGAAGG TGAAATCCTT CCTCGACGAC
GCCAGGAAGG AGAAGCTGAC GGTCACCCAC GGTGGTGAAA CGTATGGCGG CATCTTTATC
GAACCGACAG TGGTCGAGGG TGTGACGCCT GCCAGCCGTC TTTTCCAGGA AGAGATCTTC
GGGCCGGTGC TTTCGGTCAC CACCTTCAAT TCGCTTGCCG AAGCAATCGC TCTTGCCAAT
GACACGAATT ACGGTCTGAC GGCGTCCGTC TATACCGGCA GCCTGAGGAA CGCCATCAAA
CTCTCGCGCG AGATCCGCGC CGGCGTCGTC ACCGTCAACT GCTTTGGAGA AGGCGACGCC
AGCACGCCGT TTGGCGGCTA CAAGGAGTCC GGCTTCGGCG GCCGCGACAA GTCGGTCTTT
GCCCATGACA ACTACTGCGA ACTGAAGACC ATCTGGATCG ATGTCTCGGA ACGCTCGGTC
GACGAGACCA TCCGATGA
 
Protein sequence
MHEPLTASEY KAIAAGLQFP ANAFVDGAFR PANSGQTFTS TNPATGEVLA EIAACDATDV 
DAAVAKAKQA FDDGRWRLRS PGERKAVLLK LARLLEDNRH ELAVMESLDS GKPVGECQTV
DVPDTIHTIR WHAELIDKLY DNTAPVGANA LTMIVREPVG VVGCVLPWNF PLLMLAWKIG
PALAAGCSVI VKPAQETTLT ALRVAELAHE AGIPAGVFNV VTGGGKEVGE PIGMHMDVDM
VAFTGSTPTG RRFLRYAADS NLKRVVLECG GKNPAVVLDD AEDLDLVAEQ VVNGAFWNMG
ENCSATSRLI VHSKVKEELL KRIGAYMREW KTGDPLDPAN RIGALVSKAH FEKVKSFLDD
ARKEKLTVTH GGETYGGIFI EPTVVEGVTP ASRLFQEEIF GPVLSVTTFN SLAEAIALAN
DTNYGLTASV YTGSLRNAIK LSREIRAGVV TVNCFGEGDA STPFGGYKES GFGGRDKSVF
AHDNYCELKT IWIDVSERSV DETIR