Gene Rleg_5026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5026 
Symbol 
ID8007617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp413620 
End bp415083 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content65% 
IMG OID644821941 
ProductBetaine-aldehyde dehydrogenase 
Protein accessionYP_002973201 
Protein GI241113366 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.985059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTCA GCTTCGATCC GAACACGATT TCCCTGCCCG TCGGACATTT CATCGGCGGC 
CGGCTGGTTC CGGCCGAAGC CGTCATCGAC ATGCATCGTC CATCCGACGG CAAGGCCTAT
GCCGGCTGCC CGCTGGCCGA CGAGGCCCTT GTCGATCAGG CTGTCGAAAC CGCCAGGACC
GCATTGAAGA CGAGCAATTG GGGTGGCATA CGGCCACGCG AGCGCACCGT GGCGCTGCAG
CGCTGGGCGG ACTTGATCGA AGCCGAGGCG GAGACCCTCG CCAGGCTCGA AGCGCTTTCC
TCGACACGGC CAGTCGGCCA TCTCGTCGCC GGCGATATCG CCGTCACCGC CGAACAGATC
CGTTTCTTTG CTGAATTCGC CGACAAGGAA GGCGGCGACC TCGTCCCGAC TGACAATGCG
AATTTCGGCA TGATCATGAC GGAACCCTAT GGCGTCGTCG GCGCCATCAC GCCCTGGAAT
TATCCCGTGT CCATGGCCGG CTGGAAGCTC GGCCCGGCGC TGGCGGCAGG CAATGCGGTG
GTGTTGAAGC CATCGGAAAT GACGCCGTTT TCCACACTCT ATCTCGCCGA ACTTTCGGTG
CGGGCCGGCC TGCCGGCCGG TCTCGTCAAC ATCGTGCTCG GCGACGGCCC GACCACCGGA
AATGCGATCA CCGGACATCC AGGCATCTCC AAGGTCAGTT TCACCGGTTC GACCGCGGCC
GGCTCGGCGA TCATGACCAA CATCGCCCGC ACCGGTGTCA AGCCGATGAC GCTCGAACTC
GGCGGCAAGA GCCCGCAGCT GGTTTTCGCC GATGCCGATC TCGAGCTTGC GGCGGGCGCA
ATCGCCGGCA GCATTCTCTC CAACGCCGGT CAGGCCTGCG TTTGCGGATC CCGCCTCATC
GTCGAGGCGA AGGTGGCGGA CGCGCTGGCA GCCGCACTTA TCGAGAGGCT GGCGGCCATC
CGCCCCGGCC CCACCTGGGA CGAGGCGACC GATTATTCGC CGGTCATCTC CGAACGACAG
ATCGCCCGCA TGGACGGCAT CATCCGTGCC GCTATCGACG ACGGCGCCGA ATGCCTCACC
GGTGGCCGCC GGCTCGACCG CGAAGGTTAT TTCTACGCAC CGACCCTGAT TTCGGGCGTG
ACCGCAACAT CGCCGGCGGT TCTTGAGGAA ATTTTTGGAC CGGTGCTGAC CATTCAGACC
TTCGAGGACG AGGAGGAGGC GCTGAGGCTC GCCGACCATC CGGCCTATGG GCTCGCCGCC
GGCCTCTTCA CCCGCGATCT CTCGCGTGCC ATCCGCGTGA CCCGCCGCCT GCAGGCCGGC
ACCGTCTGGG TCAACCGCTA CGGCCGCTCG CGCGACCATA TCCTGCCGAC CGGCGGCTAC
AAGCAGTCCG GCATCGGCAA GGATCTCGGC CGCGACGCCT ATCTCGCCAA CCGCAAGAGC
AAGAGTGTGC TCATCAGCCT GTGA
 
Protein sequence
MTLSFDPNTI SLPVGHFIGG RLVPAEAVID MHRPSDGKAY AGCPLADEAL VDQAVETART 
ALKTSNWGGI RPRERTVALQ RWADLIEAEA ETLARLEALS STRPVGHLVA GDIAVTAEQI
RFFAEFADKE GGDLVPTDNA NFGMIMTEPY GVVGAITPWN YPVSMAGWKL GPALAAGNAV
VLKPSEMTPF STLYLAELSV RAGLPAGLVN IVLGDGPTTG NAITGHPGIS KVSFTGSTAA
GSAIMTNIAR TGVKPMTLEL GGKSPQLVFA DADLELAAGA IAGSILSNAG QACVCGSRLI
VEAKVADALA AALIERLAAI RPGPTWDEAT DYSPVISERQ IARMDGIIRA AIDDGAECLT
GGRRLDREGY FYAPTLISGV TATSPAVLEE IFGPVLTIQT FEDEEEALRL ADHPAYGLAA
GLFTRDLSRA IRVTRRLQAG TVWVNRYGRS RDHILPTGGY KQSGIGKDLG RDAYLANRKS
KSVLISL