Gene Rleg_0890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0890 
Symbol 
ID8012040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp881222 
End bp882685 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content65% 
IMG OID644823475 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_002974726 
Protein GI241203630 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.187132 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.973025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCC AGCCGAAAGC CTCGCACTTC ATCGATGGCG AATATGTCGA GGATACCGAC 
GGCACCGTCA TCGAGAGCCT CTATCCCGCC ACCGGCGAGG TGATCGCCCG GCTGCATGCC
GCAACGCCTG CGATTGTCGA ACGGGCGATC GCCGCCGCCA AGCGCGCCCA GCCGGAATGG
GCGGCGATGA GCCCGATGGC GCGCGGGCGC ATCCTGAAGC GGGCAGCCGA CATCATGCGC
GAGAGAAACC GGGCGCTCTC CGAACTCGAG ACGCTCGATA CCGGCAAGCC GATCCAGGAG
ACTGTTGTCG CCGACCCGAC CTCGGGCGCC GATGCCTTCG AATTCTTCGG CGGCATCGCG
CCGGCCGGTC TCAACGGCTC GCAGATCCCG CTCGGCCAGG ACTTTGCCTA TACCAAGCGC
GTGGCGCTTG GCGTCTGCGT CGGCATCGGC GCCTGGAACT ATCCGCAGCA GATCGCCTGC
TGGAAAGCTG CGCCGGCGCT TGTCTGCGGC AATGCCATGG TGTTCAAGCC ATCCGAGAAC
ACCCCGCTCG GCGCGCTGAA GATCGCCGAG ATCCTGCTTG AGGCGGGACT GCCGAAGGGG
CTCTTCAACG TGATCCAGGG CGACCGCGAC ACCGGACCGC TGCTCGTCAA CCATCCCGAT
GTCGCCAAGG TGTCGCTGAC CGGCTCGGTG CCGACGGGGC GCAGGGTCGC GGCGGCTGCC
GCCGGCAACC TCAAGCACGT GACGATGGAA CTCGGCGGCA AGTCGCCGCT CATCGTCTTC
GACGATGCCG ATCTCGATTC GGCGGTCGGG GGCGCGATGC TCGGCAATTT CTATTCGACC
GGCCAGGTCT GCTCGAACGG CACGCGCGTC TTCGTGCAGA AGACTGTTAA GGCCGAATTC
CTGAAGCGGC TGAAGATCCG CACCGAGGCG ATGCTGATCG GCGATCCGAT GGACGAGGCG
ACGCAGGTCG GGCCGATGGT CTCCTGGGCG CAGCGCGAGA AGGTGATCTC CTATATCGAG
AAGGGCAAGG CCGAGGGCGC AACACTCATT GCCGGCGGCG GCATTCCGAA CAACGTCTCC
GGCGAAGGCT ATTATGTGCA GCCGACGGTG TTTGCCGATG TCACTGACGA CATGACAATC
GCCCGCGAGG AAATCTTTGG CCCCGTCATG TCAGTGCTCG ATTTCGACGC CGAGGACGAG
GTGATCGCCC GCGCCAATGC CAGCGAATTC GGCCTTTCCG GCGGCGTCTT CACCGCCGAC
CTCACCCGCG CCCACCGCGT CGTCGACCGG CTGGAAGCGG GCACGCTGTG GATTAACACC
TATAATCTCT GCCCGGTGGA AATCCCCTTC GGCGGCTCGA AACAATCCGG CTACGGCCGC
GAGAATTCGC TTGCGGCGTT GGAGCATTAT TCCGAACTGA AGACGGTTTA TGTGGGCATG
GGGCCGGTGG CGGCGCCTTA TTGA
 
Protein sequence
MKAQPKASHF IDGEYVEDTD GTVIESLYPA TGEVIARLHA ATPAIVERAI AAAKRAQPEW 
AAMSPMARGR ILKRAADIMR ERNRALSELE TLDTGKPIQE TVVADPTSGA DAFEFFGGIA
PAGLNGSQIP LGQDFAYTKR VALGVCVGIG AWNYPQQIAC WKAAPALVCG NAMVFKPSEN
TPLGALKIAE ILLEAGLPKG LFNVIQGDRD TGPLLVNHPD VAKVSLTGSV PTGRRVAAAA
AGNLKHVTME LGGKSPLIVF DDADLDSAVG GAMLGNFYST GQVCSNGTRV FVQKTVKAEF
LKRLKIRTEA MLIGDPMDEA TQVGPMVSWA QREKVISYIE KGKAEGATLI AGGGIPNNVS
GEGYYVQPTV FADVTDDMTI AREEIFGPVM SVLDFDAEDE VIARANASEF GLSGGVFTAD
LTRAHRVVDR LEAGTLWINT YNLCPVEIPF GGSKQSGYGR ENSLAALEHY SELKTVYVGM
GPVAAPY