Gene Rleg_5346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5346 
Symbol 
ID8007304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp755304 
End bp756821 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content65% 
IMG OID644822250 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002973510 
Protein GI241113675 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.671468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGGA CCCCTTCAGG CAGGCATTTC ATCGCCGGCG AGTGGATTGC CGGAACGACG 
ACATTTCGGT CAGAGCCGGC GCATGGGCAG GCACATGACT TTGCTGTCGG CACAACGGAA
CTGGTCGACC GCGCCTGCCG CGCCGCCGAA GCTGCTTTTG CAGTGTTTTC GGCAACGACA
CGCGAGGAGC GCGGCATTTT CCTCGAGGCA ATCGCCGAGG AGATCGACAA GCGCGGCGAG
GCCGTCACCC TCATCGGAAC GCAGGAAACC GGGCTTCCGG AGGGCCGGCT CAACGGCGAA
CGCGCCCGCA CCACCGGCCA ACTCAAACTG TTTGCCGACC ATATCCGCAA GGGCGCGCAT
CTCGACGCAC GCGTCGACGC AGCCCAACCC GATCGGCAGC CGGCGCCGCG GCCCGAGATC
CGTCTGGTGC AACGGCCGAT CGGCCCGGTC GCCGTCTTCG GCGCTTCGAA TTTTCCGCTG
GCATTCTCGA CGGCCGGCGG CGATACGGCA GCCGCGCTTG CCGCGGGCTG CCCGGTCGTG
GTGAAGGGAC ATTCGGCCCA TCCCGGCACA GGTGAGATCA TTGCCGAGGC GATTGCAGCC
GCTATCGAGC GCACTGGAAT GCCGGCCGGC GTCTTCAGCC TGATCCAGGG CGGCCGTCGT
GATGTCGGAA CGGCGCTGGT GACGCACCCG GCCATCAAAG CCGTTGGCTT TACCGGTTCA
CTTGCCGGCG GAAGGGCGCT CTTTGACCTT TGCGCCCAGC GCCCTGAGCC GATCCCCTTT
TTCGGGGAAC TCGGCAGCGT TAATCCGATG TTCCTGCTGC CGGCCGCTAC CGCTGCCCGG
GCCGAGGCAA TCGGTTCAGG CTGGGCCGGT TCACTGACGC TTGGTGCCGG CCAGTTCTGC
ACCAAACCCG GTATCGCCGT CGTCGTCGAC GGGCCGGAAG CGGACAAGTT CACCAGTGCC
GCCAAAGCGG CTCTTGAAAA GGTGGCGCCG CAGACGATGC TGACCAAAGG CATCGCCTCG
GCCTATCACG AAGGTGTCGA GCGCATGCGA ACAAGCAATG CCGTCGCGCC GGTTCTGGCG
GCACAGAGTG CTGGCCGCGA AGCAACGCCG AACCTGTTCG AGACCAACGG CTCGGCCTGG
CTTGCCGATC ACTCGCTCAG CGAAGAAGTA TTCGGTCCTC TCGGCCTCGT CGTGCGCGTC
GGCTCGCCTG AAAAGTTGCT CACCCTTGCC GAAAGCTTTC AGGGACAACT GACCGCGACG
ATCCATATGG ACGACGCCGA TCTGGGCCTT GCCCGCGACC TGCTGCCAAT CCTCGAACGG
AAGGCCGGCA GGGTGCTGGT CAACGGCTTT CCAACCGGCG TCGAGGTTGT CGATTCCATG
GTGCATGGCG GACCCTACCC GGCCTCGACC AACTTCGGCG CGACCAGCGT CGGAACCATG
TCGATCCGCA GGTTTTTGCG CCCCGTCGCC TATCAGAATT TCCCCGCCGA CCTGTTGCCG
CAAGACCTGC GCAACTGA
 
Protein sequence
MSWTPSGRHF IAGEWIAGTT TFRSEPAHGQ AHDFAVGTTE LVDRACRAAE AAFAVFSATT 
REERGIFLEA IAEEIDKRGE AVTLIGTQET GLPEGRLNGE RARTTGQLKL FADHIRKGAH
LDARVDAAQP DRQPAPRPEI RLVQRPIGPV AVFGASNFPL AFSTAGGDTA AALAAGCPVV
VKGHSAHPGT GEIIAEAIAA AIERTGMPAG VFSLIQGGRR DVGTALVTHP AIKAVGFTGS
LAGGRALFDL CAQRPEPIPF FGELGSVNPM FLLPAATAAR AEAIGSGWAG SLTLGAGQFC
TKPGIAVVVD GPEADKFTSA AKAALEKVAP QTMLTKGIAS AYHEGVERMR TSNAVAPVLA
AQSAGREATP NLFETNGSAW LADHSLSEEV FGPLGLVVRV GSPEKLLTLA ESFQGQLTAT
IHMDDADLGL ARDLLPILER KAGRVLVNGF PTGVEVVDSM VHGGPYPAST NFGATSVGTM
SIRRFLRPVA YQNFPADLLP QDLRN