Gene Rleg_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1481 
Symbol 
ID8012567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1466387 
End bp1467907 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content62% 
IMG OID644824070 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002975312 
Protein GI241204216 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.450354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTTC TCGTCAACCC CACCGCGCTC AACGACCACA AAGCGCGTGA TTTCAAGATG 
CTCATCGACG GCAGATGGGA GGCTGGTGCC TCCGATCCGA TCGAGCGTGT CGCGCCGAGC
CATGGTGTCG TGGTCAGCCG GTTTCCGACC GGCAGCAGGA AGGACGCCGA GCGTGCGATT
TCCGCGGCAC GCAAGGCTTT CGATCTTGGG CCGTGGCCCC GAATGACCGC TTCCGAACGT
TCCGCCATCC TGCTCAAGGC GGCCGATCTG ATCGCAGCGC GCGCAGAGGA GCTGGCATTT
CTCGATGCTA TCGAGGCAGG AAAGCCGATC ACGCAGGTGC GGGGAGAAAT TGCAGGCTCC
GTTGACATAT GGCGTTACGC GGCGGCACTT GCCCGCGACC TTCACGGTGA AAGCTACAAC
ACGCTCGGCG ACGGCACGCT CGGCGTCGTC TTGCGCGAAG CGATCGGCGT GGTGTCGATC
ATCACGCCTT GGAACTTTCC GTTCCTGATC GTCGGCCAGA AGCTGCCATT CGCCTTGGCC
GCTGGCTGCA CGACCGTCGT CAAGCCCTCG GAACTGACCT CGGGATCGAC GCTGGTGCTG
GGAGAGATCC TGCAGCAGGC CGGCATTCCG GATGGTGTCG TCAACATTGT CACCGGCACG
GGACCAGAGG TCGGTGCGAT CATGACCTCC CATCCCGACG TCGACATGGT GTCCTTCACC
GGCTCGACCG GCGTGGGAAA GCTGACCATG TCGAATGCCG CACAGACGCT GAAGAAGGTC
TCGCTGGAAT TGGGTGGGAA GAACCCGCAG ATCGTGTTTC CGGATGCCGA TCTCGGTGCC
TTCATCGATG CCGCGGTCTT CGGCGCATAC TTCAATGCCG GCGAGTGCTG CAATGCCGGT
TCGCGGCTGA TCCTTCACAA ATCAATCGCT TCCGATGTCG TCAAGCGGAT TGCCGAATTG
TCGAAGGCAG TGAAGGTCGG TGACCCGCTC GATCCCTCGA CGCAGGTCGG CGCCATCATC
ACGCCACAGC ATCTGGAGAA AATATCAGGA TACGTTGCCG GCGCGAGGAG CAGCGGTGCC
CGGGTCGCGC ATGGCGGCGA GACGCTCGAC CTCGGCATGG GGCAGTTCAT GTCGCCGACG
ATCCTCGAAG AGGTCACCCC TGATATGGCC GTGGCGCGCG AGGAAGTCTT TGGCCCGGTG
CTTTCGGTCC TGACATTCGA GACATCAGCA GAAGCCATCA GGATTGCGAA TTCCATCGAC
TACGGCCTGT CGGCCGGTGT CTGGAGCCGC GATTTCGACA CGTGCCTGAC GATCGGCCGA
TCGGTGCGGG CGGGCACGGT CTGGATGAAC ACCTTCATGG ACGGCGCCTC GGAACTTCCC
TTTGGCGGTT ACAAGCAAAG CGGCCTCGGC CGTGAGCTCG GCCGCCATGC GGTTGAAGAT
TACACGGAGA CCAAGACGCT CAACATGCAT ATCGGCAAAC GCACCAGCTG GTGGATGCCG
CAGACAGAAA AGCTGGCTTA G
 
Protein sequence
MTVLVNPTAL NDHKARDFKM LIDGRWEAGA SDPIERVAPS HGVVVSRFPT GSRKDAERAI 
SAARKAFDLG PWPRMTASER SAILLKAADL IAARAEELAF LDAIEAGKPI TQVRGEIAGS
VDIWRYAAAL ARDLHGESYN TLGDGTLGVV LREAIGVVSI ITPWNFPFLI VGQKLPFALA
AGCTTVVKPS ELTSGSTLVL GEILQQAGIP DGVVNIVTGT GPEVGAIMTS HPDVDMVSFT
GSTGVGKLTM SNAAQTLKKV SLELGGKNPQ IVFPDADLGA FIDAAVFGAY FNAGECCNAG
SRLILHKSIA SDVVKRIAEL SKAVKVGDPL DPSTQVGAII TPQHLEKISG YVAGARSSGA
RVAHGGETLD LGMGQFMSPT ILEEVTPDMA VAREEVFGPV LSVLTFETSA EAIRIANSID
YGLSAGVWSR DFDTCLTIGR SVRAGTVWMN TFMDGASELP FGGYKQSGLG RELGRHAVED
YTETKTLNMH IGKRTSWWMP QTEKLA