Gene Rleg2_1377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1377 
Symbol 
ID6980105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1397133 
End bp1398653 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content64% 
IMG OID643396098 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002280897 
Protein GI209548980 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.162758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTCC TCGTCAACCC CACGGCGCTC AGCGACCACA AGGCGCGTGA TTTCAAGATG 
CTGATCGACG GCAGATGGGA GACCGGGGCC GCCGATCCGA TCGAGCGTGT CGCACCGAGC
CATGGGGTCG TGGTCAGCCG GTTCCCGACC GGCAGCAGGA ATGATGCCGA GCGCGCCATT
GCCGCCGCAC GCAAGGCTTT CGACCAGGGA CCATGGCCGC GGATGACCGC GTCCGAACGC
TCCGCTATCC TGCTCAAGGC GGCTGATCTG ATCGCAGCGC GTGCGGAGGA ACTGGCATTT
CTCGATGCCA TCGAGGCCGG AAAACCGATC ACGCAGGTGC GGGGCGAAAT TGCCGGCTCG
GTCGACATCT GGCGCTATGC GGCGGCTCTC GCACGCGATC TCCACGGTGA AAGCTACAAC
ACGCTCGGCG ACGGCACGCT CGGCGTCGTG CTCCGCGAAG CGATCGGCGT GGTCTCGATC
ATCACGCCCT GGAATTTCCC GTTCCTGATC GTCGGCCAGA AGCTGCCCTT CGCGCTTGCG
GCGGGCTGCA CGGCCGTCGT CAAACCTTCG GAGCTGACAT CGGGATCGAC GCTCGTGCTC
GGAGAAATTC TGCAGCAGGC CGGCGTTCCG GATGGCGTCG TCAATATTGT CACCGGTACG
GGACCTGAGG TCGGCGCGAT CATGACATCT CATCCCGATG TCGACATGGT CTCCTTCACC
GGCTCGACCG GTGTCGGAAA ACTGACCATG TCGAATGCAG CGCAGACGCT GAAGAAGGTC
TCGCTGGAAC TCGGCGGCAA GAACCCGCAG ATCGTTTTCC CGGATGCCGA TCTCGACGCC
TTCGTCGATG CCGCGGTCTT CGGTGCTTAT TTCAATGCCG GCGAGTGCTG CAATGCCGGC
TCGCGGCTGA TCCTGCACAA ATCGATCGCC TCAGACGTCG TCAGCCGGAT TGCCGAACTG
TCGAAGGCAG TGAAGGTCGG CGATCCCCTT GATCCCTCCA CACAGGTCGG CGCGATCATC
ACGCCGCAGC ATCTGGAGAA GATCTCAGGC TATGTCACTG GTGCCAGGAG CAGCGGCGCC
CGTGTCGCCC ATGGCGGCAA GACGCTCGAC CTCGGCATGG GGCAGTTCAT GTCGCCGACG
ATCCTCGAAG CGGTCACCCC CGATATGGCG GTGGCGCGCG AGGAAGTCTT TGGCCCGGTC
CTGTCGGTCC TGACATTCGA GACATCGGCC GAGGCGATCA GCATCGCCAA TTCCATCGAC
TATGGCCTGT CGGCCGGTGT CTGGAGCCGC GATTTCGACA CCTGCCTGAC GATCGGCCGG
TCGGTGCGGG CGGGCACCGT CTGGATGAAC ACCTTCATGG ACGGCGCCTC GGAGCTTCCC
TTCGGCGGCT ACAAGCAGAG CGGCCTCGGC CGCGAACTCG GCCGCCATGC GGTCGAGGAT
TACACCGAGA CCAAGACGCT GAACATGCAT ATCGGCAAGC GCACCGGCTG GTGGATGCCG
CAGACGGAAA AGCCGGCTTA G
 
Protein sequence
MTVLVNPTAL SDHKARDFKM LIDGRWETGA ADPIERVAPS HGVVVSRFPT GSRNDAERAI 
AAARKAFDQG PWPRMTASER SAILLKAADL IAARAEELAF LDAIEAGKPI TQVRGEIAGS
VDIWRYAAAL ARDLHGESYN TLGDGTLGVV LREAIGVVSI ITPWNFPFLI VGQKLPFALA
AGCTAVVKPS ELTSGSTLVL GEILQQAGVP DGVVNIVTGT GPEVGAIMTS HPDVDMVSFT
GSTGVGKLTM SNAAQTLKKV SLELGGKNPQ IVFPDADLDA FVDAAVFGAY FNAGECCNAG
SRLILHKSIA SDVVSRIAEL SKAVKVGDPL DPSTQVGAII TPQHLEKISG YVTGARSSGA
RVAHGGKTLD LGMGQFMSPT ILEAVTPDMA VAREEVFGPV LSVLTFETSA EAISIANSID
YGLSAGVWSR DFDTCLTIGR SVRAGTVWMN TFMDGASELP FGGYKQSGLG RELGRHAVED
YTETKTLNMH IGKRTGWWMP QTEKPA