Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1377 |
Symbol | |
ID | 6980105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1397133 |
End bp | 1398653 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396098 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002280897 |
Protein GI | 209548980 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.162758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCC TCGTCAACCC CACGGCGCTC AGCGACCACA AGGCGCGTGA TTTCAAGATG CTGATCGACG GCAGATGGGA GACCGGGGCC GCCGATCCGA TCGAGCGTGT CGCACCGAGC CATGGGGTCG TGGTCAGCCG GTTCCCGACC GGCAGCAGGA ATGATGCCGA GCGCGCCATT GCCGCCGCAC GCAAGGCTTT CGACCAGGGA CCATGGCCGC GGATGACCGC GTCCGAACGC TCCGCTATCC TGCTCAAGGC GGCTGATCTG ATCGCAGCGC GTGCGGAGGA ACTGGCATTT CTCGATGCCA TCGAGGCCGG AAAACCGATC ACGCAGGTGC GGGGCGAAAT TGCCGGCTCG GTCGACATCT GGCGCTATGC GGCGGCTCTC GCACGCGATC TCCACGGTGA AAGCTACAAC ACGCTCGGCG ACGGCACGCT CGGCGTCGTG CTCCGCGAAG CGATCGGCGT GGTCTCGATC ATCACGCCCT GGAATTTCCC GTTCCTGATC GTCGGCCAGA AGCTGCCCTT CGCGCTTGCG GCGGGCTGCA CGGCCGTCGT CAAACCTTCG GAGCTGACAT CGGGATCGAC GCTCGTGCTC GGAGAAATTC TGCAGCAGGC CGGCGTTCCG GATGGCGTCG TCAATATTGT CACCGGTACG GGACCTGAGG TCGGCGCGAT CATGACATCT CATCCCGATG TCGACATGGT CTCCTTCACC GGCTCGACCG GTGTCGGAAA ACTGACCATG TCGAATGCAG CGCAGACGCT GAAGAAGGTC TCGCTGGAAC TCGGCGGCAA GAACCCGCAG ATCGTTTTCC CGGATGCCGA TCTCGACGCC TTCGTCGATG CCGCGGTCTT CGGTGCTTAT TTCAATGCCG GCGAGTGCTG CAATGCCGGC TCGCGGCTGA TCCTGCACAA ATCGATCGCC TCAGACGTCG TCAGCCGGAT TGCCGAACTG TCGAAGGCAG TGAAGGTCGG CGATCCCCTT GATCCCTCCA CACAGGTCGG CGCGATCATC ACGCCGCAGC ATCTGGAGAA GATCTCAGGC TATGTCACTG GTGCCAGGAG CAGCGGCGCC CGTGTCGCCC ATGGCGGCAA GACGCTCGAC CTCGGCATGG GGCAGTTCAT GTCGCCGACG ATCCTCGAAG CGGTCACCCC CGATATGGCG GTGGCGCGCG AGGAAGTCTT TGGCCCGGTC CTGTCGGTCC TGACATTCGA GACATCGGCC GAGGCGATCA GCATCGCCAA TTCCATCGAC TATGGCCTGT CGGCCGGTGT CTGGAGCCGC GATTTCGACA CCTGCCTGAC GATCGGCCGG TCGGTGCGGG CGGGCACCGT CTGGATGAAC ACCTTCATGG ACGGCGCCTC GGAGCTTCCC TTCGGCGGCT ACAAGCAGAG CGGCCTCGGC CGCGAACTCG GCCGCCATGC GGTCGAGGAT TACACCGAGA CCAAGACGCT GAACATGCAT ATCGGCAAGC GCACCGGCTG GTGGATGCCG CAGACGGAAA AGCCGGCTTA G
|
Protein sequence | MTVLVNPTAL SDHKARDFKM LIDGRWETGA ADPIERVAPS HGVVVSRFPT GSRNDAERAI AAARKAFDQG PWPRMTASER SAILLKAADL IAARAEELAF LDAIEAGKPI TQVRGEIAGS VDIWRYAAAL ARDLHGESYN TLGDGTLGVV LREAIGVVSI ITPWNFPFLI VGQKLPFALA AGCTAVVKPS ELTSGSTLVL GEILQQAGVP DGVVNIVTGT GPEVGAIMTS HPDVDMVSFT GSTGVGKLTM SNAAQTLKKV SLELGGKNPQ IVFPDADLDA FVDAAVFGAY FNAGECCNAG SRLILHKSIA SDVVSRIAEL SKAVKVGDPL DPSTQVGAII TPQHLEKISG YVTGARSSGA RVAHGGKTLD LGMGQFMSPT ILEAVTPDMA VAREEVFGPV LSVLTFETSA EAISIANSID YGLSAGVWSR DFDTCLTIGR SVRAGTVWMN TFMDGASELP FGGYKQSGLG RELGRHAVED YTETKTLNMH IGKRTGWWMP QTEKPA
|
| |