Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4795 |
Symbol | |
ID | 8007479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 164510 |
End bp | 165955 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644821725 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002972985 |
Protein GI | 241113150 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0338248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTT CCCTCCTCAT AAACGGCGCC GACCGCGCGG CCTCCGGCGG CCGGACCTTT GATCGCATCG ATCCCTTCAC CGAGAAGCTC GCAAGCCGCG CCGCCGCGGC AAGCCTGGAT GATGCGGCCG CTGCCGTCGA AGCCGCCGCC GCTGCCTTCG GCGCCTGGTC GAAGACCGGC CCCGGCCAGC GCCGCGCGAT CCTGATGAAG GCTGCCGATA TCATGGATTC CAAAGTCGGC GAGTTTACCC GGCTGATGAT CGAGGAGACC GGCTCAACCG CACCCTGGGC CGGCTTCAAT GTCATGCTCG CCGCCAACAT CCTGCGCGAG GCCGGCGCCA TGACGACGCA GATATCAGGC GAAATCATCC CTTCCGATAA GCCGGGCACG CTCGCCATGG GTGTTCGCCA GGCCGCCGGC GTCTGCCTGG CGATCGCTCC CTGGAACGCG CCGGTCATCC TCGCCACCCG CGCCATCGCC ATGCCGATCG CCTGCGGCAA TACCGTCGTG CTCAAGGCCT CGGAACAATG CCCCGGCACG CATAGGCTGA TCGCCACCGC GCTGACCGAA GCCGGCCTGC CGGCCGGCGT CATCAACGTC CTCACCAACG CGCCGGAAGA TGCGCCCGAG ATCGTCGCCG CGCTGATTGC CCATCCGGCC GTCAAGCGCG TGAACTTCAC CGGGTCGACC AAGGTCGGCA AGATCATCGC CGAGACCTGC GGCAGGTATC TCAAACCCGC CCTTCTTGAG CTCGGCGGCA AGGCACCGCT GGTCATCCTC GACGACGCCG ATATCGACGG TGCCGTCAAT GCCGCCATTT TCGGCGCCTT CATGCATCAG GGCCAGATCT GCATGTCGAC CGAGCGGATC ATCGTCGACG AGACGATCGC CGATCAGTTC GTCGCCAAAC TGGCCGCCCG CGCCAGCCAG CTGCCGGCCG GCGACCCACG TGGCCACGTC GTTCTCGGCT CGCTGATCAG CCTCGATGCG GCGAAGAAAA TGGAGGAGCT GATCGCCGAT GCGACGGCCA AGGGGGCAAA ACTCGTGGCC GGCGGCAAAC GCTCGGGCAC CGTGGTCGAG GCGACGCTGC TTGATCATGT CACGCCCGAG ATGCGCGTCT ATGCAGAAGA ATCCTTCGGC CCGGTCAAGC CGATCATTCG CGTTACCAGT GAGGAAGAAG CCATCCGCAT CGCCAACGAC ACCGAATACG GCCTGTCATC CGCCGTCTTC AGCCGCAATG TCCAGCGGGC AATGGCGGTT GCGGCGCGGA TCGAATCCGG CATCTGCCAC ATCAATGGCC CGACGGTAAA CGACGAGGCG CAAATGCCCT TCGGCGGCGT CAAGGGCAGC GGCTACGGCC GCTTCGGCGG CAAGGCGGCA ATCGCTGAAT TCACCGACCT GCGCTGGATC ACGGTCGAGG ATTCCGCCCA GCACTATCCT TTCTGA
|
Protein sequence | MNISLLINGA DRAASGGRTF DRIDPFTEKL ASRAAAASLD DAAAAVEAAA AAFGAWSKTG PGQRRAILMK AADIMDSKVG EFTRLMIEET GSTAPWAGFN VMLAANILRE AGAMTTQISG EIIPSDKPGT LAMGVRQAAG VCLAIAPWNA PVILATRAIA MPIACGNTVV LKASEQCPGT HRLIATALTE AGLPAGVINV LTNAPEDAPE IVAALIAHPA VKRVNFTGST KVGKIIAETC GRYLKPALLE LGGKAPLVIL DDADIDGAVN AAIFGAFMHQ GQICMSTERI IVDETIADQF VAKLAARASQ LPAGDPRGHV VLGSLISLDA AKKMEELIAD ATAKGAKLVA GGKRSGTVVE ATLLDHVTPE MRVYAEESFG PVKPIIRVTS EEEAIRIAND TEYGLSSAVF SRNVQRAMAV AARIESGICH INGPTVNDEA QMPFGGVKGS GYGRFGGKAA IAEFTDLRWI TVEDSAQHYP F
|
| |