Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3801 |
Symbol | |
ID | 8014626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3859630 |
End bp | 3861138 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644826364 |
Product | Aldehyde dehydrogenase (NAD(+)) |
Protein accession | YP_002977583 |
Protein GI | 241206487 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.804674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCATC AGAAAATCGT CGAGTCGCCG TTCAAGCTGA AATACGGCAA CTATATCGGC GGCGAATGGC GCGAGCCGGT CGAGGGAAAA TACTTCGAAA ACCTCACGCC CGTCACCGGC GGCAAGCTCT GCGACATTCC CCGCTCCAAT GAAAAGGACA TTAACCTCGC ACTCGACGCC GCTCATGCGG CAAAGGAAAA ATGGGGTCGC ACCTCGGTTG CCGAGCGCTC CAATATCCTC ATGAAGATCG CCCAGCGCAT GGAAGACAAG CTTGAATTGC TCGCCCAGGC TGAGACCTGG GACAATGGCA AGCCGATCCG TGAAACCATG GCGGCCGACA TTCCGCTGGC GATCGACCAT TTCCGTTATT TCGCCTCCTG CATTCGCGCC CAGGAAGGTT CGATCGGCGA GATTGACCAC GACACTGTCG CCTATCACTT CCATGAGCCG CTCGGCGTCG TCGGCCAGAT CATTCCGTGG AACTTCCCGA TCCTGATGGC CACCTGGAAG CTGGCGCCCG CACTTGCCGC CGGCAATTGC GTCGTACTGA AACCCGCCGA GCAGACCCCG GCCTCGATCC TGGTCTGGGC TGAACTCGTC GGCGATCTCC TGCCGGCAGG GGTCCTCAAC ATCGTCAACG GTTTCGGCCT CGAAGCCGGC AAGCCGCTGG CGACCAGCCC GCGCGTCGCT AAGATCGCCT TCACCGGCGA GACGACGACA GGCCGGCTTA TCATGCAATA TGCCAGCCAG AACCTCATTC CGGTGACGCT GGAGCTCGGC GGCAAATCGC CGAACATCTT CTTCGCCGAC GTGATGGCGG AAGACGACGA CTTCCTCGAC AAGGCGTTCG AAGGCTTTGC GATGTTTGCC TTGAACCAGG GCGAAGTCTG CACCTGCCCG AGCCGCGCCC TCGTCCAGGA ATCGATCTAC GACCGTTTCA TGGAAAAGGC CGTCAAACGC GTTGAGGCGA TCAAGCAGGG CAACCCGCTC GATAGCGCAA CGATGATCGG CGCCCAGGCC TCGACCGAGC AGCTGGAAAA GATCCTCGCC TATCTCGACA TCGGCAAGCA GGAAGGCGCG GAAGTGCTGA CCGGCGGCTC GCGCAACGAT CTCGGCGGCG AGCTGGCGAA CGGCTACTAT GTCAAGCCGA CGATCTTCAA GGGTCACAAC AAGATGCGCG TGTTCCAGGA GGAAATCTTC GGGCCGGTGG TTTCGGTGAC GACCTTCAAG AACGAGAAGG AAGCGCTCGA AATCGCTAAC GACACGCTCT ACGGCCTCGG CGCCGGCGTC TGGAGCCGCG ATGCCAATCG CTGCTACCGT TTCGGCCGCG AGATCCAGGC CGGCCGCGTC TGGACCAACT GCTACCACGC CTACCCGGCC CATGCCGCCT TCGGCGGCTA CAAGCAGTCG GGCATCGGCC GTGAAACCCA TAAGATGATG CTCGACCACT ACCAGCAGAC CAAGAACATG CTGGTGAGCT ACAGCCCGAA GGCACTCGGC TTCTTCTGA
|
Protein sequence | MLHQKIVESP FKLKYGNYIG GEWREPVEGK YFENLTPVTG GKLCDIPRSN EKDINLALDA AHAAKEKWGR TSVAERSNIL MKIAQRMEDK LELLAQAETW DNGKPIRETM AADIPLAIDH FRYFASCIRA QEGSIGEIDH DTVAYHFHEP LGVVGQIIPW NFPILMATWK LAPALAAGNC VVLKPAEQTP ASILVWAELV GDLLPAGVLN IVNGFGLEAG KPLATSPRVA KIAFTGETTT GRLIMQYASQ NLIPVTLELG GKSPNIFFAD VMAEDDDFLD KAFEGFAMFA LNQGEVCTCP SRALVQESIY DRFMEKAVKR VEAIKQGNPL DSATMIGAQA STEQLEKILA YLDIGKQEGA EVLTGGSRND LGGELANGYY VKPTIFKGHN KMRVFQEEIF GPVVSVTTFK NEKEALEIAN DTLYGLGAGV WSRDANRCYR FGREIQAGRV WTNCYHAYPA HAAFGGYKQS GIGRETHKMM LDHYQQTKNM LVSYSPKALG FF
|
| |