Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5505 |
Symbol | |
ID | 6978599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1154916 |
End bp | 1156430 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643394604 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002279422 |
Protein GI | 209547504 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000574808 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGACGA CAGTCAGTTC CATCGAAGGT TCGCGACAGG GCCTGTTCAT CAACGGTGAG TTCGTGGCGC CCAAGGCCGG CAACTACATT GCCAGCTACG ACCCGACGAC CGGCGAGCGC TGGTACGATC TGGCCGAAGC CGACGCCGAT GACGTCTCTG CGGCCGTTGC CGCTGCGAAT GCCGCCTTCC GCAATCCCGC CTGGCGGCGG ATGACGCAGA CCGACCGCGG TGCCCTGGTG CGCACGCTCG CCGAACTTGT CCGCACCAAT GCCGACACGC TTGCCGAAAT CGAAACCCGC GACAACGGCA AGCTTCTCAA GGAAACCCGG GCGCAGATGC GCTCGATGCC GGACAGCTAT CATTATTTCG CCGGGATGGC CGACAAGCTG CAGGGCGATA CGATCCCGAT CAACAGGGCC GATACGCTGA ACATCAACCT GCGCGAAGCG CTCGGCGTCG TCGGCATGAT TACGCCCTGG AATTCGCCGC TGATGCTTTT GACCGGCACG CTGGCGCCGT GCCTGGCGAT CGGCAACACC GTCGTCATCA AGCCATCGGA ACATGCGACC GCCTCGACGC TGGCGCTTGC CGAACTGATC CATGAGGCGG GCTTCCCCGC CGGGGTCGTC AACGTCGTCA CCGGCACCGG CAAGAGCGCC GGTGAGGCGC TGACCCGCCA TCCCGGTGTT TCAAAATATG TCTTTACCGG CAGCACCGCC ACCGGCCGCC GCATCGCCGG AAACGCGGCG CAGAACCTCG TGCCCTGCTC GATGGAGCTC GGCGGAAAGT CGCCGCATGT GATCTTCGGC GATGTCGAGC TCGAGCATGC CGTCAATGGC GTCGTTTCCG GCGTGTTTGC CGCTGCCGGC CAGACCTGCG TGGCCGGTTC ACGCTGCTTC GTCGAGGCCA GCATCTACGA CAAGTTCATC GACGCGCTGA TTGCCCGCAC CGGCCGCATC CGCGTCGGTC TGCCGACGGC AGAGGATACC GATATCGGCC CGCTGGCGCT TTCCGATCAG TTGACGAAGG TCGAGGGCTA TGTGGCGTCC GGCGTCAAGG AAGGCGCGAA GATCGCCGCC GGCGGGCGCC GTCCGCAGAA GGAAGGTCTG TCGCGTGCCG GCTGGTACTT CGAGCCGACG GTGATGGTCG ATGTGCACAA CGACATGGGC TTCATGCGCG ACGAGATTTT CGGCCCGGTC GTCGGCGTCA TGCCGTTCCG CGACGAGGCC GAGATGATCG CGCTCGCTAA CGACAGCCAT TACGGCCTCG CTTCCGGGAT TTGGACCAAG GATATCGACC GCGCCCTGCG CTTCGCCAAC CAGATCGAAG CCGGCACCGT CTGGGTCAAT ACCTATCGCT CGGCCTCCTT CATGTCGGCC AATGGCGGCT TCAAGGAGAG CGGCTACGGC CGGCGCGGCG GCTTCGAGGT GATGCACGAA TTCTCCCGGC TGAAAAACGT CATCATCGAT TATTCCGGGG CGATGCAGGA CCCCTTCGTC ATCCGTCTGA AGTGA
|
Protein sequence | MKTTVSSIEG SRQGLFINGE FVAPKAGNYI ASYDPTTGER WYDLAEADAD DVSAAVAAAN AAFRNPAWRR MTQTDRGALV RTLAELVRTN ADTLAEIETR DNGKLLKETR AQMRSMPDSY HYFAGMADKL QGDTIPINRA DTLNINLREA LGVVGMITPW NSPLMLLTGT LAPCLAIGNT VVIKPSEHAT ASTLALAELI HEAGFPAGVV NVVTGTGKSA GEALTRHPGV SKYVFTGSTA TGRRIAGNAA QNLVPCSMEL GGKSPHVIFG DVELEHAVNG VVSGVFAAAG QTCVAGSRCF VEASIYDKFI DALIARTGRI RVGLPTAEDT DIGPLALSDQ LTKVEGYVAS GVKEGAKIAA GGRRPQKEGL SRAGWYFEPT VMVDVHNDMG FMRDEIFGPV VGVMPFRDEA EMIALANDSH YGLASGIWTK DIDRALRFAN QIEAGTVWVN TYRSASFMSA NGGFKESGYG RRGGFEVMHE FSRLKNVIID YSGAMQDPFV IRLK
|
| |