Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3784 |
Symbol | |
ID | 6982547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3910636 |
End bp | 3912174 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398506 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002283272 |
Protein GI | 209551355 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.888405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATCG CCGTCCTCGA TCTCGCCACC GAAACCGCCA AGCTGCTTGC CGAACTCGGC GTTGACGCCG GCCGCTATCA CGGCGGCACG CTTTCCGTTG CCTCGCCGGT GACGGGCAAG GAAATCGGCA AACTCAGGGA GAACACCGTC TCCGAGACCA AAGCGGCGAT CGAAGCGGCG CACAAGGCCT TCCTCGAATG GCGTGACGTC CCCGCGCCGA AGCGCGGCGA ACTGGTCCGC CTGCTCGGCG AGGAGCTGCG CGCCGCGAAG ACGGCGCTCG GCCGTCTCGT GTCGATCGAG GTCGGCAAGA TCACCTCGGA AGGTCTCGGT GAAGTGCAGG AGATGATCGA TATCTGCGAT TTCGCCGTCG GCCTTTCCCG CCAGCTCTAC GGCCTGACGA TCGCCACCGA GCGCTCCGAA CACCGGATGA TGGAAAGCTG GCATCCGCTC GGCGTGGTCG GCATCATCTC CGCCTTCAAC TTCCCGGTTG CCGTCTGGTC GTGGAATGCG GCACTTGCCA TGGTCTGCGG CAATTCCACC GTCTGGAAGC CTTCGGAAAA GACACCTTTG ACCGCACTTG CCGTGCAGGC GCTGTTCGAA AAGGCGCTGA AGCGTTTCAT CGCCGAGGGC GGCGCGGCGC CGGCCAATCT GTCGACATTG ATCATCGGCG GCCGCGAGGT CGGCGAAGTG CTCGTCGACC ATCCGAAGAT CCCGCTCGTC TCCGCCACCG GCTCGACCGC CATGGGCCGT GCCGTCGGTC CGCGCCTGTC GCAGCGTTTT GCCCGCGGCA TTCTCGAACT CGGCGGCAAC AATGCGGCGA TCGTCTGCCC GAGCGCCGAT CTCGACCTGA CGCTGCGCGG CGTTGCCTTC TCCGCCATGG GCACGGCCGG CCAGCGCTGC ACGACGCTGC GCCGTCTGTT CGTGCATGAA AGTGTCTATG ACCAGCTGGT GCCGCGCCTG CAGAAGGCCT ATGGCTCCGT CACCATCGGC AATCCGCTCG AAACAGGCAC GCTTGTCGGA CCGCTGATCG ACGGCCAGGC TTTCGAGAAG ATGCAGGCAG CACTCAGCCA GGCGGCATCG GCCGGCGGCA AGGTGACGGG CGGCGATCGC GTCGGCAACG GTTTGACCGA TGCCTTCTAT GTTCGCCCGG CGCTTGTCGA AATGCCGGCG CAGACCGGCC CGGTCGAGCA CGAGACCTTC GCGCCGATCC TCTATGTGAT GAAATACAGC GATTTCGATG CGGTACTCGC TCTGCACAAT GCCGTGCCGC AGGGGCTGTC GTCGTCGATC TTCACCAACG ACATGCGCGA GGCCGAAACC TTCGTTTCGG CGCGCGGATC GGATTGCGGC ATTGCCAACG TCAATCTCGG CCCATCGGGT GCGGAAATCG GCGGTGCCTT CGGTGGCGAG AAGGAAACCG GCGGCGGCCG CGAATCCGGC TCGGATGCCT GGAAAGCCTA TATGCGCCGC TCCACCAACA CGATCAATTA CGGCAGAACG CTGCCGCTCG CCCAGGGCGT CAAGTTCGAC GTCGAATAA
|
Protein sequence | MTIAVLDLAT ETAKLLAELG VDAGRYHGGT LSVASPVTGK EIGKLRENTV SETKAAIEAA HKAFLEWRDV PAPKRGELVR LLGEELRAAK TALGRLVSIE VGKITSEGLG EVQEMIDICD FAVGLSRQLY GLTIATERSE HRMMESWHPL GVVGIISAFN FPVAVWSWNA ALAMVCGNST VWKPSEKTPL TALAVQALFE KALKRFIAEG GAAPANLSTL IIGGREVGEV LVDHPKIPLV SATGSTAMGR AVGPRLSQRF ARGILELGGN NAAIVCPSAD LDLTLRGVAF SAMGTAGQRC TTLRRLFVHE SVYDQLVPRL QKAYGSVTIG NPLETGTLVG PLIDGQAFEK MQAALSQAAS AGGKVTGGDR VGNGLTDAFY VRPALVEMPA QTGPVEHETF APILYVMKYS DFDAVLALHN AVPQGLSSSI FTNDMREAET FVSARGSDCG IANVNLGPSG AEIGGAFGGE KETGGGRESG SDAWKAYMRR STNTINYGRT LPLAQGVKFD VE
|
| |