Gene Information |
Locus tag | Rleg2_4960 |
Symbol | |
ID | 6978054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 604787 |
End bp | 606196 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394112 |
Product | Aldehyde Dehydrogenase |
Protein accession | YP_002278930 |
Protein GI | 209547012 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.613851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCTG CCAAACTTCA TATCACGACG AGCTTGGCTC CGATCACCGT TCACAACCCC TATGACGGGG CAATCCTGGG AACGGTCGAG GCCACGGATG CGAGCGACGT CAATGCCATT CTTGGACGTG CCCGGCGCGG CGCGCAGATT TCGCGCAGCC TGCCGCGGCA TCAGCGTGCG AGCATCCTGG AAAGGGCGGC TAACATCATC GAGAGCCGCC GCGACGCCTT TGCAGAAACC ATCGTTCGCG AAGCCGGAAA GACAATTGTT CAGGCGCGCA AGGAAGTGCT GCGTTGCGTT AATACGATAA AGCTCTCCGC AGAAGAAGCA AAGCGCAATG CGGGCGAAGT CGTGCCGTTC GATGCATATA ACGGCTCTGA ACAACGGCAG GGGTGGTTCA CCCGCGAACC GCTCGGCATC ATCACGGCGA TCACGCCCTA CAACGATCCT CTGAACCTGG TTGCGCACAA GCTTGGCCCG GCCATCGCCG GCGGCAATGC GGTCCTGCTC AAGCCGTCGA ACCTGACGCC TTTCTCTGCC ATCAAGCTGG TGGAAGCACT GCGTGAGGCG GGATTGCCTG AGGAAGTCAT CACGGTCGCG CACGGTGACC GAGAACTGGT CACCGCGATG ATCGCCGCTC GCGAGGTGCG GATGGTGTCA TTTACCGGCG GCTTTGCCAC CGCCGAGGCG ATCAGCCGCG CCGCTGGGCT AAAGAAGCTC GCCATGGAGC TCGGTGGCAA TGCGCCGGTG ATCGTCATGA ACGACTGCGA CTTCGACAAA GCCGTCGAAG GTTGCGTCTC CGGTGCCTTT TGGGCCGCAG GCCAAAACTG CATCGGTGCG CAGCGCATTC TTATCCAGGG GAAGCTTTAC GATCGTTTCC GCGATGCATT CGTCTCAGCG ACACAGAGGC TCAAGGCCGG CGACCCTCTG CAGGAAGATA CCGACGTCGG CCCGATGATC TCCACCCAAG TCGCCGAACG CACCGAATCC GTCGTCAGCG ACGTCATCAA AGCAGGCGCA AAGCTGCTCT GCGGCAATAG TCGCGAAGGA TCCCTCTATC ATCCGACGGT GCTCGAAGGC ACGCCGGTGA CCTGCAAGCT ATGGCATGAG GAAGTGTTCG CACCCGTGGT CATGCTGGCA CCGTTCCACA CGCTCGATCA GGCGATCGAG ATGGCCAACG ATCCGGATTA CAGCCTCCAT GCCGGCATCT ACACCAGCGA CCTCAACGTT GCGCTTGACG CAGCCAACCG CATCGAGGCT GGCGGCGTGA TGATCAATGA CTCCTCTGAC TACCGCTTCG ACGCCATGCC CTTCGGTGGT TTCAAGTACG GCAGCATGGG CCGCGAGGGC GTCCGCTTCG CTTACGAAGA CATGACCCAG CCGAAGGTCG TTTGCATCAA TCGGGGATAA
|
Protein sequence | MTAAKLHITT SLAPITVHNP YDGAILGTVE ATDASDVNAI LGRARRGAQI SRSLPRHQRA SILERAANII ESRRDAFAET IVREAGKTIV QARKEVLRCV NTIKLSAEEA KRNAGEVVPF DAYNGSEQRQ GWFTREPLGI ITAITPYNDP LNLVAHKLGP AIAGGNAVLL KPSNLTPFSA IKLVEALREA GLPEEVITVA HGDRELVTAM IAAREVRMVS FTGGFATAEA ISRAAGLKKL AMELGGNAPV IVMNDCDFDK AVEGCVSGAF WAAGQNCIGA QRILIQGKLY DRFRDAFVSA TQRLKAGDPL QEDTDVGPMI STQVAERTES VVSDVIKAGA KLLCGNSREG SLYHPTVLEG TPVTCKLWHE EVFAPVVMLA PFHTLDQAIE MANDPDYSLH AGIYTSDLNV ALDAANRIEA GGVMINDSSD YRFDAMPFGG FKYGSMGREG VRFAYEDMTQ PKVVCINRG
|
| |
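The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration rather than part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TAA). The helper names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical and introduced here only for illustration.

```python
# Values copied from the record above.
start_bp = 604787
end_bp = 606196
gene_length_bp = 1410
protein_length_aa = 469


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(sequence):
    """Fraction of G and C bases, ignoring the spaces between 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Consistency checks between the record's own fields.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field; the function strips the block spacing itself, so the sequence can be pasted in as displayed.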