Gene Information |
Locus tag | Rleg2_4509 |
Symbol | |
ID | 6977603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 147796 |
End bp | 148827 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393687 |
Product | Alcohol dehydrogenase GroES domain protein |
Protein accession | YP_002278505 |
Protein GI | 209546587 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCA TCGTCATTCA TGCCGCCAAG GATCTCAGGA TCGAGGAGCG CGAGCCGGAG GTTGCGGGAG AGGGGCAGGT GGAAATCGCC ATCGAAGCCG GCGGCATCTG CGGCTCCGAC CTGCATTATT ACAATCATGG CGGTTTCGGC ACTGTTCGCC TGCGCGAGCC GATGATCCTC GGCCACGAGA TCGCCGGCAC AGTCAAGGCG CTCGGTGCCG GTGTGAGCGA TCTTGCCGTC GGCGACCGCG TGGCGGTGTC TCCGAGCCGG CCGTGCAATC ATTGCCAATA TTGCCTGAAG GGACAGCAAA ACCACTGCCT CAACATGCGG TTTTACGGCA GTGCCATGCC GATGCCGCAT ATTCATGGCG GCTTTCGCCA GCGGCTGGTG GCGGAGGGCT GGCAATGCCA CAAGGTGGCC GACGGCATCT CCATTCACGA GGCGGCCTTC GCCGAACCCT TTGCCGTGAC GCTGCATGCG GCCAGCCGCG CCGGATCGCT GCTGGGCAAA CGGGTGCTCG TCACCGGCTG CGGCCCGATC GGCATGCTGG CGATCGTTGC CGCGCGTGTT CTCGGCGCCC GCGAAATCGT CGCGACCGAT GTGACCGACA GCGTTCTGGC AATCGCCCGC ACCAGCGGCG CGGATCGGAC GATCAATGTT GCCACGCATG CCGCTGACCT CGCCGCTTAC GGCGCCGACA AGGGATATTT CGACGTCATG TTCGAGGCGT CGGGCAATGA GCGGGCGGTG CGCGCCGGCC TGGAGGCGCT GAAGCCCCGC GCCGTTCTCG TGCAGCTCGG CCTTGGCGGC GATGTCTCCA TTCCGCAGAA CATGATCGTC GCCAAGGAAA TCGAGATGCG CGGGACATTC CGCTTTCACG AGGAATTTGC CCTTGCCGTC GAACTGATCA ACGCGCGCCG GGTCAATCTG AAACCGCTGC TGACAGGTGT TTTTGCCATC GAAGAGGCCG TCGCCGCCTT CGAATTGGCC AGCGACCGCA GCAAGTCGAT GAAGGTGCAG ATCGCTTTCT GA
|
Protein sequence | MKAIVIHAAK DLRIEEREPE VAGEGQVEIA IEAGGICGSD LHYYNHGGFG TVRLREPMIL GHEIAGTVKA LGAGVSDLAV GDRVAVSPSR PCNHCQYCLK GQQNHCLNMR FYGSAMPMPH IHGGFRQRLV AEGWQCHKVA DGISIHEAAF AEPFAVTLHA ASRAGSLLGK RVLVTGCGPI GMLAIVAARV LGAREIVATD VTDSVLAIAR TSGADRTINV ATHAADLAAY GADKGYFDVM FEASGNERAV RAGLEALKPR AVLVQLGLGG DVSIPQNMIV AKEIEMRGTF RFHEEFALAV ELINARRVNL KPLLTGVFAI EEAVAAFELA SDRSKSMKVQ IAF
|
| |
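The length fields in this record are related by simple arithmetic: over an inclusive coordinate range, gene length is End bp - Start bp + 1, and a CDS of that length encodes (length / 3) - 1 residues once the stop codon is excluded. The Python sketch below checks those relations and translates a stretch of the gene sequence. The function names are illustrative and not part of any database tool. It assumes the listed gene sequence is already written 5' to 3' in coding orientation (the record reports strand "-", and the sequence begins with ATG), and it uses the amino acid assignments of translation table 11, which are the same as those of the standard code.

```python
# Minimal sketch: consistency checks for the coordinate, length, and
# sequence fields of a CDS record. All names here are hypothetical helpers.

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in directly.
    """
    clean = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(147796, 148827)
    print(length, protein_length(length))
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGAAAGCCA TCGTCATTCA TGCCGCCAAG"))
```

With the values from this record, `gene_length(147796, 148827)` gives 1032 and `protein_length(1032)` gives 343, matching the Gene Length and Protein Length fields. The first thirty bases translate to MKAIVIHAAK, the first block of the listed protein sequence.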