Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5659 |
Symbol | |
ID | 5319961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 624730 |
End bp | 625791 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640777393 |
Product | aldo/keto reductase |
Protein accession | YP_001314325 |
Protein GI | 150377730 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.184034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.516333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGAGT ATCGCTATCT TGGGCGCAGC GCCCTTAAAG TGTCGCCGCT GACACTGGGG TCCATGATGT TCGGCAACCA AACGCCGGAC GACGTCGCCT TCCGCATCAT CGACAAGGCG CGCGAACAGG GCATCAACTT CATCGACACG GCGGATGTCT ACCACGACGG CAAGTCGGAA GAGGTGGTCG GGCGCGGTAT CAAGTCCAAC CGCGACCACT GGGTGGTGGC GACGAAGTTC GTCAATTCGC GTGCGAAGGG ACCGAACCTA GGCGGCTATT CGCGCAAGTG GGTGTATCAG ACGATCGAGA ACTCGCTGCG CAATCTCGGC ACCGACTATA TCGACATCCT CTATTTCCAT CGCGCCGTCT TCGATGCGCC GCTCGAAGAG CCGGTACGGG CGATCGCCGA CCTGATCAAG GCCGGCAAAC TGCGCTACTT CGGGGTCTCG AATTTCCGGG GATGGCGCAT TGCGGAAGTC GCCCATCTCG CCGACCGGCT CGGCATTGAC CGGCCGATCG CCAGCCAGCC GCTCTATAAC ATCGTCAACC GCACCGCCGA AGCGGAGCAG TTGCCGGCAG CTGCCGCCTA TGGACTTGGT GCTGTGTCCT ACAGCCCGCT CGCCCGCGGC GTTCTGACGG GCAAGTATAA TCCCGGCGAA GCCCCAGCCG CCGATACCCG CGCCGGCCGT GGCGACAAGC GCATGCATGA CGTCGAGTTT CGCGAAGAAT CGATCGCGAT CGCTAAGCAG ATTGCCGCGC ACGCTCAAGC CAAGGGCATT GCTTCCGCCG ATTTCGCGCT CGCCTGGGTG CTGAACAACC GGCTGATCAC CTCGACAATC GCAGGACCAC GCACGGAAGA GCATTGGGAC GCCTATATCC GCGCCCTCGA TGTCAAGCTT GATGCGGAAG ACGAGGCTTT GGTCGATCGG CTGGTCGCGC CCGGCCATCC GTCGACTCCG GGCTTCACCG ATCCAGGCCA CCCGCTCGAG GGCCGCGAAC CGCATTTCGG AGCGGCGGAA GCCGAGATCA TTCCGCTTGC CCGTTCGCAG CGCGTCGCCT GA
|
Protein sequence | MVEYRYLGRS ALKVSPLTLG SMMFGNQTPD DVAFRIIDKA REQGINFIDT ADVYHDGKSE EVVGRGIKSN RDHWVVATKF VNSRAKGPNL GGYSRKWVYQ TIENSLRNLG TDYIDILYFH RAVFDAPLEE PVRAIADLIK AGKLRYFGVS NFRGWRIAEV AHLADRLGID RPIASQPLYN IVNRTAEAEQ LPAAAAYGLG AVSYSPLARG VLTGKYNPGE APAADTRAGR GDKRMHDVEF REESIAIAKQ IAAHAQAKGI ASADFALAWV LNNRLITSTI AGPRTEEHWD AYIRALDVKL DAEDEALVDR LVAPGHPSTP GFTDPGHPLE GREPHFGAAE AEIIPLARSQ RVA
|
| |