Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3859 |
Symbol | |
ID | 5832769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4286763 |
End bp | 4287692 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641369649 |
Product | aldo/keto reductase |
Protein accession | YP_001641302 |
Protein GI | 163853259 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.665179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.078293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCCG TCCCGCCGCT TTCCCGCCGA ACCCTGTTGC AGGGCGCGGC CCTCACCGCC GGAGCTGGCT TCGCGCCGGC CCTGCTGCGT CCGGCCCGCG CCGCCGACGC CCTGATCACC CGGCCGATCC CCTCCTCCGG TGAGCGGTTG CCGGCCATGG GCATCGGCAC CTCGCGCCGC TACGAGGTGG CGCCGACGCC CGAGGCGACC GCGCCCTTGC GCGAGGCGGT CGAGCGCTTC GTCGCGCTCG GCGGCCGGGT GATCGACACC GCCCCGAGCT ACGGCACCGC CGAGGACGTC CTCGGCCTGA TCCTTGAGGG TCTGCGCGAA AAGATCTTTC TCGCGACCAA GGTCGCGGCC CGCGGCCGGG AGGCGGCGCA GGCCGAGACC GAGCGCTCGT TCCAGCGCCT GCGCACGGAC AAGATCGATC TCATCGCCGT GCACAACCTG ATCGACACCG AGACCAACCT CGCGGTCCTG CGCGCGCTCA AGGAGAAGGG CCGCATCCGC TATGTCGGCG TCACGGTCTG GCGCGACGAG CAGTTCCCCG AGCTCGAGAC GGTGATGAAG CGGGAAAAGC TCGACTTCGT GCAGGTGAAC TACGCCCTCG ACAGCCGGGC GGCGGCCGAG CGCGTCCTGC CGCTGGCCGT CGAGCGCGGC GTGGCGGTGA TGGTCAACGT GCCCTTCGGC CGCGACCGCC TGTTCAAGGC GGTGAAGGAC AAGCCCCTGC CCGCGTTCGC CGCCGAATTC GGCTGCAAGA GCTGGGCGCA GTTCTTCCTC AAATACGTCC TCGCCAACGA GGCCGTGACT TGCCCGATCC CGGGCATGGC CAAGGCGAGT TACGTCGAGG ACAACCTCGC CGCCGCGACC GGGCGGCTGC CGGACGCCGC CGAGCGGCGC AAGATGGAGG CCTTCATCGA TGCGGTCTGA
|
Protein sequence | MPPVPPLSRR TLLQGAALTA GAGFAPALLR PARAADALIT RPIPSSGERL PAMGIGTSRR YEVAPTPEAT APLREAVERF VALGGRVIDT APSYGTAEDV LGLILEGLRE KIFLATKVAA RGREAAQAET ERSFQRLRTD KIDLIAVHNL IDTETNLAVL RALKEKGRIR YVGVTVWRDE QFPELETVMK REKLDFVQVN YALDSRAAAE RVLPLAVERG VAVMVNVPFG RDRLFKAVKD KPLPAFAAEF GCKSWAQFFL KYVLANEAVT CPIPGMAKAS YVEDNLAAAT GRLPDAAERR KMEAFIDAV
|
| |