Gene Information |
Locus tag | Mext_4628 |
Symbol | |
ID | 5833804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5176985 |
End bp | 5177965 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641370422 |
Product | aldo/keto reductase |
Protein accession | YP_001642067 |
Protein GI | 163854024 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
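The coordinate, length, and protein-length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(5176985, 5177965)
    print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 981 bp and 326 aa, which agrees with the Gene Length and Protein Length fields.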
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0836973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.048745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAA AAAAACTCGG CCGCACCGGC CTCGACATCT CGCCCCTCTG CCTCGGCTGC ATGACCTACG GCGTGCCGGA GCGGGGACCG CATCCGTGGA CCCTGCGGGA GGAGGAGAGC CGGCCGCTGA TCAAGCGGGC GCTCGATCTC GGGATCAACT TCTTCGACAC GGCGAACTAC TATTCGGACG GCACGTCCGA GGAGATCGTC GGCCGCGCTC TCAAGGACTA CGCCGACCGC GACAGCATCG TACTGGCCAC CAAGGTCTAC TACCCGCAGA AGAACCAGCC CAATGCCGGC GGCCTCTCGC GCAAGGCGAT CTTTTCCGCG ATCGACGCTT CACTGAAGCG GCTCGGCACC GATTACGTCG ATCTCTACCA GATCCATCGC TGGGACTACG CAACGCCGAT CGAGGTGACG CTGGAGGCGC TGCACGACGT CGTGAAGGCG GGCAAGGCCC GTTACATCGG TGCCTCGTCG ATGTTCGCGT GGCAGTTCGC CAAGGCGCTC TACACCTCTG ACCTGAACGG CTGGACGCGG TTCGCGACGA TGCAGAACCA CCTCAATCTG CTGCACCGCG AGGAGGAGCG GGAGATGATC CCGCTCTGCG CCGACCAGGG CATCCCGCTC CTGCCCTGGA GCCCGCTCGC CCGCGGCCGG CTCACCCGCG ACTTCGACGC GGGCAGCGCC CGGCAAGAGA GCGATCTCGT GGGCAAGAAC CTCTATGACG CCACGGTCGA GGCCGATCGG CAGGTGGTCG AGGCGGTGGC CGACGTCGCG CGCGACCGCG GCGTGCCCCG GGCGCAGGTG GCGCTCGCCT GGGTGATCCA GAAGCGGGGC GTGGCCGCCC CGATCATCGG CGCCTCGAAG CCGGGGCACC TCGACGACGC GGCGGCGGCC CTCGACCTCG AATTGATGCC GGACGAGATC GCCCGGCTCG AAGCCCCTTA CGTTCCGCAC GCCGTGGTCG GGTTCCAATG A
|
Protein sequence | MQTKKLGRTG LDISPLCLGC MTYGVPERGP HPWTLREEES RPLIKRALDL GINFFDTANY YSDGTSEEIV GRALKDYADR DSIVLATKVY YPQKNQPNAG GLSRKAIFSA IDASLKRLGT DYVDLYQIHR WDYATPIEVT LEALHDVVKA GKARYIGASS MFAWQFAKAL YTSDLNGWTR FATMQNHLNL LHREEEREMI PLCADQGIPL LPWSPLARGR LTRDFDAGSA RQESDLVGKN LYDATVEADR QVVEAVADVA RDRGVPRAQV ALAWVIQKRG VAAPIIGASK PGHLDDAAAA LDLELMPDEI ARLEAPYVPH AVVGFQ
|
| |
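The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the snippet below strips that spacing, computes a GC fraction, and translates a coding sequence with the codon assignments of translation table 11 (which match the standard code for amino acids and stops). It assumes the gene sequence is already given in coding orientation, as the leading ATG suggests even though the strand is "-". The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# of translation table 11 are the same as those of the standard code.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
_CODONS = [a + b + c for a in _BASES for b in _BASES for c in _BASES]
CODON_TABLE = dict(zip(_CODONS, _AMINO))


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    fragment = "ATGCAGACAA AAAAACTCGG"  # first two blocks of the gene sequence
    print(translate(fragment))
    print(round(gc_content(fragment), 2))
```

Applied to the full Gene sequence field, `translate` can be compared against the Protein sequence field after cleaning both with `clean_sequence`, and `gc_content` against the reported GC content.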