Gene Information |
Locus tag | Mext_4203 |
Symbol | |
ID | 5831984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4674453 |
End bp | 4675325 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369993 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001641643 |
Protein GI | 163853600 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
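The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4674453
END_BP = 4675325
GENE_LENGTH_BP = 873
PROTEIN_LENGTH_AA = 290


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

The strand field does not affect this check, because the span length is the same on either strand.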
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.424888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC CGCTGACCAA ATACCCGCGC CCGCCCTTCG AGACGCCGCC CCAGAGTTTC CCCGGCAAGA CGGCCAAGAT GGCGCCCGAG CCGGATCACG GCGAGGAGAG CTACAAGGGC TCCGGCAAGC TCACCGGAAA AGCCGCTTTG GTGACGGGCG GCGACAGCGG CATCGGCCGG GCGGTGGCGA TCGCCTATGC CCGCGAGGGC GCCGACGTGG CGATCTCCTA CCTGCCCGAC GAGCAGAAGG ATGCGGAGGC CGTCGGCAAC TGGATCGAAA AGGCGGGCCG GCGCGCCCTG CTGCTTCCCG GCGACATCAA GGACGCGGCC TATGCCCGCG AGATTGTCGA GCGCACGGCC AAGGAATTCG GCCGGCTCGA TATCCTAGTG AACAACGCCG CCTTCCAGCA GCCGAACCAG GGCGTCACGG ACATCGACGA CGCGTTGTTC GAAAAACACT TCCAGACCAA CATCTTCGGC CCGTTCTACG CCACCAAGGC CGCGCTGGCG CATCTCAAGC CCGGCGCCTC GGTGATCTTC ACCTCCTCGG TCAATTCCAA GCACCCGGTG CCGACCCTGT TCGCCTACAG CGCCACCAAG GGCGCGCTCA GCAACATGGT GCTCGGTCTC GCGCAGTTGC TCGCCGAAAA GGGCATCCGC GTGAACGGCG TGCTACCGGG CCCGATCTGG ACCCCGTTCA TCCCCGCCGG GATGCGCGAG GATGCGGTGA AGACCTTCGG CAGCCAAGTG CCGTTCAACC GCCCCGGCCA GCCGGCGGAA CTCGCCTCGG CCTACGTGAT GCTGGCGGCC GAAGAGAGCA GCTACACCTC CGGCGCCCTC ATCACCGTCG CGGGCGGTAT GCCGATCTTC TGA
|
Protein sequence | MTDPLTKYPR PPFETPPQSF PGKTAKMAPE PDHGEESYKG SGKLTGKAAL VTGGDSGIGR AVAIAYAREG ADVAISYLPD EQKDAEAVGN WIEKAGRRAL LLPGDIKDAA YAREIVERTA KEFGRLDILV NNAAFQQPNQ GVTDIDDALF EKHFQTNIFG PFYATKAALA HLKPGASVIF TSSVNSKHPV PTLFAYSATK GALSNMVLGL AQLLAEKGIR VNGVLPGPIW TPFIPAGMRE DAVKTFGSQV PFNRPGQPAE LASAYVMLAA EESSYTSGAL ITVAGGMPIF
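The two sequence fields can be related with a short script. The sketch below is illustrative only: it joins the space-separated 10-character blocks, computes GC content, and translates the coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code. All function names are hypothetical.

```python
# Codon table in the conventional TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(field: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(field.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence field above.
    gene_prefix = join_blocks("ATGACCGATC CGCTGACCAA ATACCCGCGC")
    print(translate(gene_prefix))  # MTDPLTKYPR, the first block of the protein
```

Passing the full Gene sequence field through `join_blocks` and `translate` should give a string comparable to the Protein sequence field, and `gc_percent` can be compared with the GC content field.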