Gene Information |
Locus tag | M446_3802 |
Symbol | |
ID | 6134749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 4233939 |
End bp | 4235099 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641643970 |
Product | aldo/keto reductase |
Protein accession | YP_001770614 |
Protein GI | 170741959 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
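The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; the record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4233939, 4235099)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Under those assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields of this record.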
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0192807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACGTC GCAGTTTCCT GCAGGGCGCG GTGCTCGGGG CGGGCCTCGC GGCCGGCGGC TCGGCCGGCG CCGCGTCCCC GGCGGGGCCC GGCCCGGCGC TCCCGTCCGC GGGGCCGCTG CCGCCCGCGG CGGCCGCGCT CGCCGGGGCG AACCGGCCCG AGGATCCGGC GGCGCTGCCC TTCGTGACCG ATCCGGGGGA GAGGCGGGGC GAGATGCTCT ACCGGCCCCT CGGCCGCACC GGGGTCACCG TCTCGGCGAT CGGCATGGGC GGGTTCCACC TCGGCAAGAA GGCGCTCAGC GATGCCGAGG CGACGCGGCT GATCCATCAG GGCGTCGACC GCGGCATCAC CTTCATGGAC AATTGCTGGG ACTACAACGA GGGCAGGTCC GAGGAGCGGA TGGGCGCGGC GCTCGCCGAG GGCGGTTACC GGGCCAAGGT CTTCCTGATG TCGAAGATGG ACGGCCGGAC CAAGAAGGAG GCGGCCGCCC AGATCGACAC CTCGCTCAAG CGCCTGCGCA CGGACCGCAT CGACCTCGTC CAGCACCACG AGATCCTGCG CTACGACGAT CCCGACCGGG TCTTCGCCGA GGGCGGGGCC ATGGAGGCCT TCATCGAGGC GCGCCAGGCC GGCAAGCTGC GCTTCATCGG CTTCACGGGC CACAAGGACC CGCGCATCCA CCTGCAGATG CTGGAGGTCG CGGCCGAGCG GGGCTTCCGC TTCGACACCG TGCAGATGCC CCTCAACGTG CTCGACGCGC AGTTCCGCAG CTTCGCGCAC CTCGTGCTGC CCTCCCTGGT GGCGCAGGGG ATCGGCGTGC TCGGGATGAA GACCTTCGGC GACGGGGTCA TCCTCAAGAG CAACGCCCCG ATCCGGCCGA TCGAGTACCT CCACTTCAAC CTCAACCTGC CGACCTCCGT GGTGATCACC GGCATCCAGA GCCAGCGCGA CCTCGACCAG GCCTTCGAGG CGGTGAAGAG CTTCCGGCCG ATGGACAAGG CGGCGGTGGC GGAACTGCTC GCCCGCGCTC GACCCTACGC GCTCGAGGGC AAGTACGAGC TGTTCAAGAC GAGTTCGACC TTCGACGGCA CCGCCAAGAA CGCCGCCTGG CTCGGCGGCG AGGCCGAGGG CGTGCAATCC CTCGCCCCGA CCATGGAATA G
|
Protein sequence | MERRSFLQGA VLGAGLAAGG SAGAASPAGP GPALPSAGPL PPAAAALAGA NRPEDPAALP FVTDPGERRG EMLYRPLGRT GVTVSAIGMG GFHLGKKALS DAEATRLIHQ GVDRGITFMD NCWDYNEGRS EERMGAALAE GGYRAKVFLM SKMDGRTKKE AAAQIDTSLK RLRTDRIDLV QHHEILRYDD PDRVFAEGGA MEAFIEARQA GKLRFIGFTG HKDPRIHLQM LEVAAERGFR FDTVQMPLNV LDAQFRSFAH LVLPSLVAQG IGVLGMKTFG DGVILKSNAP IRPIEYLHFN LNLPTSVVIT GIQSQRDLDQ AFEAVKSFRP MDKAAVAELL ARARPYALEG KYELFKTSST FDGTAKNAAW LGGEAEGVQS LAPTME
|
| |
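The sequence fields are stored in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene for comparison with the Protein sequence field. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; table 11 also allows alternative start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). Only the first three blocks of the Gene sequence field are used as input; the full field can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(field: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(field.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
fragment = clean("ATGGAACGTC GCAGTTTCCT GCAGGGCGCG")
print(translate(fragment))  # MERRSFLQGA
print(round(gc_content(fragment), 2))
```

The translated fragment matches the first block of the Protein sequence field. Running `gc_content` on the full cleaned gene sequence gives the value to compare against the GC content field.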