Gene Mext_3859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3859 
Symbol 
ID5832769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4286763 
End bp4287692 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content71% 
IMG OID641369649 
Productaldo/keto reductase 
Protein accessionYP_001641302 
Protein GI163853259 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.665179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.078293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCG TCCCGCCGCT TTCCCGCCGA ACCCTGTTGC AGGGCGCGGC CCTCACCGCC 
GGAGCTGGCT TCGCGCCGGC CCTGCTGCGT CCGGCCCGCG CCGCCGACGC CCTGATCACC
CGGCCGATCC CCTCCTCCGG TGAGCGGTTG CCGGCCATGG GCATCGGCAC CTCGCGCCGC
TACGAGGTGG CGCCGACGCC CGAGGCGACC GCGCCCTTGC GCGAGGCGGT CGAGCGCTTC
GTCGCGCTCG GCGGCCGGGT GATCGACACC GCCCCGAGCT ACGGCACCGC CGAGGACGTC
CTCGGCCTGA TCCTTGAGGG TCTGCGCGAA AAGATCTTTC TCGCGACCAA GGTCGCGGCC
CGCGGCCGGG AGGCGGCGCA GGCCGAGACC GAGCGCTCGT TCCAGCGCCT GCGCACGGAC
AAGATCGATC TCATCGCCGT GCACAACCTG ATCGACACCG AGACCAACCT CGCGGTCCTG
CGCGCGCTCA AGGAGAAGGG CCGCATCCGC TATGTCGGCG TCACGGTCTG GCGCGACGAG
CAGTTCCCCG AGCTCGAGAC GGTGATGAAG CGGGAAAAGC TCGACTTCGT GCAGGTGAAC
TACGCCCTCG ACAGCCGGGC GGCGGCCGAG CGCGTCCTGC CGCTGGCCGT CGAGCGCGGC
GTGGCGGTGA TGGTCAACGT GCCCTTCGGC CGCGACCGCC TGTTCAAGGC GGTGAAGGAC
AAGCCCCTGC CCGCGTTCGC CGCCGAATTC GGCTGCAAGA GCTGGGCGCA GTTCTTCCTC
AAATACGTCC TCGCCAACGA GGCCGTGACT TGCCCGATCC CGGGCATGGC CAAGGCGAGT
TACGTCGAGG ACAACCTCGC CGCCGCGACC GGGCGGCTGC CGGACGCCGC CGAGCGGCGC
AAGATGGAGG CCTTCATCGA TGCGGTCTGA
 
Protein sequence
MPPVPPLSRR TLLQGAALTA GAGFAPALLR PARAADALIT RPIPSSGERL PAMGIGTSRR 
YEVAPTPEAT APLREAVERF VALGGRVIDT APSYGTAEDV LGLILEGLRE KIFLATKVAA
RGREAAQAET ERSFQRLRTD KIDLIAVHNL IDTETNLAVL RALKEKGRIR YVGVTVWRDE
QFPELETVMK REKLDFVQVN YALDSRAAAE RVLPLAVERG VAVMVNVPFG RDRLFKAVKD
KPLPAFAAEF GCKSWAQFFL KYVLANEAVT CPIPGMAKAS YVEDNLAAAT GRLPDAAERR
KMEAFIDAV