Gene Mext_3852 details

Gene Information

Locus tag: Mext_3852
Symbol:
ID: 5832607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4277374
End bp: 4278543
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 68%
IMG OID: 641369642
Product: aldo/keto reductase
Protein accession: YP_001641295
Protein GI: 163853252
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
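
The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 4277374
end_bp = 4278543

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1170
print(protein_length)  # 389
```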


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.697779
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.307754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCGG CGCCATTCGC GCCTTGCAGG GTGCCCGCCC GGTCCGCCGC CTTGCCGGGG 
CGCGCCGAGC TCCTCAAGAC GGGGACAGGC GGCCTCTCAG GGCCGTTTCC CGACAGCCAC
AGCGGAGTGA CGGACATGGA ATACCGGCAG TTCGGACGCT CGGGCCTCAA GGTACCGGTG
CTCAGCCTCG GCACCGCCAC CTTCGGCGGC ACCGACGCGT TCTTCCAGAA CTGGGGCAGC
ACCGGCGTGG CCGAGGCGAG CCGCCTGATC GATCTCTGCC TCGATGCGGG CGTCAACTTC
CTCGACACCG CCGACCTCTA TTCGAGCGGC GATTCCGAAA AAATCCTCGG CGAGGCGATC
AAGGGCCGCC GCGACCGGCT CCTGATCTCC ACCAAGGCGA CCTTCCGGGT CGGCGAGGAT
GTGAACGCCG TCGGCTCGTC GCGCCACCAC CTGATCCGCG CCTGCGAGGC GAGCCTGAAG
CGGCTCCAGA CCGATCATAT CGACGTCTAC TTCATGCACG GCTTCGACGC GCTGACCCCC
GTCGAGGAGA CCCTGCGGGC GCTCGACGAT CTCACCCGTT CGGGCAAGAT CGGCTATATC
GGCGCCTCGA ACTTCTCCGG CTGGCAATTG ATGAAGGCGC TGGCGACCTC GGAGAAGGAG
GGGCTCGCCC GCTACGTCGT CTATCAGGGC TATTACTCGC TGATCGGCCG CCATTACGAG
TGGGAGCTGA TGCCGCTCGG CCTCGACCAG GGCGTCGGCC TGATGGTGTG GAGTCCGCTG
GGCTGGGGCC GGCTCACCGG CAAGATCCGG CGGGGGCAGG CGGCCTCGGG CGGGCGGCTG
TCCACGGCGA GCGGCGCGGA GGGCGGTCCC ACCGTCTCCG ACGACTACCT CTACGATGTT
GTCGATGCCC TCGACGCGGT CGCCGCCGAG ACCGGCAAGA CGGTCCCGCA GGTGGCGCTG
AACTGGCTGC TCAGCCGCCC GACCGTGTGC AACATCGTCA TCGGCGCTCG CAACGAGGAG
CAGTTGAAGC AGAATCTCGG TGCGGTGGGC TGGTCGCTCA CGCCCGACCA GATCGCCCGC
CTCGATGCGG CGAGCCGGGA AAATCCGATC TATCCCTACT GGCATCAGAT CGGCTTCGAC
GAGCGCAACC CCAAGCCGAC GGCTTGGTGA
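
The GC content figure reported for this gene can be recomputed from the gene sequence. The sketch below assumes GC content is simply the percentage of G and C bases over the full sequence; the function name `gc_content` is hypothetical and not part of the source database. It is shown here on the first two 10-base blocks only.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above.
first_blocks = "ATGGCAGCGG CGCCATTCGC"
print(gc_content(first_blocks))  # 70.0
```

Passing the complete gene sequence to the same function gives the value for the whole gene.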
 
Protein sequence
MAAAPFAPCR VPARSAALPG RAELLKTGTG GLSGPFPDSH SGVTDMEYRQ FGRSGLKVPV 
LSLGTATFGG TDAFFQNWGS TGVAEASRLI DLCLDAGVNF LDTADLYSSG DSEKILGEAI
KGRRDRLLIS TKATFRVGED VNAVGSSRHH LIRACEASLK RLQTDHIDVY FMHGFDALTP
VEETLRALDD LTRSGKIGYI GASNFSGWQL MKALATSEKE GLARYVVYQG YYSLIGRHYE
WELMPLGLDQ GVGLMVWSPL GWGRLTGKIR RGQAASGGRL STASGAEGGP TVSDDYLYDV
VDALDAVAAE TGKTVPQVAL NWLLSRPTVC NIVIGARNEE QLKQNLGAVG WSLTPDQIAR
LDAASRENPI YPYWHQIGFD ERNPKPTAW
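
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation and starts in frame; the names `CODON_TABLE` and `translate` are hypothetical. Table 11 assigns the same amino acids as the standard code and differs in which codons may act as start codons, which this sketch does not handle (the gene here starts with ATG, so it is not needed).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCAGCGG CGCCATTCGC GCCTTGCAGG"))  # MAAAPFAPCR
# Last 9 bases: two residues followed by the TGA stop codon.
print(translate("GCTTGGTGA"))  # AW
```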