Gene Mext_4628 details


Gene Information

Locus tag: Mext_4628
Symbol:
ID: 5833804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5176985
End bp: 5177965
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 67%
IMG OID: 641370422
Product: aldo/keto reductase
Protein accession: YP_001642067
Protein GI: 163854024
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
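
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not translated. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the record above.
start_bp = 5176985
end_bp = 5177965
gene_length_bp = 981
protein_length_aa = 326

# Assumption: coordinates are inclusive, so both end positions are counted.
span_bp = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, plus one untranslated stop codon.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp)              # 981
print(expected_protein_aa)  # 326
```

Under these assumptions the span matches the listed gene length and the codon count matches the listed protein length.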


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0836973
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.048745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGACAA AAAAACTCGG CCGCACCGGC CTCGACATCT CGCCCCTCTG CCTCGGCTGC 
ATGACCTACG GCGTGCCGGA GCGGGGACCG CATCCGTGGA CCCTGCGGGA GGAGGAGAGC
CGGCCGCTGA TCAAGCGGGC GCTCGATCTC GGGATCAACT TCTTCGACAC GGCGAACTAC
TATTCGGACG GCACGTCCGA GGAGATCGTC GGCCGCGCTC TCAAGGACTA CGCCGACCGC
GACAGCATCG TACTGGCCAC CAAGGTCTAC TACCCGCAGA AGAACCAGCC CAATGCCGGC
GGCCTCTCGC GCAAGGCGAT CTTTTCCGCG ATCGACGCTT CACTGAAGCG GCTCGGCACC
GATTACGTCG ATCTCTACCA GATCCATCGC TGGGACTACG CAACGCCGAT CGAGGTGACG
CTGGAGGCGC TGCACGACGT CGTGAAGGCG GGCAAGGCCC GTTACATCGG TGCCTCGTCG
ATGTTCGCGT GGCAGTTCGC CAAGGCGCTC TACACCTCTG ACCTGAACGG CTGGACGCGG
TTCGCGACGA TGCAGAACCA CCTCAATCTG CTGCACCGCG AGGAGGAGCG GGAGATGATC
CCGCTCTGCG CCGACCAGGG CATCCCGCTC CTGCCCTGGA GCCCGCTCGC CCGCGGCCGG
CTCACCCGCG ACTTCGACGC GGGCAGCGCC CGGCAAGAGA GCGATCTCGT GGGCAAGAAC
CTCTATGACG CCACGGTCGA GGCCGATCGG CAGGTGGTCG AGGCGGTGGC CGACGTCGCG
CGCGACCGCG GCGTGCCCCG GGCGCAGGTG GCGCTCGCCT GGGTGATCCA GAAGCGGGGC
GTGGCCGCCC CGATCATCGG CGCCTCGAAG CCGGGGCACC TCGACGACGC GGCGGCGGCC
CTCGACCTCG AATTGATGCC GGACGAGATC GCCCGGCTCG AAGCCCCTTA CGTTCCGCAC
GCCGTGGTCG GGTTCCAATG A
 
Protein sequence
MQTKKLGRTG LDISPLCLGC MTYGVPERGP HPWTLREEES RPLIKRALDL GINFFDTANY 
YSDGTSEEIV GRALKDYADR DSIVLATKVY YPQKNQPNAG GLSRKAIFSA IDASLKRLGT
DYVDLYQIHR WDYATPIEVT LEALHDVVKA GKARYIGASS MFAWQFAKAL YTSDLNGWTR
FATMQNHLNL LHREEEREMI PLCADQGIPL LPWSPLARGR LTRDFDAGSA RQESDLVGKN
LYDATVEADR QVVEAVADVA RDRGVPRAQV ALAWVIQKRG VAAPIIGASK PGHLDDAAAA
LDLELMPDEI ARLEAPYVPH AVVGFQ
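
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the method used to produce the record: `translate`, `CODON_TABLE`, and the other names are hypothetical helpers written for this example. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and it ignores the 10-base spacing used in the listing.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
# Table 11 shares these codon-to-residue assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spacing between blocks
    residues = []
    # Walk complete codons only; any trailing 1-2 bases are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above (60 bases).
first_row = "ATGCAGACAA AAAAACTCGG CCGCACCGGC CTCGACATCT CGCCCCTCTG CCTCGGCTGC"
print(translate(first_row))  # MQTKKLGRTGLDISPLCLGC
```

The printed residues correspond to the first two blocks of the protein sequence listed above; passing the full gene sequence to the same function would translate the whole coding region up to the terminal TGA stop codon.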