Gene M446_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3040 
Symbol 
ID6134889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3363318 
End bp3364445 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content75% 
IMG OID641643231 
Product2-alkenal reductase 
Protein accessionYP_001769885 
Protein GI170741230 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0397571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGATC GTTTCGTGCG GATCGCCCTC GGCGCCGCGC TGGGGCTGCT CGCCCTGTTC 
GTGGCGCAGC CCTACCTGAC CGCCCTGCTG TTCTCGGTGG AGCAGCCGCG GGCCGTCACC
CCGCGCGGCG ACCTCGCCCC CGCCGAGGCC GCCACCGTGG CGCTGTTCGA GCGCGCCGCC
CCCTCGGTCG TCTACGTCTT CGCGCGCCGC GCCCCCAGCG TGCAGGACCT GATGCGCCAG
GGCATGGACG GCACCGAGCA GGGCGGGCAG GGGAGCGAGC AGACCGGGAC CGGCTTCGTC
TGGGACGCGG GCGGCCACGT GGTCACCAAC AACCACGTCA TCCAGGGCGG CTCGGAGATC
TCGGTGCGGC TGTCGAGCGG CGAGATCGTG CCGGCGACCC TGGTCGGCGC GGCGCCCAAC
TACGACCTCG CGGTGCTGCG CCTCGGGCGG GTGAGCGCCA TGCCGCCGCC CATCGCCATC
GGCAGCTCCG CCGACCTCAA GGTCGGGCAG TTCGTCTACG CGATCGGCAA CCCGTTCGGG
CTCGACCACA CCCTGACCTC GGGGGTGATC AGCGCCCTGC AGCGGCGCCT GCCGACCCAG
GAGGGGCGGG AGCTCTCGGG CGTGATCCAG ACCGACGCGG CGATCAACCC GGGCAATTCC
GGCGGGCCGC TCCTCGACTC GGCCGGGCGG GTGATCGGGG TCAACACGGC CATCTTCTCG
CCCTCGGGCG CGAGCGCCGG CATCGGCTTC GCGGTGCCGA TCGACGTCGT CAACCGCGTG
GTGCCGGACC TGATCCGCAC GGGCCGCGCG CCGAGCCCGG GGATCGGCAT CGTGGCGGCG
CAGGAGGAGG CGGCCGCCCG GCTCGGCATC GACGGGGTCG CGGTGGTGCG CGTGCTGCGC
GGATCGCCGG CCGCCGCCGC CGGCCTGCGC GGCGTCGACC CGGCCACGGG CGAACTGGGC
GACATCATCG TCGGGGTCAA CAACCGCCCG GTCCACCGCC TGGCCGACCT CACGGCGGCG
ATCCAGGAGG CGGGCGTCGG CCGGACCCTG GAACTGACCA TCCTGCGCGA CGGGCGGCCG
CGCACGCTCC AGATCACCAC GGCCGATATG GGGCAGCGCG TCCCTTGA
 
Protein sequence
MPDRFVRIAL GAALGLLALF VAQPYLTALL FSVEQPRAVT PRGDLAPAEA ATVALFERAA 
PSVVYVFARR APSVQDLMRQ GMDGTEQGGQ GSEQTGTGFV WDAGGHVVTN NHVIQGGSEI
SVRLSSGEIV PATLVGAAPN YDLAVLRLGR VSAMPPPIAI GSSADLKVGQ FVYAIGNPFG
LDHTLTSGVI SALQRRLPTQ EGRELSGVIQ TDAAINPGNS GGPLLDSAGR VIGVNTAIFS
PSGASAGIGF AVPIDVVNRV VPDLIRTGRA PSPGIGIVAA QEEAAARLGI DGVAVVRVLR
GSPAAAAGLR GVDPATGELG DIIVGVNNRP VHRLADLTAA IQEAGVGRTL ELTILRDGRP
RTLQITTADM GQRVP