Gene Mchl_2037 details

Gene Information

Locus tag: Mchl_2037
Symbol:
ID: 7118737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2134603
End bp: 2135676
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 67%
IMG OID: 643524787
Product: Rieske (2Fe-2S) domain protein
Protein accession: YP_002420812
Protein GI: 218529996
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
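
The length fields in this record follow from its coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is the reading consistent with the listed 1074 bp) and that the last codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2134603, 2135676)
print(length_bp)                  # 1074
print(protein_length(length_bp))  # 357
```

Both values agree with the Gene Length and Protein Length fields of the record.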


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.950553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.781731
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACGA CCCGCCAGAA GCTGTGGCGC CATTACTGGT ACGCGACCTT GCGCCTGTCC 
GACCTCGCCG ACGGCCCCAA GCCCTTCACC CTGATGGGCG AGCGGATCGT GCTGTTTCTC
GACGGCGAGG GCGAGCCCGC CGCGATGATG GACCGCTGCT GCCACCGCAC CGCCCGGCTC
TCCAAGGGCT GGTGCGAGGA CGGCCTGATC GTGTGCGGCT ATCACGGCTG GGCCTATGAC
CGGCACGGCG CGCTCGCCCG CATCCCGCAA TTCAGCCCGG AGCAGGTCGT GCCGCGGCTC
GCGGTGAAGA GCTACCACTG CACCGCGAAG TACGGCTACG CCTGGGTCTG CCTGGAGGAG
CCCTACGCGG CGATCCCCGA GATCCCCGAG GACACGATGC CCGGCTACCG GCGCATCCAG
CAATTCCACG ACGTGTGGAA GACCTCGCCC CTGCGCCTGA TGGAGAACTC GTTCGACAAC
GCGCACTTCG CCTTCGTGCA CCAGAACACG TTCGGGCAGA TCAGCCAGCC GATCCCGGAA
AAGTACGAGA TCACCGAGAC TGAGTACGGC TTCGAGGCGG AGACGATCAT CACCATCGCC
AACCCGCCGA TGGCCCACCG CATCAGCGGC ACCACTGAGC CGACCACCAA GCGCCACATG
CGCAACAAGT GGTTCATGCC GTTCTGCCGC CGGCTCGACA TCGAATACCC GTCGGGCTTG
CGGCACATCA TCTTCAACTC GGCGACGCCG ATCGACGACG GCACGATCCG ACTCGCGCAG
ATCCTCTACC GCAACGACCG CGAGGAGGAT TGCTCAACGG AAGCGCTGAT CGCCTGGGAT
GCGGTGATCG TCGAGGAGGA CCGCGACATC CTCGAATCGA CCGACCCGGA CGCCGCGGTC
GATATGGGCC GCAAGGTCGA GAGCCACATG CCCTCCGACC GCCCCGGCAT GATCATGCGC
CGCCGCCTGC TGGCCGCCCT GCACGCCCAT GGTGAGGAGG AGGTGTCAGA GGCAACGCCG
GCGGTCTCCG TGCCGGTGGC GCCGACGCTG ATGCCGCACG AGAGGGTCGC GTGA
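
The GC content field of this record can be recomputed from the gene sequence once the display spacing is removed. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first display line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, used as sample input.
first_line = "ATGCTGACGA CCCGCCAGAA GCTGTGGCGC CATTACTGGT ACGCGACCTT GCGCCTGTCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1074 bp sequence through the same two functions gives a figure that can be compared with the GC content listed in the record.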
 
Protein sequence
MLTTRQKLWR HYWYATLRLS DLADGPKPFT LMGERIVLFL DGEGEPAAMM DRCCHRTARL 
SKGWCEDGLI VCGYHGWAYD RHGALARIPQ FSPEQVVPRL AVKSYHCTAK YGYAWVCLEE
PYAAIPEIPE DTMPGYRRIQ QFHDVWKTSP LRLMENSFDN AHFAFVHQNT FGQISQPIPE
KYEITETEYG FEAETIITIA NPPMAHRISG TTEPTTKRHM RNKWFMPFCR RLDIEYPSGL
RHIIFNSATP IDDGTIRLAQ ILYRNDREED CSTEALIAWD AVIVEEDRDI LESTDPDAAV
DMGRKVESHM PSDRPGMIMR RRLLAALHAH GEEEVSEATP AVSVPVAPTL MPHERVA
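
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It uses the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, and it assumes the CDS begins with ATG, as this one does. `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCTGACGA CCCGCCAGAA GCTGTGGCGC"))  # MLTTRQKLWR
print(translate("GAGAGGGTCGCGTGA"))                   # ERVA
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Translating the full gene sequence the same way allows a comparison over all 357 residues.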