Gene Mchl_3035 details


Gene Information

Locus tag: Mchl_3035
Symbol:
ID: 7118313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 3202424
End bp: 3203572
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 60%
IMG OID: 643525786
Product: Rieske (2Fe-2S) domain protein
Protein accession: YP_002421801
Protein GI: 218530985
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
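
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following Python sketch is not part of the original record; it assumes the Start bp and End bp values are inclusive coordinates and that the CDS ends with one untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3202424
end_bp = 3203572
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
assert remainder == 0
assert codons - 1 == protein_length_aa

print(span, codons)
```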


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.416925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCCTT GCGCAAATTA CTACCGTCCC GAGATCCTTG CCGACGAACT GGACACTCTG 
TTCGATCCAC TCTGGCAATT TGGAGCCCTA GCAGGAGAAC TCGCCGCGGA TCGCGATTTC
GTCTGCGTCG ATTACAAGAA CACGGCGACC GTCCTGCAGA ACTTCCGCGG CGAGATCCGG
GCATTCGCCA ATGTGTGCAG CCATCGGTTC AACCGCATCC AGCCGGGCGA GCGCGGCAAC
CGGCCGCTGA TGTGTGCCTA TCATGGCTGG AGCTTCGACA GCACCGGTTT CCCGCACGGC
ATGCCGCGCC GCGGCGGATT CGCGCTCGAT GATCGCGAGC GGTTATGTCT GACCGGTTAC
GAAGTTGAGA CGTGCGGAAT TTTCGTATTT TTTCGTAAGC GGAGCGGCGG ACCGTCTCTG
CGCGAGTATC TGGGCGCATT CTATCCACTG CTTGAGCAGA TCGGATCCTA TTTCGGACCG
GAAATTGATG CTGGAACAAT TTCACACGCG GCCAATTGGA AGCTGCTCGT CGAGAACGTT
CTTGAATGCT ACCATTGCTC GGTCGTCCAT CAGGACACGT TCGTGAAAAC GCTCGGGATT
GGCAGAGCAG GCATCGAGCA GGAACGGTTC GACGGACCGC ATTCCAGTAG CCACTTCCCG
CGCACCGCGA CGGCCGGAGA GGCCCGGCGG CAGAAGGCGC TCGCCTATCT CGACACCCGC
GCCTTCACCC ACGACTCGTT TTTCCACATT CACATCTTTC CCAACCTGTT CATCTCATCG
ACGCAGGGCC TGTCTTTTTA TGTCGGCCAC GCTTTGCCTC TGTCGGCGAC GGAAACCGGA
CTGCGCTTTC GGCTATTCGA ACCGAAGCTC GACCTGACCC GTGCGCAGCG CGCGGCACAG
GATCTGATCA ACCAATCGGG CAAGGCGCTG GGTCGTGCGG TGATCGACGA GGACCGAGCG
ATCCTGGAAA ATGTCCAGCG GGGCGTCGAA TTGTCGGAGA AGCCCGGTGT GATCGGTCGC
GACGAAATCC GGATCGCCGC GTTCATGCGC GCCTACACGC ACCTCATGGG TGGCGGCTCA
CTTGGCGGTA TACCCTCCAT CGACGACCAT GTTGCTGCCG GTGATCCAGC GCGAAGCATC
GCTGAGTAG
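
The GC content reported in the record can in principle be recomputed from the gene sequence. The following is a minimal Python sketch, not part of the original record; `gc_fraction` is a hypothetical helper name, and the demo uses only the first row of the sequence as printed above, so its result is not expected to equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence as printed in the record (six blocks of ten bases).
first_row = "ATGCTGCCTT GCGCAAATTA CTACCGTCCC GAGATCCTTG CCGACGAACT GGACACTCTG"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

To check the whole gene, the same function could be applied to all rows of the gene sequence joined into one string.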
 
Protein sequence
MLPCANYYRP EILADELDTL FDPLWQFGAL AGELAADRDF VCVDYKNTAT VLQNFRGEIR 
AFANVCSHRF NRIQPGERGN RPLMCAYHGW SFDSTGFPHG MPRRGGFALD DRERLCLTGY
EVETCGIFVF FRKRSGGPSL REYLGAFYPL LEQIGSYFGP EIDAGTISHA ANWKLLVENV
LECYHCSVVH QDTFVKTLGI GRAGIEQERF DGPHSSSHFP RTATAGEARR QKALAYLDTR
AFTHDSFFHI HIFPNLFISS TQGLSFYVGH ALPLSATETG LRFRLFEPKL DLTRAQRAAQ
DLINQSGKAL GRAVIDEDRA ILENVQRGVE LSEKPGVIGR DEIRIAAFMR AYTHLMGGGS
LGGIPSIDDH VAAGDPARSI AE
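
The protein sequence above should correspond to the gene sequence translated with translation table 11. The following Python sketch, which is not part of the original record, shows one way to spot-check that on the first and last codons. `translate` and `CODON_TABLE` are hypothetical names, and the sketch assumes the sequence is given in its coding orientation and starts with ATG (alternative start codons are not handled).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between ten-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence in the record.
print(translate("ATGCTGCCTT GCGCAAATTA CTACCGTCCC"))
print(translate("GCTGAGTAG"))
```

The two printed fragments can be compared by eye with the start and end of the protein sequence in the record.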