Gene Mchl_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_4043 
Symbol 
ID7118048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp4258461 
End bp4259720 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content66% 
IMG OID643526762 
ProductRieske (2Fe-2S) domain protein 
Protein accessionYP_002422771 
Protein GI218531955 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.839195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.202366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATG TGACGCCGAC CCCGCTTCAG CGCCTGCTGC GCGAGCGTCG CCCCGGCTAC 
ACCCTCGCGG CCCCGTTCTA CCTCAGCCCC GAGGTGTTCG AGGCCGACAT GGAGATCATC
TTCGGCCGCC ACTGGATCTA TGTCGGCGTC GAGCCCGACG TGCCGGAGGC CGGCGACGTC
ATGGTCGTCG AGATCGGCAA GACTTCGGTC GCGATCGTGC GCGACGACGA CAACGCGATC
CGCGCCTTCC ACAATGTCTG CCGCCACCGC GGCGCCCGGC TCGTCCATGA CGAGAAGTCC
ACGGTCGGCA ACCTCGTCTG CCGCTACCAT TCCTGGACCT ACGACCTCAC CGGCAACCTG
ATCCACGCCG AGCATATGGG TCCGGACTTC AAGAAGAGCT GCCACGGCCT CAAGCCCGTC
CACATCCGCT CGCTCGCCGG CCTGCTCTTC ATCTGCCTCG CCGACCAGCC CCCGGCCGAT
TTCGACGAGA TGGCCGCGAA GCTCGGCCCC TATATCGAGC CGCACAACGT GCGCGACACC
AAGGTCGCCT TCCAGAAGGA CATCATCGAG CCCGGCAACT GGAAGCTCAC GATGGAGAAC
AACCGCGAGT GCTACCATTG CGGGGCCAAC CATCCCGAGT TGACCGTGCC GCTCTTCGCC
TACGGCTTCG GCTTCGCGCC CGAGGAGATG GACGAGCACG ACCGCGCCAA CGCCGAGCGC
TACGGCTGCC TGCTCAAGAC CCGCCACGGC GAGTGGGAGG CGGAAGGTCT GCCGTCGAAG
GAGATCGACG AGCTTGACAC CATGATCACG GGCTTCCGCA CCGAGCGGCT GCCGCTCGAC
GGTGAGGGCG AGTCCCACAC CCTCGACACC AAGGCCGCCT GCAAGCGGCT GCTCGGCAAC
CTCACCAGCG CCAAGCTCGG CGGGCTCTCG GTCTGGACGC AGCCGAATTC CTGGCACCAC
TTCCTCGGCG ACCACATCGT CACCTTCTCG GTGCTGCCGC TCGATGCCGA GCGCTCGCTG
CTGCGCACCA AGTGGCTCGT GCACAAGGAT GCGGTCGAGG GCGTCGATTA CGATCTCGCC
AACCTCACCG GCGTCTGGGA AGCCACGAAC GATCAGGACA GCGAACTCGT CGGCATCTGC
CAGCAGGGTG TCGCGAGCCC GGCCTACGAG CCCGGCCCCT ACTCGCCGCA TACCGAGATG
CTCGTGGAGA AGTTCTGCAA CTGGTATGTC GGCCGCATGG CCGCGCATCT GGGGCGCTGA
 
Protein sequence
MLDVTPTPLQ RLLRERRPGY TLAAPFYLSP EVFEADMEII FGRHWIYVGV EPDVPEAGDV 
MVVEIGKTSV AIVRDDDNAI RAFHNVCRHR GARLVHDEKS TVGNLVCRYH SWTYDLTGNL
IHAEHMGPDF KKSCHGLKPV HIRSLAGLLF ICLADQPPAD FDEMAAKLGP YIEPHNVRDT
KVAFQKDIIE PGNWKLTMEN NRECYHCGAN HPELTVPLFA YGFGFAPEEM DEHDRANAER
YGCLLKTRHG EWEAEGLPSK EIDELDTMIT GFRTERLPLD GEGESHTLDT KAACKRLLGN
LTSAKLGGLS VWTQPNSWHH FLGDHIVTFS VLPLDAERSL LRTKWLVHKD AVEGVDYDLA
NLTGVWEATN DQDSELVGIC QQGVASPAYE PGPYSPHTEM LVEKFCNWYV GRMAAHLGR