Gene Mext_3748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3748 
Symbol 
ID5832957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4154092 
End bp4155351 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content67% 
IMG OID641369538 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001641193 
Protein GI163853150 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.219388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATG TGACGCCGAC CCCGCTTCAG CGCCTGCTGC GCGAGCGTCG CCCCGGCTAC 
ACCCTCGCGG CCCCGTTCTA CCTCAGCCCC GAGGTGTTCG AGGCCGACAT GGAGATCATC
TTCGGCCGCC ACTGGATCTA TGTCGGCGTC GAGCCCGACG TGCCGGAGGC CGGCGACGTC
ATGGTCGTCG AGATCGGCAA GACCTCGGTC GCGATCGTGC GCGACGACGA CAACGCGATC
CGCGCCTTCC ACAATGTCTG CCGCCACCGC GGCGCCCGGC TCGTCCATGA CGAGAAGTCC
ACGGTCGGCA ACCTCGTCTG CCGCTACCAT TCCTGGGCCT ACGACCTCAC CGGCAACCTG
ATCCATGCCG AGCATATGGG TCCGGACTTC AAGAAGAGCT GCCACGGCCT CAAGCCCGTG
CATATCCGCT CGCTCGCCGG CCTGCTGTTC ATCTGCCTCG CCGACCAGCC CCCGGCCGAT
TTCGACGAGA TGGCCGCGAA GCTCGGCCCC TATATCGAGC CGCACAACGT GCGCGACACC
AAGATCGCCT TCCAGAAGGA CATCATCGAG CCCGGCAACT GGAAGCTGAC GATGGAGAAC
AACCGCGAGT GCTACCATTG CGGGGCCAAC CATCCCGAGC TGACCGTGCC GCTCTTCGCC
TACGGCTTCG GCTTCGCGCC GGAGGAGATG GACGAGCACG ACCGCGCCAA CGCCGAGCGC
TACGGCTGCC TGCTCAAGAC CCGCCACGGC GAGTGGGAGG CGGAAGGCCT GCCGTCGAAG
GAGATCGACG AGCTCGACAC CATGATCACG GGCTTCCGCA CCGAGCGGCT GCCGCTCGAC
GGTGAGGGCG AGTCCCACAC CCTCGACACC AAGGCCGCCT GCAAGCGGCT GCTCGGCAAC
CTCACCAGCG CCAAGCTCGG CGGGCTCTCG GTCTGGACGC AGCCGAATTC CTGGCACCAC
TTCCTCGGCG ACCACATCGT CACCTTCTCG GTGCTGCCGC TCGATGCCGA GCGCTCGCTG
CTGCGCACCA AGTGGCTCGT GCACAAGGAT GCGGTCGAGG GCGTCGATTA CGATCTCGCC
AACCTCACCG GCGTCTGGGA AGCCACGAAC GATCAGGACA GCGAACTCGT CGGCATCTGC
CAGCAGGGTG TCGCGAGCCC GGCCTACGAG CCCGGCCCCT ACTCGCCGCA TACCGAGATG
CTCGTGGAGA AGTTCTGCAA CTGGTATGTC GGCCGCATGG CCGCGCATCT GGGGCGCTGA
 
Protein sequence
MLDVTPTPLQ RLLRERRPGY TLAAPFYLSP EVFEADMEII FGRHWIYVGV EPDVPEAGDV 
MVVEIGKTSV AIVRDDDNAI RAFHNVCRHR GARLVHDEKS TVGNLVCRYH SWAYDLTGNL
IHAEHMGPDF KKSCHGLKPV HIRSLAGLLF ICLADQPPAD FDEMAAKLGP YIEPHNVRDT
KIAFQKDIIE PGNWKLTMEN NRECYHCGAN HPELTVPLFA YGFGFAPEEM DEHDRANAER
YGCLLKTRHG EWEAEGLPSK EIDELDTMIT GFRTERLPLD GEGESHTLDT KAACKRLLGN
LTSAKLGGLS VWTQPNSWHH FLGDHIVTFS VLPLDAERSL LRTKWLVHKD AVEGVDYDLA
NLTGVWEATN DQDSELVGIC QQGVASPAYE PGPYSPHTEM LVEKFCNWYV GRMAAHLGR