Gene Mext_2807 details


Gene Information

Locus tag: Mext_2807
Symbol:
ID: 5831628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3142735
End bp: 3143883
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 60%
IMG OID: 641368609
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001640269
Protein GI: 163852226
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
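
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3142735
end_bp = 3143883

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1149, matching "Gene Length"
print(protein_length)   # 382, matching "Protein Length"
```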


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCCTT GTGCAAACTA CTACCGTCCC GAGATCCTTG CCGACGAGCT GGGCACTCTG 
TTCGATCCAC TCTGGCAGTT TGGAGCGCTG GCAGGAGAAC TCGCCGCGGA TCGCGATTTC
GTCTGCGTCG ATTACAAGCA CACGGCAACC GTCCTGCAGA ACTTCCGCGG CGAGATCCGG
GCATTCGCCA ATGTGTGCAG CCATCGATTC AACCGCATCC AGCCGGGCGA GCGCGGCAAC
CGGCCGCTGA TGTGTGACTA TCATGGCTGG AGCTTCGATA GCACCGGTTT CCCGCATGGC
ATGCCGCGCC GCGACGGCTT CGCGCTCGAT GATCGCGAGC GATTATGCCT GAGCCGTTAC
GAAGTTGAGA CGTGTGGTAT TTTCGTATTC TTTCGTAAGC GCAGGGGCGG ACCGTCTCTG
CGCGAGTATC TGGGCGCGTT CTATCCACTG CTTGAGCAGA TCGGATCCTA TTTCGGGCCA
GAAATTGATA CTGGAACGAT TTCGCACGCG GCCAATTGGA AGCTGCTCGT CGAGAACGTT
CTCGAATGCT ACCATTGCTC GGTCGTCCAT CAAGACACCT TCGTGAAAAC GCTCGGGATT
GGCAGGGCAG GCATCGAGCA GGAACGGTTC GACGGACCGC ATTCCAGTAG CCACTTCCCG
CGCACCGCGA CGGCCGGAGA GGCGCGGCGG CAGAAGGCGC TCGCCTATCT CGACACCCGC
GCCTTCACCC ACGACTCGTT TTTCCACATT CACATCTTTC CCAACCTGTT CATCTCATCG
ACGCAGGGTC TGTCTTTTTA TGTCGGCCAC GCTTTGCCTC TGTCGGCGAC GGAAACCGGA
CTGCGCTTTC GGCTATTCGA ACCGAAGCTC GACCTGACCC GTGCGCAGCG CGCGGCACAG
GATCTGATCA ACCAGTCGGG CAAGGCGCTG GGTCGCGCGG TGATCGACGA AGATCGCGCG
ATCCTGGAGA ATGTCCAACG GGGCGTCGAA CTGTCGGAGA AGCCCGGTGT CATCGGTCGC
GACGAGATCC GGATCGCCGC GTTCATGCGC GCCTACACGC ACCTCATGGG CGGCGGGTCA
CTTGGCGGTA TACCCCCCAT CGACGACCAT GTTGCTGCCG GTGATCCAGC GCGAAGCATC
GCTGAGTAG
 
Protein sequence
MLPCANYYRP EILADELGTL FDPLWQFGAL AGELAADRDF VCVDYKHTAT VLQNFRGEIR 
AFANVCSHRF NRIQPGERGN RPLMCDYHGW SFDSTGFPHG MPRRDGFALD DRERLCLSRY
EVETCGIFVF FRKRRGGPSL REYLGAFYPL LEQIGSYFGP EIDTGTISHA ANWKLLVENV
LECYHCSVVH QDTFVKTLGI GRAGIEQERF DGPHSSSHFP RTATAGEARR QKALAYLDTR
AFTHDSFFHI HIFPNLFISS TQGLSFYVGH ALPLSATETG LRFRLFEPKL DLTRAQRAAQ
DLINQSGKAL GRAVIDEDRA ILENVQRGVE LSEKPGVIGR DEIRIAAFMR AYTHLMGGGS
LGGIPPIDDH VAAGDPARSI AE
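
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The helper name `translate` and the two short excerpts are illustrative; only the first three 10-base groups and the final nine bases of the gene sequence are used, not the full record.

```python
# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the gene sequence above.
gene_start = "ATGCTGCCTT GTGCAAACTA CTACCGTCCC"
gene_end = "GCTGAGTAG"

print(translate(gene_start))  # MLPCANYYRP, the first 10 residues of the protein
print(translate(gene_end))    # AE, the last 2 residues before the TAG stop codon
```

Passing the full gene sequence to the same function should reproduce the complete protein sequence, provided the record's two sequences are consistent with each other.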