Gene Mext_4238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4238 
Symbol 
ID5835089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4717352 
End bp4718407 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content71% 
IMG OID641370029 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001641678 
Protein GI163853635 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.658364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACG CCACCCCGCG CCCGCTCGCC ATCGAGACCG CGCGCGAAGC CGCCCCCGGT 
GCAGGGCCGC TGAGCCTGAC CCAGCGCCCG CGCCGCAACC GCAAGGCGGA TTGGTCGCGC
CGCCTCGTGC GCGAGCATAG CCTCACCGTC GATGACCTGA TCTGGCCGCT CTTCGTGATC
GAGGGCGAGA AGCGCCGCGA GCCGATCGCC TCCATGCCCG GCGTCGAGCG CCTGAGCGTG
GACGAGATCG TGCGCGAGGC CGAGCGCGCC GCGCGGCTCG GCATCCCGGC GATCTCGTTC
TTCCCCTACA CCGAGCCGTC CCTGCGCGAT CCGACCGGCT CCGAGGCGCT GAACCGCGAA
AACCTCGTCT GCCGGGCGGT GCGGGCGGTG AAGCGGGCTG TTCCCGAGAT CGGCGTGATG
ACCGATGTCG CGCTCGACCC CTATACCAGC CACGGCCATG ACGGCTTGAT CGAAGCCGGC
GCCATCCTCA ACGACGAGAC CGTGGCGGTG CTGGTCGAGC AGAGCCTGAT CCAGGCCGAG
GCCGGCACTG ACATTATCGC CCCCTCCGAC ATGATGGACG GGCGCGTCGG CGCGATCCGC
ACCGGCCTCG ACCGGGCCGG CTTTCGCGAT GTTCAGATCA TGGCCTACGC CGCGAAATAC
GCCAGCGCGT TCTACGGGCC GTTCCGCGAC GCCATCGGCA CCAGCGCGGC GCTGGTCGGC
GACAAGCGCA CCTACCAGAT GGATCCCGGC AACGCGGCCG AGGCCCTGCG CGAGGTGGCC
CTCGACCTTG CCGAGGGCGC CGACTCGGTG ATGGTCAAGC CCGGCCTGCC CTATCTCGAC
ATCATCACCC GCGTGAAGAC GGAGTTCGGC GTGCCGACCT TCGCCTATCA GGTGTCGGGC
GAGTACGCGA TGATCGAGGC CGCCGCCCGC AACGGCTGGC TCGACGGCGA CCGCGCCATG
ACGGAGAGCC TGCTCGCCTT CAAGCGCGCG GGCGCCGACG GGGTGCTGAC CTACTACGCC
CCCCGCGTCG CCGAGCGCCT GCGCGCGGGC GCCTGA
 
Protein sequence
MSDATPRPLA IETAREAAPG AGPLSLTQRP RRNRKADWSR RLVREHSLTV DDLIWPLFVI 
EGEKRREPIA SMPGVERLSV DEIVREAERA ARLGIPAISF FPYTEPSLRD PTGSEALNRE
NLVCRAVRAV KRAVPEIGVM TDVALDPYTS HGHDGLIEAG AILNDETVAV LVEQSLIQAE
AGTDIIAPSD MMDGRVGAIR TGLDRAGFRD VQIMAYAAKY ASAFYGPFRD AIGTSAALVG
DKRTYQMDPG NAAEALREVA LDLAEGADSV MVKPGLPYLD IITRVKTEFG VPTFAYQVSG
EYAMIEAAAR NGWLDGDRAM TESLLAFKRA GADGVLTYYA PRVAERLRAG A