Gene Mext_4804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4804 
Symbol 
ID5835239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5366803 
End bp5368074 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content67% 
IMG OID641370601 
Product5-aminolevulinate synthase 
Protein accessionYP_001642243 
Protein GI163854200 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID[TIGR00858] 8-amino-7-oxononanoate synthase
[TIGR01821] 5-aminolevulinic acid synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.577654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.145905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCGGA CGCAAGATCC ACCCACCGGC ATCATCTTCT TCGGGGAGGA CGGCATGGAT 
TACGAAGCGT TCTTCGACAA TGCGATCACC GGCCTCCACC GGGAGGGGCG CTACCGCGTC
TTCACCGATC TGGAGCGACA GGCGGGGCGG TTCCCCTACG CGACGCATCA CAGCCCCGGC
GGCGCGCGTG AGGTCACCGT CTGGTGCTCC AACGACTATC TCGGCATGGG CCAGCATCCG
TCGGTGCTGC AAGCGATGCA CGAGGCGATC GACCGATGTG GAGCCGGCGC GGGCGGCACC
CGAAACATCT CCGGCACCAA CCATTATCAC GTGCTCTTGG AGCAGGAACT CGCCGACCTC
CACGGAAAGG AAGCGGCGCT GATCTTCTCC TCCGGCTACG TCTCGAACTG GGCGGCGCTC
GGCACACTCG CCTCGAAGCT TCCGGGCTGC GTCGTCTTCT CCGACGAGGG CAACCATGCC
TCGATGATCG AGGGCATCCG TTCGAGCCGG GCCGAGCGCC AGATCTTCCG CCACAACGAT
CCGGAGGATC TGGACCGCAA GCTCGGGCTG ATCGAGCCCG GCCGGGCCAA GCTCGTCGCC
TTCGAGTCGG TCTATTCGAT GGATGGCGAC ATCGCCCCGA TCGACGAGAT CTGCGACGTG
GCCGAGGCGC ACGGGGCGCT CACCTATCTC GACGAGGTGC ACGCGGTCGG CCTCTACGGC
GCGCGGGGCG GCGGCATCTC GGAGCGGATG GAACTCGCCC ACCGGCTCGA CGTGATCGAG
GGAACGCTCG GCAAAGCGTT CGCCGTCCAT GGCGGCTACA TCACCGGCTC GACGCAGCTC
TGCGACTTCG TGCGCAGCTT CGCCTCGGGC TTCATCTTCA CGACCTCGCT GCCGCCGGCG
GTCGCGGCCG GCGCGGCGGC GAGCATCCGC CACCTCAAGG CGAGCCGTGT CGAGCGGGCG
CGGCATCAGG AGCGGGTGGC GCGGGTCCGG CAAGCGCTGG ATGCGGCGGG CATCCCGACT
TTGGCCAACC GCAGCCACAT CGTGCCGGTG ATGGTTTGCG ATCCCGTACT GTGCAAGGCG
ATCAGCGATA CCCTGCTCGA CGAGTTCGGC ATCTACGTGC AGCCGATCAA CTACCCGACC
GTGCCGCGCG GGACGGAGCG CCTACGCATC ACGCCCTCGC CGCTGCATTC CAACGCCGAC
ATCGACCACC TCGTGGACGC ACTGAGCACG ATCTGGCGGC GGATCGGGCT GAGCAAGGCG
GCGGCGGAGT AG
 
Protein sequence
MRRTQDPPTG IIFFGEDGMD YEAFFDNAIT GLHREGRYRV FTDLERQAGR FPYATHHSPG 
GAREVTVWCS NDYLGMGQHP SVLQAMHEAI DRCGAGAGGT RNISGTNHYH VLLEQELADL
HGKEAALIFS SGYVSNWAAL GTLASKLPGC VVFSDEGNHA SMIEGIRSSR AERQIFRHND
PEDLDRKLGL IEPGRAKLVA FESVYSMDGD IAPIDEICDV AEAHGALTYL DEVHAVGLYG
ARGGGISERM ELAHRLDVIE GTLGKAFAVH GGYITGSTQL CDFVRSFASG FIFTTSLPPA
VAAGAAASIR HLKASRVERA RHQERVARVR QALDAAGIPT LANRSHIVPV MVCDPVLCKA
ISDTLLDEFG IYVQPINYPT VPRGTERLRI TPSPLHSNAD IDHLVDALST IWRRIGLSKA
AAE