Gene Msil_1167 details

Gene Information

Locus tag: Msil_1167
Symbol:
ID: 7092240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1253802
End bp: 1255121
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 64%
IMG OID: 643464508
Product: 5-aminolevulinate synthase
Protein accession: YP_002361498
Protein GI: 217977351
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
            [TIGR01821] 5-aminolevulinic acid synthase
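
The length fields can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the coding sequence ends in a single stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the Gene Information fields above.
start_bp = 1253802
end_bp = 1255121

# Assumption: positions are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the final codon is a stop and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1320 439
```

Under these assumptions the result agrees with the listed gene length of 1320 bp and protein length of 439 aa.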


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0761593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTATC GGCGCTTCTT CGACGACGCC ATCTCGAAGT TGAAGGGCGA ACGGCGCTAT 
CGCGTCTTTG CCGATCTCGC GCGCGACGCG GAAGCTTTTC CGCGGGCGGT GTGGCGGCGC
GGCGAAGGCG GTCCCGAAGT CAACGTCACC GTCTGGTGCT CCAACGACTA TCTGTGCATG
GGCCGCCATC CGAAGGTGAT CGCCGCCATG CAGGACGCGG CGCAGGCGCA TGGCGTGGGA
GCCGGCGGAA CCCGCAATAT TTCCGGCAAT AATCATCCGA TCGTCGAACT CGAGGCCGAA
CTCGCCGATT TGCACGGCAA GGAAGCCGGC CTCGTCTTCA CCTCGGGCTG GATCTCCAAT
CTCGCCGCGA TCTCGACGAT CGGCGACATT CTGCCCGACT GCCTCATTCT GTCCGACCAG
CTCAATCATA ATTCGATGAT CGAGGGGGTG CGCCGCTCCA GGGCCGAGCG CAAGATTTTC
CGCCACAATG ATCTTGCTCA CCTCGAACAG CTGTTGGCCG AGGCCGGCGA GAGCCGCGCC
AAGCTGATTG TGTTTGAAAG CCTCTATTCG ATGAACGGCG ATATCGCGCC GATCAACGCC
ATCGCCGATC TTGCGCAGCG CTATAACGCC ATGACCTATA TGGACGAGGT CCACGCGGTC
GGCCTTTATG GCGCGCGGGG CGGCGGCATC GCCGAGCGCG ACGGCGCCAT GGCCCGCATC
GACGTCATCG AGGGAACCCT CGCCAAGGGG TTCGGCACGC TTGGCGGCTA TATTACCGGC
GAGGCTTCGA TCATCGATGC GGTGCGCTCC TATGCGCCAT CCTTCATCTT CACGACCTCG
CTGCCGCCGG CGATCGCCAC CGCCGCGAAA ACAGCCGTGA GCCTGTTGAA GCAGGGCGAG
GGGGCCGAAC TGCGCGCCCG CCACCAGCGC CAGTCGATGC TGACCAAACA TGCGCTGTCG
GCCGCCGGTC TGCCGGTGAT GCCGAATTCA TCGCATATCG TGCCGGTGCT CGTCGGCGAC
GCCGAGCTGT GCAAGGCGGC GACCGACATG CTGCTCGACC GGCATGCGAT CTATATCCAG
CCGATCAACT ATCCGACCGT CGCCAAGGGC ACGGAGCGGC TGCGCATCAC GCCGAGCCCG
CTGCATTCCG ACGCCCATAT CGCCCATCTG GTGGAATCGC TGGTCGATGT CTGGGCGAGC
CTAAAGCTGC CCTTCGTGGA GCAGCCGAAT ATCGTCGAAT TCCGCCGCGA AATGCCGGTC
CACGCCGCCG CCGAGGCGCA ATGCACCTTC CCCGAATTCT TCAAGAAGGC GGCGGAGTAG
 
Protein sequence
MEYRRFFDDA ISKLKGERRY RVFADLARDA EAFPRAVWRR GEGGPEVNVT VWCSNDYLCM 
GRHPKVIAAM QDAAQAHGVG AGGTRNISGN NHPIVELEAE LADLHGKEAG LVFTSGWISN
LAAISTIGDI LPDCLILSDQ LNHNSMIEGV RRSRAERKIF RHNDLAHLEQ LLAEAGESRA
KLIVFESLYS MNGDIAPINA IADLAQRYNA MTYMDEVHAV GLYGARGGGI AERDGAMARI
DVIEGTLAKG FGTLGGYITG EASIIDAVRS YAPSFIFTTS LPPAIATAAK TAVSLLKQGE
GAELRARHQR QSMLTKHALS AAGLPVMPNS SHIVPVLVGD AELCKAATDM LLDRHAIYIQ
PINYPTVAKG TERLRITPSP LHSDAHIAHL VESLVDVWAS LKLPFVEQPN IVEFRREMPV
HAAAEAQCTF PEFFKKAAE
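
The protein sequence follows from the gene sequence via translation table 11. As a minimal sketch, the code below builds a codon lookup and translates the first and last blocks of the gene sequence. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in their permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGAGTATC GGCGCTTCTT CGACGACGCC"))  # MEYRRFFDDA
# Last 18 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("AAGAAGGCGG CGGAGTAG"))  # KKAAE
```

Both fragments match the corresponding ends of the listed protein sequence. Passing the full 1320-base gene sequence to `translate` should give the full protein in the same way.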