Gene Msil_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2030 
Symbol 
ID7094228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2201709 
End bp2202935 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content62% 
IMG OID643465354 
Product5-aminolevulinate synthase 
Protein accessionYP_002362332 
Protein GI217978185 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID[TIGR00858] 8-amino-7-oxononanoate synthase
[TIGR01821] 5-aminolevulinic acid synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTACG AACAAGGCTT CCAGGTCCGG ATCGACGCCC TGCATGCGGA GGGGCGATAC 
CGCGTCTTCG CCGACATCAT CCGCCAGCGC GGAGCCTTTC CCAAAGCCGA GCATTTACGG
GGCGGCGCGC ATCGCAGCGT GACGGTCTGG TGCTCCAATG ATTACCTCGG CATGGGGCAG
CATCCCGTCG TGCTCGCGGC GATGCATGAG GCCCTCGATA CGGCCGGCGC AGGATCGGGC
GGAACGCGGA ATATTTCCGG CACCACCCAT TATCACGTCG AACTCGAGGC CGAGCTGGCT
GACCTGCATG GCAAGGAATC CGCCTTGTTG TTTACGTCCG CCTATGTCGC CAATGATGCC
GCGATCGCGA CGTTGGTCAA ATTGCTCCCG GGCTGCGTCA TTTTTTCCGA CGAGAAAAAT
CACGCGTCTC TGATTGCGGG GATTCGTCAC GGCGGCGGCC AGAAGGAAAT CTGGCGCCAC
AACAACATCA AGGATCTTGA GGCCAAACTC AGCAAATATC CAAGGCACGC GCCGAAATTG
ATTGTCTTTG AAAGCGTCTA TTCGATGGAC GGCCATATTG CGCCGATTGC GGAGGTCTGC
GCGCTGGCGA AGAAATATAA CGCGCTGACC TACCTCGACG AGGTTCATGG CGTCGGCCTC
TATGGCGCGC GTGGCGCCGG CGTCGCCGAG CGCGACGGCG CGATGGATCA GGTCGACATC
ATAAATGGCA CGCTCGCCAA GGGCTTCGGC GTGATGGGCG GCTACATCGC GGGCAGCCGC
GCCTGCTGCG ACGCAATCCG CTCCTATGCG GCGGGCTTCA TCTTCACGAC CTCGCTCGCG
CCCGTCATCG CCGCCGGCGC GAGGGCCAGC ATCCGCCACC TGAAAGCCAG CAGCGCCGAG
CGCGTACTTC ACCAGCAGCG CGCGATCACA TTGAAGCAGC GCCTCACCGA CGCCGGCTTG
CCGGTCATGA GAAGCCAAAG CCACATCGTG CCGGTGATCG TCGGCGATCC GGTGCACTGC
AAGGCGATCA CCGATCTGTT GCTCGACGAT TATGCGATCT ATGTGCAGCC GATCAACTAC
CCGACCGTCG CGCGCGGTTC GGAGCGCATA AGGCTGACGC CGTCGCCGGT GCATACGGAC
GCCCAGATGG ACTATCTCGT CGACACGCTG TCACATCTCT GGTCGCGGTG TCCGATGTCG
CAGGCGATGG CGATTGCCGC GCAATAA
 
Protein sequence
MNYEQGFQVR IDALHAEGRY RVFADIIRQR GAFPKAEHLR GGAHRSVTVW CSNDYLGMGQ 
HPVVLAAMHE ALDTAGAGSG GTRNISGTTH YHVELEAELA DLHGKESALL FTSAYVANDA
AIATLVKLLP GCVIFSDEKN HASLIAGIRH GGGQKEIWRH NNIKDLEAKL SKYPRHAPKL
IVFESVYSMD GHIAPIAEVC ALAKKYNALT YLDEVHGVGL YGARGAGVAE RDGAMDQVDI
INGTLAKGFG VMGGYIAGSR ACCDAIRSYA AGFIFTTSLA PVIAAGARAS IRHLKASSAE
RVLHQQRAIT LKQRLTDAGL PVMRSQSHIV PVIVGDPVHC KAITDLLLDD YAIYVQPINY
PTVARGSERI RLTPSPVHTD AQMDYLVDTL SHLWSRCPMS QAMAIAAQ