Gene Msil_0420 details


Gene Information

Locus tag: Msil_0420
Symbol:
ID: 7093579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 461843
End bp: 462934
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 66%
IMG OID: 643463750
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_002360756
Protein GI: 217976609
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
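
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 461843
END_BP = 462934


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1092 363
```

Both values agree with the Gene Length and Protein Length fields of the record.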


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGGC CCCTCAATTC AAAACCGAAA CAGTTTCCGG CCGATGAGGG CGCTGCCGCG 
GCCGCGGCGC GCGCCCTCGC CCCGGACGGC GATGCTCCTG CGCCCCTGCC CGGCCCGACG
GTGCGCCTGC GCCGCAACCG CAAGGCGGAG TGGACGCGCC GCCTCGTCCG CGAAAATGTC
CTGACCGCCA ATGATCTGAT CTGGCCGATT TTCGTGCGCG AGGGAGTCAA TCAGCATGAG
GCGATCGCCT CCATGCCGGG CGTCGAGCGC CTCTCGATCG ACGCCGCCGT CGAAAGCGCG
CGCGAGGCGC ATGCCCTTGG CGTGCCGGCG ATCGCGCTGT TTCCCTATAC CGACCCCGCG
CTTCGCGACG CCGCCGGGAC CGAGGCGCTG AACCCGGACA ATCTGATCTG CCGCGCTGTC
CGGGCGATCA AAGAAACGAC GCCGGAGATC GGCCTCATCA CCGACGTCGC GCTCGATCCC
TACACCAGCC ATGGCCATGA CGGCCTGATG CGCGGCGAGG AAATTCTCAA CGATGAGACG
GTCGAGGTTC TGGTCAAACA GGCGCTGAAT TTCGCGCGCG CGGGCGCCGA CATGATCGCC
CCCTCGGACA TGATGGACGG GCGCGTCGGC GCGATAAGGC GCGGGCTCGA CGCGGAGGGA
TTTACCTCCG TTCAGGTGCT GGCCTATGCC GCTAAATATG CCTCGGCCTT CTACGGCCCG
TTCCGCGACG CCGTCGGCAC GCAGAAGACG CTCATTGGCG ACAAGCGCAC CTATCAGATG
GACCCGGCCA ATTCGGATGA AGCGCTGCGC GAGGTGGCGC AGGATATTGC CGAGGGCGCC
GACATGGTGA TGGTGAAGCC CGGCCTGCCC TATCTCGACA TCATCTATCG CGTGAAGGAA
AAATTCGGCC TGCCGACCTT CGCCTATCAG GTGTCGGGCG AATACGCGAT GATCGAAGGC
GCGGCGCGCA ATGGCTGGCT CGACGGCGAC CGCGCGATTA TGGAGAGCTT GCTCGCTTTC
AAGCGCGCCG GCGCCGACGC CGTGCTGACC TATTTCGCCC CGCGCGTGGC GCGGCTGCTG
CGGGACGAGT AA
 
Protein sequence
MIWPLNSKPK QFPADEGAAA AAARALAPDG DAPAPLPGPT VRLRRNRKAE WTRRLVRENV 
LTANDLIWPI FVREGVNQHE AIASMPGVER LSIDAAVESA REAHALGVPA IALFPYTDPA
LRDAAGTEAL NPDNLICRAV RAIKETTPEI GLITDVALDP YTSHGHDGLM RGEEILNDET
VEVLVKQALN FARAGADMIA PSDMMDGRVG AIRRGLDAEG FTSVQVLAYA AKYASAFYGP
FRDAVGTQKT LIGDKRTYQM DPANSDEALR EVAQDIAEGA DMVMVKPGLP YLDIIYRVKE
KFGLPTFAYQ VSGEYAMIEG AARNGWLDGD RAIMESLLAF KRAGADAVLT YFAPRVARLL
RDE
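
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any processing. The sketch below, which is an illustration and not the database's own method, strips the spacing, computes the GC fraction, and translates a coding sequence. It assumes the codon assignments of the standard genetic code, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). All names are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATCTGGC CCCTCAATTC AAAACCGAAA"
print(translate(first_blocks))  # MIWPLNSKPK
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and the GC content field of the record.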