Gene Msil_3543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3543 
Symbol 
ID7092400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3892625 
End bp3893632 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content68% 
IMG OID643466834 
Producttranscriptional regulator, ArsR family 
Protein accessionYP_002363794 
Protein GI217979647 
COG category[H] Coenzyme transport and metabolism
[K] Transcription 
COG ID[COG0640] Predicted transcriptional regulators
[COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.304877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAT TTCACGCTCC ATTCGACGTC GTTTTGAACG CGCTTGCCGC CGCCGGGGAG 
GCGACCCGCC TGCGCCTGCT GGCGCTTCTC GCCGAGGCGG AGCTGACCGT CACCGAAATT
GTCACCATTC TCGGCCAGTC TCAGCCGCGC GTCTCGCGGC ATCTTCGCCT GCTCGCCGAA
GCCGGCCTCG TCGAGCGCCA TCGCGAGGGC TCCTGGGTGT TCTTTCTGAT GAGCCAGCAC
GGACCGGCCG CCGGCTTTGC GCGCGACATC GTCGCGAGGC TCGCCCCCGG GGAGCCTCTC
CTCGGCGCAG ACCGGGCGCG CCTTGCCGAA GTGCGCCAGG CGCGCGCCGA GCAGGCGGCG
CGCTATTTTG CAGCCCATGC GGCGAATTGG GACGAGTTGC GCGCGATGCA TCTTCCCGAG
GAGCGCGTCG AGGCGGCGAT CGTCGACATC GTCGGCAAAT CGCCGATCCA TGCGCTGCTC
GACCTTGGCG CCGGCACGGG GCGGATGCTT GAACTTCTCG CGCCGCTCGC CGCGCGCGCC
GTCGGCGTCG ATCAGTCGCC GCAGATGCTG GCCGTCGCGC GCGCGCGCCT TGAGCGCGCA
GGCCTGCGCA ACACGCAATT GCGGCAGGGC GATATTTACG CGCTTCCGGT CGAGCCCGAC
CATTACGATC TCGTCGTCAT GCATCAGGTG CTGCATTATC TTGACGATCC GCTGCGCGCC
ATCCGCGAGG CGACGCGGGC GCTGCGGCCG GGCGGGCGTC TCCTCATCGT CGACTTCGCG
CCGCATCATG AGGAGCATTT GCGCGCCGCC CATGCGCATC GCCGCCTCGG CTTTGCGGCC
GAGGAGATCG CGGGCTTCAT GCAGGCGTCC GGCCTCGACG TCGCGCTGCG CCGCGATCTC
GCCCCCAACC TCAGCGAGGG CGGCAAGCTC ACCGTGTCGA TCTGGCTCGG ACAAGACCGA
CGAATCATCA CCGATCAACT TCCTTTGACC GCGCGAGAAG TCGCATGA
 
Protein sequence
MTQFHAPFDV VLNALAAAGE ATRLRLLALL AEAELTVTEI VTILGQSQPR VSRHLRLLAE 
AGLVERHREG SWVFFLMSQH GPAAGFARDI VARLAPGEPL LGADRARLAE VRQARAEQAA
RYFAAHAANW DELRAMHLPE ERVEAAIVDI VGKSPIHALL DLGAGTGRML ELLAPLAARA
VGVDQSPQML AVARARLERA GLRNTQLRQG DIYALPVEPD HYDLVVMHQV LHYLDDPLRA
IREATRALRP GGRLLIVDFA PHHEEHLRAA HAHRRLGFAA EEIAGFMQAS GLDVALRRDL
APNLSEGGKL TVSIWLGQDR RIITDQLPLT AREVA