Gene Msil_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2037 
Symbol 
ID7094235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2207402 
End bp2208832 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content67% 
IMG OID643465361 
ProductPUCC protein 
Protein accessionYP_002362339 
Protein GI217978192 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGC TGAACGAAAA GCTCGCGCGC AACTGGAAGC GGCTCAGCCC GAGCCTTCTC 
CCCTTCGCCG ACGCAGCCAC TGTGGAGTTG CCGCTTGGCC AGCTGTTGCG GCTTTCGCTG
TTTCAAGTCA GCGTCGGCGT TTCCATTGTT CTGCTGGTCG GCACGTTGAA CCGCGTCATG
ATCGTCGAGC TTGGCGTTCC GGCCTGGCTC GTCGCGCTGA TGGTTTCGCT GCCTCTGGTT
TTCGCTCCGT TTCGCGCCCT CGTCGGCTTC CGCTCCGACG CCCATCGTTC CGCGCTCGGA
TGGCGCAGGG TGCCTTATCT CTGGCTCGGG ACGCTCATCC AGTTCGGCGG GCTGGCGATC
ATGCCCTTCG CCTTGATTGT TCTTTCCGGC GATTCCAATG GACCCGTCTG GGTCGGCCAC
GTCGCGGCGG CGCTCGCCTT TCTTCTGGTT GGCGCGGGCC TGCATACAAC GCAGACGGTG
GGGCTCGCGC TTGCGACAGA TCTGGCTCCG GCCCACGCGC GTCCGCGCGT CGTCGCCCTG
CTTTGCATGA TGCTTCTCGT TGGCATGCTT GCGAGCGCCC TCGCGTTCGG AGCGCTTCTC
GCCAACTTCA GCGAACTTCG CCTGATCAGG GTGGTTCAGG GCGCCGCCGT CTTGACCATG
GGCCTCAACA TGATCGCCTT GTGGAAGCAG GAGCCGCGCC GGGCGCTTCA GCCGCTCATC
ATGCCGCGCC CGTCCTTCGC CGCCTCCTGG AACGCCTACC TGCGCCAGAG CGAGCGCGCC
AAACGCCGGC TCCTTGTGAT CGCGCTTGGC ACGGCGGCGT TCAGCATGGA GGACATTCTG
CTTGAACCGT ATGGCGGGCA GGTGCTGCAC TTGCCGGTTG GCGCGACGAC CGCGCTGACC
GCGATGCTGG CGATCGGAAG CATTTGCGGC CTTTGGCTCG CCGCGCGGCT TCTTGGCGGG
GGCGCCGATC CGCACCGCGT GTCGGCCTAT GGGCTGCTCG CGGGGCTCGC CGCCTTCAGC
GCTGTGATCT TCGCCGCCCC GCTCGACTCC GCCCGCCTGT TCGGCGTGGG AACAGTGCTG
ATCGGCTTCG GCGCCGGACT GTTCGCCCAT GGCACGCTGA CCGCGACAAT GAACCAGGCC
AGGCGCGACG CCGCCGGAAT GGCGCTCGGC GCCTGGGGGG CGGCGCAGGC GAGCGCCGCC
GGGCTCGCGA TCGCGCTCGG AGGCGCCATC GCCGACGGGG TATCGACGTT CGCCGCGCAA
GGAGCGTTCG GGCCGACTGT CGCCGGTCCG GCCACGGGTT ACATAGCCGT TTATATGATC
GAGCTCATGC TGATGTTCGT GACCCTTGTC GCAATCGGCC CTCTCGTGCG CCACGACGCA
CGAGAGGGCG GTGCGGCAGG CGCATTCGAG CTCGGCAAGA GTGCTGGTTG A
 
Protein sequence
MTKLNEKLAR NWKRLSPSLL PFADAATVEL PLGQLLRLSL FQVSVGVSIV LLVGTLNRVM 
IVELGVPAWL VALMVSLPLV FAPFRALVGF RSDAHRSALG WRRVPYLWLG TLIQFGGLAI
MPFALIVLSG DSNGPVWVGH VAAALAFLLV GAGLHTTQTV GLALATDLAP AHARPRVVAL
LCMMLLVGML ASALAFGALL ANFSELRLIR VVQGAAVLTM GLNMIALWKQ EPRRALQPLI
MPRPSFAASW NAYLRQSERA KRRLLVIALG TAAFSMEDIL LEPYGGQVLH LPVGATTALT
AMLAIGSICG LWLAARLLGG GADPHRVSAY GLLAGLAAFS AVIFAAPLDS ARLFGVGTVL
IGFGAGLFAH GTLTATMNQA RRDAAGMALG AWGAAQASAA GLAIALGGAI ADGVSTFAAQ
GAFGPTVAGP ATGYIAVYMI ELMLMFVTLV AIGPLVRHDA REGGAAGAFE LGKSAG