Gene Msil_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0984 
Symbol 
ID7093663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1065740 
End bp1066951 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content60% 
IMG OID643464323 
Productglycosyl transferase group 1 
Protein accessionYP_002361315 
Protein GI217977168 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.0137415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACA AGCCTGCGGA GTGTTTGACC ATCGGCGCGA ACGCAATCCA TTCCCATTTG 
GCATTTACGA AGATGCGCAC CGTCGCTTGC GCTCTCCAGA CCTATTCCAA GGTTGGAGGA
CTGCAAAGCT TCAATCGCAG GTTGTTTCAA AATTTGGGCC GCCGCGCCCT CGACGCCGGC
GAGCTCCCTG TGCGCGCCTT TGTCGATGAT GACGCCGATG TGGCACTCCC CACGCTTCCC
GGCGTCGAAT TGGTCGCGCC GAAATCAAGG CTTGCATTTT TCGCTGGAGC TTTCTGGAGC
GGCGTATTCG AAGCCGATGC GCTTCTGGTC TGCCATATCA ACTTGTTGCC GCTGGCGATC
GCCGTTCGGC TGTTTCGTCC CCGTCTGCCG ATCGTGTTGT TTGTGCATGG TTTTGAAGCC
TGGAACAGCC AAAATCGGCC GCGCAAGCTG AGCGAACATC TATTCCTGAA AGCCGTGACC
CGAATCGTTT CCGTGAGCCG CTATACCGCC GCCGTAATGA GCCGCGAATT CGGCGTCCCG
CTCGAAAAAT TCCGCATCCT GCCGAATGCG GTCGATCATA TCGGGCTCGA GGTTCCGGCC
CCGGCGCGAC GGCCTTTCTC GATCTTGACC GTGACGCGCC TCAGCGCTGG GGAACGCGCC
AAGAACGTCC ATGAAATGAT CGCCGCCGTC GCCGCCTTGC GGAAGGTCCT GCCGGACGTG
TCCTATGAGA TCATCGGCGA AGGCGCGCTG CGTCCAGAGC TTGAAGCGCT CACGCGCGAG
CTTGGCGTCG ACGATGTGGT TTCCTTCCGC GGGCTTGTCG ACGTCGAAAC CCTGCAGGCG
GCCTATGCTT CGGCCTGCGT CTTCGCCATG CCGTCGGACA AGGAGGGGTT TGGCATCGTC
TATCTTGAGG CCTGGCAATA TGGCTTGCCG GTCATCTGCA GCATCCACGG CGCCGCGAGC
GAAGTCGTCA CGGACGGCGT CGAAGGTTTC GTGGTCGACC CGGCCGATAT TTCCACGCTG
ACGGCGCGGC TTCATGATTT GCTGTCGAAG CCGGATTTCG CGCGGGAGAT GGGCGAGCGT
GGGCGCCAGA AGGTCGAGGC AAAATATCTC AACGCCAATT TCCGCGTCGA TCTTTCCGTT
ATTCTCGACG AACTCGACGA CCCTGAAGGC GAGGGCGCGG TCGCCCGCCG CCATTCGCAG
CTCAAATTCT GA
 
Protein sequence
MFDKPAECLT IGANAIHSHL AFTKMRTVAC ALQTYSKVGG LQSFNRRLFQ NLGRRALDAG 
ELPVRAFVDD DADVALPTLP GVELVAPKSR LAFFAGAFWS GVFEADALLV CHINLLPLAI
AVRLFRPRLP IVLFVHGFEA WNSQNRPRKL SEHLFLKAVT RIVSVSRYTA AVMSREFGVP
LEKFRILPNA VDHIGLEVPA PARRPFSILT VTRLSAGERA KNVHEMIAAV AALRKVLPDV
SYEIIGEGAL RPELEALTRE LGVDDVVSFR GLVDVETLQA AYASACVFAM PSDKEGFGIV
YLEAWQYGLP VICSIHGAAS EVVTDGVEGF VVDPADISTL TARLHDLLSK PDFAREMGER
GRQKVEAKYL NANFRVDLSV ILDELDDPEG EGAVARRHSQ LKF