Gene Msil_0640 details

Gene Information

Locus tag: Msil_0640
Symbol:
ID: 7093721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 694151
End bp: 695407
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 68%
IMG OID: 643463975
Product: glycosyl transferase group 1
Protein accession: YP_002360974
Protein GI: 217976827
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
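
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(694151, 695407)
print(length)                  # 1257
print(protein_length(length))  # 418
```

Both results agree with the Gene Length and Protein Length fields listed in the record.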


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0857805
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCAG GCTCAAGCTC ATATTTTCGT GACGGGACGG CGCATCCGCT CGCGGGCCGG 
AGCGTGCTCC AGATTGTGCC CGACCTCGAT GACGGCGCGG CGGCGCGCAC AACAATCGAG
ATCGCGGCGG CTCTGACCCT CGTTGGCGCG AACGCCTTTG TCGCCGCGCG GGGCGGCAGT
CTCGTGAGCG AATTGCAGGC GCGCGGCGGA CTGTTCGCGC CCCTTCCCGC AGACGCCAAA
AACCCCCTCA CGATGGCGAT CAATGTGGAA AGGCTGGCGC GCCTTATCAA GGCGGAACGG
ATCGATCTCG TGCATGCGCG CTCACGCGCC TCAGCTTGGT CAGCCTATGC CGCAACCCGC
ATTCTGAAGA CGCCCTTTGT GACAAGCTTT GAAAGCTCCT ATGCCGTGGG CGGACCGCTC
GCGCTGCGCT ACAATTTCGT GATGACGCGC GGCGACGCGA TCATCGCCGG TTCGGCCGAA
GCGGCGCATG GCGCGGCGCA TCTCAATCCG GCGGCGAAAG ACAAAATTCA TGTCATCCTC
GGCGGCGTCG ACTGCCGGGT CTTCTCGCCG AAATCGACGC CTCCGGCGCG GGTCCAGGCG
GTCCGGCGGC TGTGGGGCGC CCCGCCCGAC GCCAGGGTGG CGCTGATCGC GCTCGGCCCC
AAGCCCGCCG GAGACTGCAA GGCGGCGCTG GACGCCATCC GAATGCTGGC CGAGCAAATC
CGCGCCGAAT CGTCTGACGC AGCCTTCGAC GTCTCGAGCC TTCGGGTCAT CATCGGCGCC
GCCAGCGCCA CCGCAACGGA GATCAAGGAG ATCGACGCCA TCGTCGCGGA CTCCGGTTTG
CAGGACATTG TGCAGCGGGG CGACATCGTT TCCGATCCGG CCGCCGCCTT GCTGGCCGCC
TCGGTCATCA TGGCACAGTC GAGCAATCCG GCGGCCTTCG CGAGCCTCGC TCTCGAGGCG
CAGGCAATGG GAGCGCCGAT CATCGCAACC ACAGGGGGGG CGGCCGCCGA AACTCTGCTC
GCCCCGCCGG AGGTCGAGCC CAGCGCGCGG ACCGGCTGGC GCGCGCCGAC CGGCGATCCG
GGCGCGAGCG CCATAGCGCT GAGTGAGGCA TTGAGCCTTG GCGCCACGGC GCGCGAACGG
CTTTCGCTGC GCGGGCGCGC TCACGTCGAG CGGCGGTTCG CGATGGAGCT AATGTGGGAG
CAGACGCTCG ACGCCTATGC GGCGGGGCTC GACGCCGTCC GTAAGCCGAC CAATTGA
 
Protein sequence
MSSGSSSYFR DGTAHPLAGR SVLQIVPDLD DGAAARTTIE IAAALTLVGA NAFVAARGGS 
LVSELQARGG LFAPLPADAK NPLTMAINVE RLARLIKAER IDLVHARSRA SAWSAYAATR
ILKTPFVTSF ESSYAVGGPL ALRYNFVMTR GDAIIAGSAE AAHGAAHLNP AAKDKIHVIL
GGVDCRVFSP KSTPPARVQA VRRLWGAPPD ARVALIALGP KPAGDCKAAL DAIRMLAEQI
RAESSDAAFD VSSLRVIIGA ASATATEIKE IDAIVADSGL QDIVQRGDIV SDPAAALLAA
SVIMAQSSNP AAFASLALEA QAMGAPIIAT TGGAAAETLL APPEVEPSAR TGWRAPTGDP
GASAIALSEA LSLGATARER LSLRGRAHVE RRFAMELMWE QTLDAYAAGL DAVRKPTN
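
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and the protein sequence re-derived. It assumes that the amino acid assignments of translation table 11 are the same as in the standard code (the two tables differ only in which codons may act as start codons), and it does not handle alternative start codons, since this gene begins with ATG. The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
head = clean("ATGTCATCAG GCTCAAGCTC ATATTTTCGT")
print(translate(head))  # MSSGSSSYFR
```

The ten residues produced match the first block of the protein sequence listed above. Applying `gc_content` and `translate` to the full cleaned gene sequence allows comparison with the GC content and protein sequence given in the record.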