Gene Msil_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1996 
Symbol 
ID7094194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2166473 
End bp2167618 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content66% 
IMG OID643465322 
Productglycosyl transferase group 1 
Protein accessionYP_002362300 
Protein GI217978153 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATCG CCATTCTCGC CCATTCCACC AATCCGCGCG GCGGCGTCGT CCATGCGCTG 
GCGCTCGGCG ATGCGCTGAC GCGGCTTGGC CATGGGGCGG TCGTCCACGC GCCGGACGCG
GCGGGCGCGG GCTTCTTCCG CAAGACGCTG TGCGACACGA TTCTGGTTCC GGCGACGCCT
TCCGGCCCCG GAGTGACCGG ATTGGTCGAA CGCCGCGTCG CCGATTATGT CCGCCATTTC
GAGGCGCCGG CGCATCGCCG CTTCGACGTC TACCATGCGC AGGACGGAAT TTCCGGCAAT
GCGCTCGCGA CGCTGAAGCA GCGCGGGCTG ATCCGCGATT TCATCCGGAC TGTTCACCAC
ATTGATGATT TCGCCGACCC GAGATTGCGC GCCCTACAGA AGCGCTCCAT CACACAGGCA
GGGCGCCATC TCGTCGTCAG CCACGCCTGG CGCAACGCGC TTGCCCATGA CTTCGGGGTT
GAGGCGGCGA TCGTCGGCAA TGGCGTCGAC AGGCGCTGCT TTTCGCCAGC TCGAGACGGG
AGCGAATCCG CGCTGCGCGA AAACCTCGGC CTTGGCGCGG GGCCGGTTTT TCTCTCCATC
GGCGGCGTCG AGGCGCGCAA GAATTCGCTA TGCATGCTCA GGGCCTTTGC CCGCCTCCAG
AGGCGGCTGC CTTCGGCGCA GCTCGTCATC GCCGGCGGCG CCTCGCTGCT CGACCATGAC
GCCTATCAGC GGCAATTTTC TGACGCGCTG ACGGAGCTTC GCCTGCCGCC CGGCGCCGTG
ATCCGCACCG GGCCGCTGGC GCAGGCCGTC ATGCCGGCGC TCTACAGGCT GGCGGATGGG
TTGGTGTTTG CCTCGCTCAA AGAGGGCTTC GGTCTGGCAG TGCTGGAAGC CATGGCGTGT
GGCGTTCCGG TCATCGTCTC CGAGATCGCG CCCTTCACCG AATATCTTGG GCCTGACGAC
GCCGCCTGGT GCGATCCGCT CGATGTTGAC TCCATCGCGC GCGCTATGAC GGCGGCATTG
CGCCCTCAGC TTCGCGCTCA ACTCATAGAG AATGGATTCG CTGCGGCCGC GCGGCATGAT
TGGGACGCGA CGGCGCAAGC GCATCTTGCC AGCTATGAAA GCCTGAAGGA AACCGCCGAT
GCCTGA
 
Protein sequence
MRIAILAHST NPRGGVVHAL ALGDALTRLG HGAVVHAPDA AGAGFFRKTL CDTILVPATP 
SGPGVTGLVE RRVADYVRHF EAPAHRRFDV YHAQDGISGN ALATLKQRGL IRDFIRTVHH
IDDFADPRLR ALQKRSITQA GRHLVVSHAW RNALAHDFGV EAAIVGNGVD RRCFSPARDG
SESALRENLG LGAGPVFLSI GGVEARKNSL CMLRAFARLQ RRLPSAQLVI AGGASLLDHD
AYQRQFSDAL TELRLPPGAV IRTGPLAQAV MPALYRLADG LVFASLKEGF GLAVLEAMAC
GVPVIVSEIA PFTEYLGPDD AAWCDPLDVD SIARAMTAAL RPQLRAQLIE NGFAAAARHD
WDATAQAHLA SYESLKETAD A