Gene Msil_0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0689 
Symbol 
ID7091920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp751409 
End bp752602 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content61% 
IMG OID643464024 
Productglycosyl transferase group 1 
Protein accessionYP_002361022 
Protein GI217976875 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.997173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATTT TGCACATCAT TCCGACGTGT AACCCCGAAT ATGGCGGCCC TATCGAAGGC 
ATCTTCACTT CGGCGCCGGC CCTGCGCGCG CAGGGTTGCG ACCGCGAGAT CGTGTCGCTC
GACATGCCGA CCGATCCATG GGTGAAAACG TCGCCGGTGC GCGTGTACCC CATGGGGAAT
CCTAGCCCAG CCTATCACGC CTGGAAGAAG CGCATTCCGT TCCTCCGCTA CGGCTATAGT
CCGGCGATTG TTCCTTGGAT TCGGGAAAAC GCCAAGCGCT ATGACGCGGT CATCGTCAAC
GGTCTTTGGA ATTTCGCCTC GCTCGCCGCG CGGCAGGCGC TGGTCGGCAC CGATACGCGA
TATTTCGTTT ACGCGCATGG GATGCTCGAC CCCTATTTCA ACAAGATTTC CCCCGTCAAA
GCCTTCTTCA AGCAGTTGCT CTGGTGGGCC AGCGAGGGTC GGCTGATCAA CAATGCGACG
TCCGTCATGT TCGTGACGAA AGAGGAGCGC GAACTGGCCA AGACCTCCTT TTGGCCCTAT
CGGGCGCGGG CGCGCGTGGT GCCTTATGGA ATCGTCGACG TCAGCGGCGA CGCAGAGGCC
CAGATCAAGA GCTTTCGCGC CGCCCTTCCG CAGCTCGGCG AGCGCCGTTT TCTGCTGTTC
CTCAGCCGGA TTCACCCCAA GAAGGGATGC GACATCCTGG TCGAAGCCTT CGCCAAGATG
GCGGGCGGGG ATCCCGACCT CGATCTGGTG ATCGCCGGTC CGGACTCGGT CGGGGCCGTT
AAAGAGCTCC AGGAGGTCGC GGCGCAGCGC GGCGTGGCTG ATCGCATCCA CTGGCCCGGC
ATGCTGAAGG GCGATCTGAA ATGGGGCGCC TTTCGCGCCT GCGATGGATT CATCCTGCCT
TCGCACCAGG AAAACTTCGG CATTGTCATC GCCGAGGCGC TCGCCTGCGG CAAGCCGGTG
CTGACCACAG ACAAGGTCGC CACTTGGCGC GAGGTGGCTG ACAATAATGC CGGATTCGTC
GAAAATGACG ACCTTCCTGG CGTCACCCGG CTGATCGAGC ATTTTTTGAG CCTTTCGCCC
CTCGAAAAAC AGGAAATGAG CAAACGGGCG CGGGCGACCT ATCTTACGAA GTTCGACATG
GGCAGCATGG CTCCGGAACT GATCGAGGCT TTCAGGACTT CGCAAGCCGC ATGA
 
Protein sequence
MIILHIIPTC NPEYGGPIEG IFTSAPALRA QGCDREIVSL DMPTDPWVKT SPVRVYPMGN 
PSPAYHAWKK RIPFLRYGYS PAIVPWIREN AKRYDAVIVN GLWNFASLAA RQALVGTDTR
YFVYAHGMLD PYFNKISPVK AFFKQLLWWA SEGRLINNAT SVMFVTKEER ELAKTSFWPY
RARARVVPYG IVDVSGDAEA QIKSFRAALP QLGERRFLLF LSRIHPKKGC DILVEAFAKM
AGGDPDLDLV IAGPDSVGAV KELQEVAAQR GVADRIHWPG MLKGDLKWGA FRACDGFILP
SHQENFGIVI AEALACGKPV LTTDKVATWR EVADNNAGFV ENDDLPGVTR LIEHFLSLSP
LEKQEMSKRA RATYLTKFDM GSMAPELIEA FRTSQAA