Gene Msil_3537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3537 
Symbol 
ID7092394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3885444 
End bp3886910 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID643466828 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_002363788 
Protein GI217979641 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0569176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTTTA ATCGCTTAAG CCAGATTGCT GAATCCGAGA TTGACGGGGA CGTGGTAGCC 
GCTTCCTCTG CAAAACTTTA TGTATCCTAT AAAAACATTG AGGTGATCGC GGGCTGCGTC
GATATATTCT TGATTACCCT GTCCAGCGTA TTGGGCGTCC TTTTCTATCA GTATATTTGG
TCGGGCGAGA GCGCCCCTAT CGAAATAAGC CTCGGCGTCG GGCTGTCTCA GGCGTTGCTC
TATACCTATG TCGCCAGCTC CCGCGGTCTT TACCGCCTGC CGGTTTTACT TGCGCCCTCG
CGATATTTAG GCCGCATCTT CATGACCTGG GCGGTCGTCG TGCTGTTTGT GGCGATCTTT
CTGGTTTTCC TGCGCGGCGA GACCGTGCTG CCGCGCGGAG CCATGACGGC CGCCGCCGTC
ATGCAGATTT CTCTGCTTCT CGTCGCGCGC TGGCTCGCCG AGAAGACGTC GCGGTCGCTG
ATGGCCAGGG GCGGCCTCGC CGGCCGCCGG GTCGTCACGC TCGGCGAGCC CTCCGAACTG
CTGCGGCTCT CGACGGCTGT CCTGTTCCGT TATTTCGGCC TCACCGAAGT CGCCCGCGTC
TCATTGGCGA GCGGAAGCGG GGCATCCGCG GAGGATGTGC TGGTGGATCT CGACCGCGCC
TTGCACGAGG CTCGTGAATC TCACGCCGAT CAATTCGTGG TGGCTCTGCG CTGGAACAAC
GCCGCGCTGC TCGAAACGGT GCGGGAAAGA TTGCGCGCGT CGCCGCTGCC CGTGGAGCTT
CTTCCCGATT ATACGATTCG TTCGGTTCTG GGACGCCGCC TGCTGTCGAC CAGCGGGCCT
GGCCTGACGC TCGAAATCCA GCGGGCGCCG CTGACCCGCG TCGAGCAGAC GATCAAGCGC
ACGCTCGACA TCGTCTGCTC CTCAATCGGC ATTGTGCTCC TATCGCCGCT GTTCGTCATG
ATCGCCGTTT TGATCAAGCT CGACAGCAAA GGTCCCGTCA TCTTCAAGCA ACGCCGCAAC
GGCTTCAACG CCCGGCAGTT CCAGATTTAC AAATTCCGTT CGATGACGGT GCAGGAAGAC
GGCGACAAGA TTGTGCAGGC GAGACGCAAC GACCGCCGCG TGACGCGCGT GGGTCAGTTT
CTGCGCCAGT CCAGCATGGA TGAGCTGCCG CAGCTGTTCA ATGTGTTGAA GGGCGATATG
TCGCTGGTGG GCCCGCGCCC GCACGCCCTC GCGCATGACA ATGAATATAA GGTGCTGATC
GCGAAATATG CGTTCCGCCA TCACGTCAAG CCGGGGATCA CCGGCTGGGC GCAATGCAAT
GGCCTGCGCG GGGAAACCGG CCAGCTCGAG CAGATGATCG AGCGCGTCAA ACTCGACCTC
TGGTACGTCA ACCATTGGTC GATCGCGCTC GACATCAACA TCCTGCTGCG CACCTGCTTT
GAAGTGCTGC GCAACCGCGC CTATTGA
 
Protein sequence
MYFNRLSQIA ESEIDGDVVA ASSAKLYVSY KNIEVIAGCV DIFLITLSSV LGVLFYQYIW 
SGESAPIEIS LGVGLSQALL YTYVASSRGL YRLPVLLAPS RYLGRIFMTW AVVVLFVAIF
LVFLRGETVL PRGAMTAAAV MQISLLLVAR WLAEKTSRSL MARGGLAGRR VVTLGEPSEL
LRLSTAVLFR YFGLTEVARV SLASGSGASA EDVLVDLDRA LHEARESHAD QFVVALRWNN
AALLETVRER LRASPLPVEL LPDYTIRSVL GRRLLSTSGP GLTLEIQRAP LTRVEQTIKR
TLDIVCSSIG IVLLSPLFVM IAVLIKLDSK GPVIFKQRRN GFNARQFQIY KFRSMTVQED
GDKIVQARRN DRRVTRVGQF LRQSSMDELP QLFNVLKGDM SLVGPRPHAL AHDNEYKVLI
AKYAFRHHVK PGITGWAQCN GLRGETGQLE QMIERVKLDL WYVNHWSIAL DINILLRTCF
EVLRNRAY