Gene Msil_1158 details

Gene Information

Locus tag: Msil_1158
Symbol: mdoG
ID: 7093921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1245160
End bp: 1246743
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 65%
IMG OID: 643464499
Product: glucan biosynthesis protein G
Protein accession: YP_002361489
Protein GI: 217977342
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID:
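
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1245160
end_bp = 1246743
gene_length_bp = 1584
protein_length_aa = 527

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons;
# the last codon is the stop codon and adds no amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1584 0 527
```

Under these assumptions the three fields agree: 1584 bp is 528 codons, which gives 527 amino acids once the stop codon is excluded.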


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.025306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTGT TGAACCGCCG CACTCTCGTC ACGGGCCTGC TCGCTTCGAG CGCGTTGACG 
ACGCATCTTG CCTCTGCAGC CTCCCAGGCC GGCCAGCCCG CGCCGGCCGC CTCGCCCGCG
CCGCAGCCGA AATTCGATTT CGACGACGTG CTGCGCCGCG CGAAGGACCT CGCCTCGGCC
CCCTTCGACG CGGCGATCGC GCCTCTGCCG GAGGCGCTGA ACAAGCTCGA CTTCGACGCC
TGGCGCGACA TCCGCTTCCG GCCGGACAAG GCTTTCCTGA ACAGCCCCGG CAGCCAGTTC
CGGCTGCAGC TGTTTCATCT CGGACATCTC TACAAGCGCC CGGTTACGAT CAACACCATC
CGCGACGGCA TCCCGACGCC GATCCCCTTT ACCACCAGCC TGTTCGACTA TGGACGGACG
AAGCCGGAGA AGCCGATTCC GGTCAATCTC GGCTTCGCCG GGTTCCGGCT GCACTATCCG
CTGAATTCGC CGCGCGTTTA CGACGAGGTC ATCGCCTTTC TCGGCGCGAG CTATTTCCGC
TTTCTCGGCC GCGACCAGCA TTACGGCATA TCGGCGCGCG CGCTCGCCAT CGGCGCCGGC
GGCGAGGAGG AGGAATTTCC GTTCTTCCGC GAATTCTGGA TCGATTCGCC CGAGGTCAAC
GCCGACCGCA TCACGATCTT CGGCCTGCTC GACAGCCCCT CGACGACGGG AGCCTACCGG
TTCGACCTGT TTCCCGGCGT CGAGACGGCG ATGGAGGTGT CGACCGTCCT ATATCCGCGC
AAGGCCGGCG TCCGCTTTGG CCTCGCGCCG CTGACCTCGA TGTTTTTTCT CGGCGAGAAC
GACCGCCGCT TCAACGAGGA TTTCCGCCCC GAACTGCATG ATTCGGATGG GCTTCTCATC
CATTCGGCGA CTGGCGAATG GATCTGGCGG CCTTTGCGCA ACCCGACCAA GCCCGTCATC
TCCTCTTTCT TCGATCGCGA CGTTCGCGGG TTCGGCCTGC TGCAGCGCGA TCGCGAGTTC
GACCATTATC AGGACCTCGA TCTCGCCTAT GAGCGGCGCC CGAGCTATTT CGTCGAACCG
CGCGAAAGCT GGGGCGAAGG CCATGTCGAT CTCGTCGAGC TGCCGACCGA GCATGAGGCC
AACGACAATA TCGTCGCTTT CTTCACGCCG AAGGATTCGC CCGAGGCCAA TAAGCCTTTC
AGCTACGCCT ATCGCCTCGT CTCCAGCCTC AATCTGACGC GGCTGTCGCC GAACGGACGC
GCGCTCAATA CCTATCAGAC GACGGCCGCC GCGCTTGGCT CCGCCGAGGC TCCGGCCCCC
GGCACGCGCC GCTTCATCAT CGATTTCACC GGCGGCGATC TTCCCTTCTA CGCAACGGAT
CCCGGCTCGG TCGAGGTCGT GCCCTCGACC AGCCAGGGCA AGATCGTGCG CTCGTTTCTG
GTGCCGAACC CGCATGTCAG GGGATTTCGC GCCGCATTCG ACGTCCAGCT CGACGGCGGT
CAATCGGCGG ATCTCCGCGC ATTCTTGCGG CGCGGATCGC AGGCGCTCAC TGAGACCTGG
ACCTATCCCT GGCGGCCGGA CTGA
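
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that clean-up followed by a GC fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers and not part of any database tooling. Only the first display line is used as a short excerpt; pasting the full sequence would allow a comparison with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, used here as a short excerpt.
excerpt = clean_sequence(
    "ATGACCTTGT TGAACCGCCG CACTCTCGTC ACGGGCCTGC TCGCTTCGAG CGCGTTGACG"
)
print(len(excerpt))                      # number of bases in the excerpt
print(f"{gc_fraction(excerpt):.0%}")     # GC fraction of the excerpt only
```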
 
Protein sequence
MTLLNRRTLV TGLLASSALT THLASAASQA GQPAPAASPA PQPKFDFDDV LRRAKDLASA 
PFDAAIAPLP EALNKLDFDA WRDIRFRPDK AFLNSPGSQF RLQLFHLGHL YKRPVTINTI
RDGIPTPIPF TTSLFDYGRT KPEKPIPVNL GFAGFRLHYP LNSPRVYDEV IAFLGASYFR
FLGRDQHYGI SARALAIGAG GEEEEFPFFR EFWIDSPEVN ADRITIFGLL DSPSTTGAYR
FDLFPGVETA MEVSTVLYPR KAGVRFGLAP LTSMFFLGEN DRRFNEDFRP ELHDSDGLLI
HSATGEWIWR PLRNPTKPVI SSFFDRDVRG FGLLQRDREF DHYQDLDLAY ERRPSYFVEP
RESWGEGHVD LVELPTEHEA NDNIVAFFTP KDSPEANKPF SYAYRLVSSL NLTRLSPNGR
ALNTYQTTAA ALGSAEAPAP GTRRFIIDFT GGDLPFYATD PGSVEVVPST SQGKIVRSFL
VPNPHVRGFR AAFDVQLDGG QSADLRAFLR RGSQALTETW TYPWRPD
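
The relationship between the two sequences can be illustrated by translating the nucleotide sequence codon by codon. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares for sense codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` function is a hypothetical helper written for this example. It raises `KeyError` on ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final display line of the gene sequence.
print(translate("ATGACCTTGT TGAACCGCCG CACTCTCGTC"))  # MTLLNRRTLV
print(translate("ACCTATCCCT GGCGGCCGGA CTGA"))        # TYPWRPD
```

Both outputs match the start and end of the protein sequence listed in the record, and the final TGA codon is read as the stop.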