Gene Mpe_A0726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0726 
Symbol 
ID4784972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp753014 
End bp754063 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content69% 
IMG OID640089287 
Productputative glycosyltransferase protein 
Protein accessionYP_001019923 
Protein GI124265919 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCT CCTTGCCGAA GATCAGCGTG GTGATCCCAT GCTTCAACTA CGCCCGCTAC 
GTGGGGCAGG CCATCGAGAG CGCGCTGGCG CAGGCGCATC CCGATACGGA GGTGGTGGTC
GTCAACGACG GCTCCACCGA CGGCTCGCTG GCGGTGATCG AGCGCTACGC GCAGCGGGTG
GTGGTGATCG ACCAGGTGAA CCAGGGCTCC ATCGCCGCCT ACAACCGCGG CTTCTCGGAA
TCGAGCGGCG ACGTGGTGAT CTTCCTCGAC GCCGACGACC TGCTGGAGCC CGGCGCGCTG
GCCGCCGTGG CGGCGGCCTG GACGCCGGCC TGCGCCAAGC TGCAGTACGA CCTGAAGATC
ATCGACGCCG AAGGCCGCGA CACCGGCCGC CGCTTCTGCA ACTTCGCCAA CGGCTACGGC
ACGGCCGAGG CCCGCAGCGC CTTCCTGCGC ACCGGCACCT ACCGCTGGCC CGTGACGACC
GGAAACGCCT ACTCGCGCTG GTTTCTCGAA CCGATGTTTC CGCTGCGCAT CGAGCACGGC
CCCGATGGCC ACCTGAACAC CGTGGCACCG GTGTACGGCG ACGTGAAGGT GCTGCCGCAG
GTGCTGGGCG CCTACCGGGT GCACGGCGCC AACATGTGGT CCAGCGACGG CTCCGACCAT
TCGCGCCTGC CCTTCCGCAT CCACACCCGC CAGCGCGAAG TGGCCTTCAT GCAACTGCAC
GCGCAGCAGC GCGGTGTGTT CCTGCCGGCC GGCAACGTGC TGGATCGGGA ACTGCCTTTC
CTCAACTACC GGCTCATGGC GCTGAAGCTC GGCCTGGCCT ACACCGGCCA GGAGCACGAC
TCGCCCTGGT CGCTGGTGCG GCGGGCTTGG TCGCTCATCG TGTCGGAGCC CATGTCGCTC
AAGCACCGCG TGGGCCACCT CGGATGGTTC GGCGTGCTGG CGCTCGCACC GCGGCAGGCG
GTGCCGGCGC TCTTGCACGT GCGCTTCAAC CGCAGCGAAC TGCTTCAGTC GCTGCGGCGC
TCCGTGGGGC TGGCGCCCGT GCGCACCTGA
 
Protein sequence
MTPSLPKISV VIPCFNYARY VGQAIESALA QAHPDTEVVV VNDGSTDGSL AVIERYAQRV 
VVIDQVNQGS IAAYNRGFSE SSGDVVIFLD ADDLLEPGAL AAVAAAWTPA CAKLQYDLKI
IDAEGRDTGR RFCNFANGYG TAEARSAFLR TGTYRWPVTT GNAYSRWFLE PMFPLRIEHG
PDGHLNTVAP VYGDVKVLPQ VLGAYRVHGA NMWSSDGSDH SRLPFRIHTR QREVAFMQLH
AQQRGVFLPA GNVLDRELPF LNYRLMALKL GLAYTGQEHD SPWSLVRRAW SLIVSEPMSL
KHRVGHLGWF GVLALAPRQA VPALLHVRFN RSELLQSLRR SVGLAPVRT