Gene Mpe_A1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1065 
Symbol 
ID4785520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1137301 
End bp1138695 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content60% 
IMG OID640089627 
Productpolysaccharide biosythesis protein, putative 
Protein accessionYP_001020261 
Protein GI124266257 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.624133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAAG ACCGCAGCTA CAAGAGAACG TTCTACAGCG CGCCGCAGTC GGTGACCTCG 
CTGGTTGCCG CGTTTCTCGA GCCGACGGTA TCGGTGGTGA CCTATGTCGC GGTCAGCCAT
TGGTTCGATG ATCCTATCCT TCGTGCGTCG CTTACGCTGT GCCTGCTCGT CTTCGTGCTC
ACGTTCCCGG GGCGCAATAG GTTCCGAGAC AACATGGTCG CGGCCGGCAT CGACATCGTG
TCGTCCTGGC TCGTCTTGCT GGCCATCTTG GCATTGTGCG GGTATGCAAC ACGTAGCTTG
CAGTTCTTCA GCCGCGACGT CCTGATCAGC TGGGCCCTGC TCGTCCCCGT CTTGCAGTGG
GTCGCCGTCT GGATCGGCAA GGTCGTCATC CGGCGCCGCT CCGCACTGCC CGAAGCTCGA
CGTACTGCAG TGGTGGTCGG CGCAAGCCCG CTCGGCGTCA AGGTCGCTCG AGCCCTCACC
ACGGGCGGCG ACGCCGGCAT CGATTTCGTC GGCTACTTCG ACGACCGAAC AGACGAGCGC
GTGCATGTCG ACGGAATGGC CAAACGTCTG GGCGGACTGG CCGACGTGGC GTCGTATGTT
TCGACCCACG GCATCCGCGA GGTCTACATC ACGTTGCCGC TGGGCTCACA GCCTCGCATC
GTCGAACTTC TCGAATCGGT GCAGGGGACG ACGGCCTCGC TCTACTTCGT GCCGGACGTC
TTCGGCATCA GCATCATTCA AGGGCGATTC GAGGACGTCA ACGGGGTCCC AGTCGTCGGC
ATTTGCGAGA CGCCCTTCAC TGGCACTAAC GATCTGGTCA AGCGAGTGAG TGACATGGTG
ATCGCGTCGA TCATTCTGGT GTTGATCTCG CCAGTCCTGA TCGCCGTCGC CATCGGCGTC
AAGCTCAGTT CGCCGGGCCC GATCATCTTT CGGCAGAAGC GCAACGGTCT CGACGGTGAC
GAGATCACCG TGTACAAGTT TCGCTCAATG ACCACTCAAG ACAACGGTGC GGTCATCAAG
CAGGCCACCA AGGGCGACAG CCGGATCACG AAGTTCGGTG CCTTCATCCG ACGCACTTCG
TTGGACGAGC TACCGCAGTT CTTCAATGTC CTCCAGGGCC GCATGAGCAT CGTTGGTCCG
CGCCCGCACG CGGTCGCCCA CAATCAGATG TATCGCGAGT TGATAAAGGC CTATATGGTT
CGGCACAAGG TCAAACCCGG CATCACGGGT TGGGCCCAAG TCAACGGCTT CCGTGGCGAA
ACGGACACGG TCGAAAAAAT GCAAGCTCGT GTCGAATACG ATTTGGAGTA TCTACGCAAT
TGGTCGCTCG CTCTTGATCT GCAGATCATC ATCCGGACGG TCCGCATGAT GTTCTTCGAC
AGGAACGCGT ACTGA
 
Protein sequence
MFEDRSYKRT FYSAPQSVTS LVAAFLEPTV SVVTYVAVSH WFDDPILRAS LTLCLLVFVL 
TFPGRNRFRD NMVAAGIDIV SSWLVLLAIL ALCGYATRSL QFFSRDVLIS WALLVPVLQW
VAVWIGKVVI RRRSALPEAR RTAVVVGASP LGVKVARALT TGGDAGIDFV GYFDDRTDER
VHVDGMAKRL GGLADVASYV STHGIREVYI TLPLGSQPRI VELLESVQGT TASLYFVPDV
FGISIIQGRF EDVNGVPVVG ICETPFTGTN DLVKRVSDMV IASIILVLIS PVLIAVAIGV
KLSSPGPIIF RQKRNGLDGD EITVYKFRSM TTQDNGAVIK QATKGDSRIT KFGAFIRRTS
LDELPQFFNV LQGRMSIVGP RPHAVAHNQM YRELIKAYMV RHKVKPGITG WAQVNGFRGE
TDTVEKMQAR VEYDLEYLRN WSLALDLQII IRTVRMMFFD RNAY