Gene Mpe_A0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0829 
Symbol 
ID4786964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp869060 
End bp870364 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content71% 
IMG OID640089390 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001020026 
Protein GI124266022 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.112593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGCG GTGACGAGCA CCGCACCATG GACAAACTCA TCATTCGCGG CGGCCGCCGT 
CTCCACGGCG AGGTCGCGAT CTCGGGCGCC AAGAACGCGG CGCTGCCGGA GCTGTGCGCT
GCGCTGCTGA CGGCCGAGCC GGTGCGCTTC TCGAACGTGC CGCGCCTGCA GGACGTCTCG
ACCACGCTGA AGCTGCTGCG CAATATGGGT GCCGTCGCCG AGCGCAGCGA GAGCCGCCCC
GACGAAGTGA CCATCGACGC CGGCCCGGTC AGCAGCCCCG AGGCCCCCTA CGACCTGGTG
AAGACCATGC GGGCGTCGAT CCTGGTGCTG GGACCGCTGC TGGCGCGCTT CGGCGAGGCC
ACGGTGTCGC TGCCGGGCGG CTGCGCGATC GGCTCGCGGC CGGTGGATCA GCACATCAAG
GGCCTGCAGG CGATGGGCGC GCAGATCAGC GTCGAGCACG GCTACATCAT CGCGAAGGCG
CCCAAGGCCT CGGGCGGGCT GCGCGGTGCG CGCATCACGA CGGACATGGT GACCGTCACC
GGCACCGAGA ACCTGCTGAT GGCCGCGACA CTGGCCGACG GCGAGACCGT GCTCGAGAAC
GCCGCGCAGG AGCCCGAGAT CCCCGACCTG GCCGAGATGC TGATCTCGAT GGGCGCGAAG
ATCGAGGGCC ACGGCAGCAG CAAGATCCGC ATCCAGGGCG TCGCGCAACT GCACGCGCCG
CGCGGCGGGC ACCGCGTGGT GCCCGACCGC ATCGAGGCCG GCACCTTCCT GTGCGCGGTG
GCCGCGGCCG GCGGCGAGGT GCTGCTGAAG CACGCGCGCG CCGACCACCT GGACGCGGTG
ATCGACAAGC TGCGCGAGGC CGGCGTCCGC ATCGAGGCCG GCAGCGACTG GATCCGCGTG
GCCTCCGACG GCAAGCTGAA GGCGGTGGGC TTCCGCACCA GCGAGTACCC GGCCTTCCCG
ACCGACATGC AGGCGCAGTT CATGGCGCTC GACTGCATTG CCGAGGGCAC GGCGCGCGTC
ACCGAGACCA TCTTCGAGAA CCGCTTCATG CACGTCGATG AACTGGTGCG CCTGGGCGCG
AAGATCGAGG TCGACGGCCA CACGGCCATC GTCACCGGCG TGCCGCAGCT GTCGGGCGCC
ACGGTGATGG CCACCGACCT GCGCGCCTCG GCCAGCCTGG TGATCGCCGG CCTGGTGGCC
AGCGGCGAGA CCCTGGTCGA CCGCATCTAC CACCTGGACC GCGGCTACGA CCAGATGGAA
ACGAAGCTGC GCGCCCTGGG CGCGGACATC GAAAGAGTGA AATGA
 
Protein sequence
MQGGDEHRTM DKLIIRGGRR LHGEVAISGA KNAALPELCA ALLTAEPVRF SNVPRLQDVS 
TTLKLLRNMG AVAERSESRP DEVTIDAGPV SSPEAPYDLV KTMRASILVL GPLLARFGEA
TVSLPGGCAI GSRPVDQHIK GLQAMGAQIS VEHGYIIAKA PKASGGLRGA RITTDMVTVT
GTENLLMAAT LADGETVLEN AAQEPEIPDL AEMLISMGAK IEGHGSSKIR IQGVAQLHAP
RGGHRVVPDR IEAGTFLCAV AAAGGEVLLK HARADHLDAV IDKLREAGVR IEAGSDWIRV
ASDGKLKAVG FRTSEYPAFP TDMQAQFMAL DCIAEGTARV TETIFENRFM HVDELVRLGA
KIEVDGHTAI VTGVPQLSGA TVMATDLRAS ASLVIAGLVA SGETLVDRIY HLDRGYDQME
TKLRALGADI ERVK