Gene Mpe_A3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3474 
Symbol 
ID4786292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3685083 
End bp3686351 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content75% 
IMG OID640092054 
Productglycosyl transferases group 1 protein 
Protein accessionYP_001022662 
Protein GI124268658 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.649202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAGCC GTCGCCTGCG CATCGGCCTG CTGACCCATT CGGTCCATCC GCGCGGTGGC 
GTCATCCACA CGCTGGAACT GGCCGATGCA TTGCACGAGG CCGGCCACGA GGTGACGGTG
ATGGCGCCGG CGCTGCCCGG GCAGGCGCTG TTCCGCACGC CGCGTTGCGC GGTCGAGCTG
GTGCCGGTGG CGCCGGCGCC CGCCGATCTG GCGAGCATGG TCGCGTCGCG CCGCGACGCC
TGCATCGACC ATCTCGCGCC GCGGCTGGAG CGCGGGGTTG GCTGGGACGT GCTGCACGCC
CAGGATGGCA TCGGCGGCAA CGCGCTTGCG ACGCTGCAGG AGCGCGGCCT GATCGACGGC
TTCGTGCGCA CCGTGCACCA CCTCGACCGC TTCGACGACG CGCGCGTGAT GGCCTGGCAG
GAACGCGCCT TCCTGCGCGC GCGCCAGGTG CTGTGCGTCA GCCAGACCTG GTGCGACACG
CTGCGGCGCG AGCACGGCGT GGCCGCGGCG CTGGTTCACA ACGGCGTCGA CCTGCAGCGC
TACGGCCGCC AAGCCGGCGC GGCCGATGCG CGCGTGCGGC GCCGCTTCGG CCTGCGCGTC
GGCGCGGCCC ACGACGCGCC GGTCTACCTG GCGGTGGGGG GCATCGAGGA GCGCAAGAAC
ACGGTGCGTG TGCTGCAGGC CTTTGCGGCC CTGCGGGCGC GGCAGCCGCA GGCGCAGCTG
GTGATCGCCG GCGGTGCCAG CCTGCTCGAC CACGACCGCT ATGCGCGCGA GTTCACCGAG
GCGCTGGCCG CCAGCGGCCT GCGCGTCGGG CCGGGGGCCG ACGTGGTGAT CACCGGCACC
GTCGCCGACG ACGAGATGCC GGCGCTGTTC CGTGCCGCCG ACGTGCTGGT GATGGCCTCG
CTGCGCGAGG GCTTCGGCCT GGTGGTGCTG GAGGCACTGG CCTGCGGCAC GCCGGTCGTG
GTGTCGCGCC AGGCGCCGTT CACCGAGTAC CTGCCGGCCG ACGAACGGCA CGGCGAGGCC
TGCTGGGCCG ACCCGCTGAA CCCGCTGTCG ATCGCCGACG CGATGGCGCG GGCCTGCGAA
CCGGAGCGCG CGCAGGCGCT GGCCCGGGCC GTGCCCGAGG TCTGCCGGCG CTACAGCTGG
ACGGCCAGCG CCGCGCGCCA CGTCGCGCTG TACCGCGCGA TGCGGGCGCT GGTCGGTCAT
GGCGTGCCGC TCGCCGCGGC GGTGCCCACC GAACCGGCCG CGATGGACGC CGCCCCTGTC
GTTTCCTGA
 
Protein sequence
MSSRRLRIGL LTHSVHPRGG VIHTLELADA LHEAGHEVTV MAPALPGQAL FRTPRCAVEL 
VPVAPAPADL ASMVASRRDA CIDHLAPRLE RGVGWDVLHA QDGIGGNALA TLQERGLIDG
FVRTVHHLDR FDDARVMAWQ ERAFLRARQV LCVSQTWCDT LRREHGVAAA LVHNGVDLQR
YGRQAGAADA RVRRRFGLRV GAAHDAPVYL AVGGIEERKN TVRVLQAFAA LRARQPQAQL
VIAGGASLLD HDRYAREFTE ALAASGLRVG PGADVVITGT VADDEMPALF RAADVLVMAS
LREGFGLVVL EALACGTPVV VSRQAPFTEY LPADERHGEA CWADPLNPLS IADAMARACE
PERAQALARA VPEVCRRYSW TASAARHVAL YRAMRALVGH GVPLAAAVPT EPAAMDAAPV
VS