Gene Mpe_A1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1453 
Symbol 
ID4783736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1570322 
End bp1571392 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content73% 
IMG OID640090020 
Producthypothetical protein 
Protein accessionYP_001020650 
Protein GI124266646 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.480657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGA TGATCGTCAC CGACGCCTGG GAACCGCAGG TCAACGGCGT GGTGCGCACG 
CTGAAGATGA CGCGCCGCGA ACTGCAGAAG CTGGGCCACG AGGTGGAGCT GCTGTCGCCG
CAGGGCTTCC GCAGCGTGCC CTGCCCCAGC TACCCGGAGA TCGAGCTGGC GCTCGCCACG
CGCAGCGGCG TGGCGCGGCG CATCGACGCC TTCGCGCCCG ACTGCCTGCA CATCGCCACC
GAGGGCCCGC TGGGCTGGCT GGCCCGCTCG GTGGCCCGCT CGCGCGGCTG GCCGTTCACC
ACCGCGTACC ACAGCCGCTT TCCCGAATAC GTGCACGCGC GCACGCGCCT GCCCACGGCC
TGGAGCTATG CGCTGCTGCG GCGCTTTCAC AATGCCGGGC TGGGCACGCT GACGCCCACG
CCGGCCATCG TGGACGACCT GCGCGCCCGC GGTTTCCGGC ATGCGCGCTG GTGGTCGCGC
GGTGTCGACC TGGGCCTGTT CAGCGCCGAG GGCGCGCGAC TGCCGCGCGC CGAGCACCCG
GTGTTCCTCT ACGTCGGCCG GGTCGCGGTG GAGAAGCAGG TCGACGCCTT CCTGAAGCTC
GACCTGCCCG GCGAGAAATG GATCGCCGGC GAAGGCCCGT CGCGGGCCCG CCTGGAGGCG
CGCTATCCGG GCGTGCGCTG GTTCGGCGTG CTCGACGGCC CGGCGCTGGC CACGCTCTAC
CGCAGCGCCG ACGTGATGGT CTTCCCGAGC GTCACCGACA CCTTCGGGCT GGTGATGGCC
GAGGCCATGG CCTGCGGCAC GCCGGTGGCC GCCTTCCCGG TGCCGGGCCC GATCGACGTG
GTGGGCCGCT CCGGCGGCGG CGTGCTGCAC ACGGACCTGC GCGAGGCCTG CCTGCGCGCG
CTGCAGCTGC CGCGCGACGC GGTGCGCCGC CACGGCGAGC AGTATTCGTG GGCACGCGCG
ACGCAGCAGT TCCTGGCCGC ACTGCGGCCG ATCGACGGCC ACCGAGCGTC GGAGGAGACG
CTCAGCGCGG CGGAGCGTTC GCCGAACGTC GGCGCCCGAG ATCCTGCATG A
 
Protein sequence
MKLMIVTDAW EPQVNGVVRT LKMTRRELQK LGHEVELLSP QGFRSVPCPS YPEIELALAT 
RSGVARRIDA FAPDCLHIAT EGPLGWLARS VARSRGWPFT TAYHSRFPEY VHARTRLPTA
WSYALLRRFH NAGLGTLTPT PAIVDDLRAR GFRHARWWSR GVDLGLFSAE GARLPRAEHP
VFLYVGRVAV EKQVDAFLKL DLPGEKWIAG EGPSRARLEA RYPGVRWFGV LDGPALATLY
RSADVMVFPS VTDTFGLVMA EAMACGTPVA AFPVPGPIDV VGRSGGGVLH TDLREACLRA
LQLPRDAVRR HGEQYSWARA TQQFLAALRP IDGHRASEET LSAAERSPNV GARDPA