Gene Mpe_A1216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1216 
Symbol 
ID4787063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1310998 
End bp1312008 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content72% 
IMG OID640089781 
Productthiamine biosynthesis lipoprotein 
Protein accessionYP_001020413 
Protein GI124266409 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGT CCAGCCTGTC GAGTGTCGGC CGCCGTGCGG CGGGCGCGGT ACGCATGCCG 
GCCGCGGCCT GGGTTCAGGG CGGTTGGATG CGGCGCGAGG AGGCCATCAT GGGCACCTCG
ATCAGCGTCG AGCTGTGGAG CGAAGACCCG TCCGCCGGCA ACGCCGCGAT GGATCTGGTG
ATCGGCGAGA TGCACCGCAT CGACCGCGGC ATGAGCCCGC ACAAGCCGGA CTCCGAGCTG
TCGCGCATCA ACCGAGAGGC GTCGGTTCGG CCGGTACCGC TCAGCGAAGA GATGTTCGCG
CTGCTGGCGC GCTCGCTGGA GTTCTCGCGC CGCTCCGAAG GTGCCTTCGA CATCACCTTC
GCCGGCGCCG GCCGGCTGTA CGACTACCGC GAGCGCATCC GGCCGACCGA TGCCGCGCTG
GCACAGGCCT GTGCGGCCGT CGGCCACCAG TACCTGGAGC TCGACGCCGC CGCGCGCAGC
GTGCGCTTCG CCCGCGACGG CCTGCGCATC GACCTGGGCG GCTTCGCGAA GGGGCATGCG
GTGGACAACG CCGCCGCGAT CCTTGCGCGC CGCGGCATCC GCCATGCCTT CATCAGCGCC
GGCGGCGACA GCCGCGTCAT CGGCGACCGC CGCGGCCGGC CCTGGACCAT CGGTGTGCGC
GATCCGCGGC GGCCTGGCGA GATCATCGCG CTGCTTCCGC TCGAGGACGC GGCGGTCTCC
ACCTCCGGGG ACTACGAGCG CTACTTCGAC ACGCCCGACG GCGCACGCTG CCATCACATC
CTCGATCCGA GGACCGGCAA ATCCCCGGAC AGCGTGCGCA GCGTGACCAT CATCGCGCCG
GACGGGCTGA CCAGCGAAGC GCTCTCGAAG TGCCTGTTCG TGATGGGCGT CGAGCGCGGC
CTGCGCTTCG TCGAATCGCA CGCCGGTGTC GACGCCGTGG TGGTCGACGC GGCGGGGGCG
CTGCACTACT CGTCCGGACT GCTCGCCGCC GGCGCGCAGC CGCGGCAGTG A
 
Protein sequence
MSMSSLSSVG RRAAGAVRMP AAAWVQGGWM RREEAIMGTS ISVELWSEDP SAGNAAMDLV 
IGEMHRIDRG MSPHKPDSEL SRINREASVR PVPLSEEMFA LLARSLEFSR RSEGAFDITF
AGAGRLYDYR ERIRPTDAAL AQACAAVGHQ YLELDAAARS VRFARDGLRI DLGGFAKGHA
VDNAAAILAR RGIRHAFISA GGDSRVIGDR RGRPWTIGVR DPRRPGEIIA LLPLEDAAVS
TSGDYERYFD TPDGARCHHI LDPRTGKSPD SVRSVTIIAP DGLTSEALSK CLFVMGVERG
LRFVESHAGV DAVVVDAAGA LHYSSGLLAA GAQPRQ