Gene Mpe_A1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1041 
Symbol 
ID4785644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1110956 
End bp1112311 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content72% 
IMG OID640089603 
Producttype I secretion outer membrane efflux protein 
Protein accessionYP_001020237 
Protein GI124266233 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAGC GTTCCCCGCA CTGCCCGCCG TCGCCGCTGC CGCGGCGGCA CCGCCTGGCC 
CGGGCGCTGA CGGCCGGTCT GCTGCTCGGT GCCGCCGCGG TGGCGCAGGC TCAGAGCCTG
CAGTCGCTGT ACGACGCCGC ACGCGGCTAC GACGCCAGCT ACCTGGCGGC CCGCGCGCTG
GCCGATTCGG CACAGTTCCG CTACGAGCAG GTGAAGGCGC TCAACCGCCC GAGCGTGGCC
CTGTCGGCGA GCACCACCCG CAGCGAGACC GACACATCGG CCGGCAGCGC GAGCGGCACC
AGCACCGGCG CGCAGATCTC GGCGCTGCAA CCGCTGTTCA ACCGCAGCAA CAGCAGCACG
ATCGATCAGG CCGAGAAGAG CTATGCGGTG TCGATGGCCG ACCTCGAGAG CGCCGAGCAG
GACCTGGTGG TGCGGCTGAG CCAGGCCTAC TTCGACGTGC TGGCGGCACA GGACACGCTG
GCGACGACAC GCGCCAACAA GACCGCCATC GCCGAGCAGC TGGCCTCGGC GAAGCGCAAT
TTCGAGGTCG GCACCGCAAC CATCACCGAC ACGCGCGAAG CGCAGGCCCG CTACGACCTC
GCCACCGCGC AGGAGATCGC CGCCGAGAAC GACCTGCGCG TACGGCGCAT CGCACTCGAC
CAGCTCGTCG GGCGCACCGA CGTCGAACCG CGGCCGCTGA CGGTCCCGGT GCAGCTGCCC
GAGGTGCTGC CGGCCAACGT GGAGGACTGG GTGACGCAGG CGGGCCAGTC GCCCAGCGTG
CGCAAGGCCC AGCTGGCCTA CGAGGTGGCG CAGCTCGAGA CCGAGAAGGC GCGTGCCGGC
CACCTGCCGA CAGTCGACCT GGTGGGCGGC GTCGGCCGCA ACCGCAACAC CGGCCGCACC
GCCGGCAGCG GTCTGAGCGG CAGCACCACC AGCGCGCAGA TCGGCGTGGA GCTGAACCTG
CCCCTGTTCG CCGGCTACTC GATCCAGAAC CGCGTGAAGG AAACGCTGTC GCTGGAGGAG
AAGTCGCGCA ACGACCTGGA GTTCGCGCGC CGCAGCGTCA CCCAGGGCAC GCGCCAGGCC
TACTTCACCG TCCAGTCGGG CCTGGCCACG GTGAAGGCCC TCGAAGCCGC CGAGGCCTCC
AACAAGCTCG CGCTCGAGGC CACCCAGCTC GGCTACAAGG TCGGCGTGCG CGTCAACCTC
GACGTGCTGA ACGCGCAGAC GCAGCTCTAC ACCACCCAGC GCGACCTGGC GCGCGCCCGC
TACGACGTGG TGCTGGGCAA CCTGCTGCTG CGCCAGGCGG CGGGCACGCT CAAGCCCGAC
GACGTGGGCA GCGTCAACCG GCTGCTCGCG CCCTGA
 
Protein sequence
MPQRSPHCPP SPLPRRHRLA RALTAGLLLG AAAVAQAQSL QSLYDAARGY DASYLAARAL 
ADSAQFRYEQ VKALNRPSVA LSASTTRSET DTSAGSASGT STGAQISALQ PLFNRSNSST
IDQAEKSYAV SMADLESAEQ DLVVRLSQAY FDVLAAQDTL ATTRANKTAI AEQLASAKRN
FEVGTATITD TREAQARYDL ATAQEIAAEN DLRVRRIALD QLVGRTDVEP RPLTVPVQLP
EVLPANVEDW VTQAGQSPSV RKAQLAYEVA QLETEKARAG HLPTVDLVGG VGRNRNTGRT
AGSGLSGSTT SAQIGVELNL PLFAGYSIQN RVKETLSLEE KSRNDLEFAR RSVTQGTRQA
YFTVQSGLAT VKALEAAEAS NKLALEATQL GYKVGVRVNL DVLNAQTQLY TTQRDLARAR
YDVVLGNLLL RQAAGTLKPD DVGSVNRLLA P