Gene Mpe_A0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0140 
Symbol 
ID4784840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp145726 
End bp147861 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content72% 
IMG OID640088687 
Producthypothetical protein 
Protein accessionYP_001019337 
Protein GI124265333 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0386904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTTCC CCTCCCTGCG TCCGCGCGCC GTCATCCCGC TGCGCCCGCT GTCGCGCCCG 
GGCCGGCTGT GGGCCGTGGT CCTGGCGCTC GCCTGCCTGG CGAGCCTGCT GCTCGTCGCG
CTGCCGGCGC AGGCCGATGA CGCCGACGCG CGGCGCGCCG AGAGCCCGTA CTTCTTCGTC
AAGAGCGACG ACCCGAGCGT CGACCGCCTG CCGCTGAAGG CCACCGAGGT CGACGCGCGC
ATCGCCGGGC CGATCGCCGA CGTCACGGTG ACGCAGCGCT ACCGCAACGA GGGCCAGCGT
CCCATCGAGG CGCGCTATGT CTTCCCCGGC TCCACGCAGG CCGCGGTGCA CGCGATGACG
GTGCGCGTCG GCCACCGCGT GATCGTCGCC GACATCCGCG AGAAGCAGCG CGCCCGCATC
GAGTTCGAGG CCGCCAAGCG CGAAGGCAAG ACCGCCGCGC TGCTGGAGCA GGAGCGGCCC
AACGTGTTCT CGATGAACGT GGCCAACATC CTGCCGGGCG ACGAGGTGGC GGTGGAGCTG
CGCTACACCG AGCTGCTGCC GCCCACCGAG GGCCGCTACC AGTTCGTGTT CCCCACCGTG
GTGGGCCCGC GCTATCGCTC CCCCGCCAAC AAGGTCGCAA CCACTGAAGA GCAAGCAAAC
GGCACGGCAG CGCCGTCCGG CAGCTTCCCC GCCGTGCCCT ACCTGCCCGC GGGCGAGGCG
TCGGACACCC GCTTCGACCT GCGCGTGGCC TTCGCCTCGC CGCTGCCGGT GAGCGGCCTG
CGCTCCAGCT CCCACCAGAT CGAGGTGGAA GGCGAAGGCA GCAACGGCGC GCGCGTCGCG
CTGGGCGGTG ACGCGGAGTC CTCCCGCCAC AACGGCAACC GCGATTTCAT CCTCGACTAC
CGCCTCGCCG GTGACGGCAT CGCCTCGGGC CTCACGCTGT TCCCGGGCGC GCCCGGCGAG
GAGAACTTCT TCCTCGCGAT GGTGGAGCCG CCCCGGGCCA TCGCCACCAC GCAGATCAAC
CCGCGCGACT ACGTGTTCGT GGTCGACATC TCGGGTTCGA TGCACGGCTA CCCGCTCGAC
ACCGCCAAGA CGCTGCTGCG CCACCTGATC GGCGGGCTGC GTCCCAGCGA CACGTTCAAC
GTGCTGCTGT TCTCGGGCAG CAACCGCATG CTCAACGAGA CCTCGGTGCC GGCCACGCAG
GCCAACGTCG CGCAGGCGCT GCGCACCATT GCGCAGATGG GCGGCAGCGG CAGCACCGAG
ATCGTGCCGG CGCTCAAGCG CGTGGCCGCG CTGCCCAAGT CGCCCGACGT GTCGCGCAGC
GTGATCGTGG TGACCGACGG CTACGTCACG GTGGAGAGCG AGGTGTTCCA GCTCATCCGC
AGGAACCTCG GCCAGACCAA CGTGTTCGCG TTCGGCATCG GCAGCTCGGT CAACCGCCAT
CTCATCGAGG GCATTGCACG CGCCGGCCAG GGCGAGCCCT TCATCGTCAC GCGGCCCGAG
CAGGCTGCCG CGCAGGCCGA GCGCCTGCGC CGCATGATCG ACGCGCCGGT GCTCACGCAG
GTGAAGGCGC GCTTCGAGGG CCTGGACACC TACGACGTGG AACCCGAGCG CCTGCCTGAC
GTGCTGGGTG GCCGCCCGGT GCTGGTGTTC GGCAAGTGGC GCGGCGAGCC GCGCGGCCAG
CTCGTCGTCG AAGGCCAGGC GGCCCACGGC GCCTGGCAGG CGATGCTGCC GGTCGCCACG
CCCGACGCGC AGGCCGTGGC GCTGCGCCAC CTGTGGGCGC GCCACCGCAT CCAGTCGCTG
TCGGACCAGG AGGCGCTGCA GGGCGGCGAC ACGCAGCGCG AGGCCATCAC CGCGCTGGGC
CTGCGCTACA GCCTGCTCAC GCAGTACACC AGCTTCATCG CCGTGGACCG CGTGGTGCGC
AACCCCGCCG GCGGCAGCAC GCCGGTGGAC CAGCCCTCGC CCCTGCCGCA GGGCGTGAGC
AACCTGGCCA TCGGCGCCGA GGTGCCCAGC ACGCCGGAGC CTTCGGCCTG GATCGCGCTG
GGCGTGGTGC TGGTGCTGCT GGGTGCGGCG GCCTGGCACC GCGGCGGATC TAGCGCCAAG
CGCCTGTGGC GCAAGCCGCG GCGCCTGGTG CGGTGA
 
Protein sequence
MFFPSLRPRA VIPLRPLSRP GRLWAVVLAL ACLASLLLVA LPAQADDADA RRAESPYFFV 
KSDDPSVDRL PLKATEVDAR IAGPIADVTV TQRYRNEGQR PIEARYVFPG STQAAVHAMT
VRVGHRVIVA DIREKQRARI EFEAAKREGK TAALLEQERP NVFSMNVANI LPGDEVAVEL
RYTELLPPTE GRYQFVFPTV VGPRYRSPAN KVATTEEQAN GTAAPSGSFP AVPYLPAGEA
SDTRFDLRVA FASPLPVSGL RSSSHQIEVE GEGSNGARVA LGGDAESSRH NGNRDFILDY
RLAGDGIASG LTLFPGAPGE ENFFLAMVEP PRAIATTQIN PRDYVFVVDI SGSMHGYPLD
TAKTLLRHLI GGLRPSDTFN VLLFSGSNRM LNETSVPATQ ANVAQALRTI AQMGGSGSTE
IVPALKRVAA LPKSPDVSRS VIVVTDGYVT VESEVFQLIR RNLGQTNVFA FGIGSSVNRH
LIEGIARAGQ GEPFIVTRPE QAAAQAERLR RMIDAPVLTQ VKARFEGLDT YDVEPERLPD
VLGGRPVLVF GKWRGEPRGQ LVVEGQAAHG AWQAMLPVAT PDAQAVALRH LWARHRIQSL
SDQEALQGGD TQREAITALG LRYSLLTQYT SFIAVDRVVR NPAGGSTPVD QPSPLPQGVS
NLAIGAEVPS TPEPSAWIAL GVVLVLLGAA AWHRGGSSAK RLWRKPRRLV R