Gene Mpe_A1188 details


Gene Information

Locus tag: Mpe_A1188
Symbol:
ID: 4785589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1283549
End bp: 1286617
Gene Length: 3069 bp
Protein Length: 1022 aa
Translation table: 11
GC content: 63%
IMG OID: 640089753
Product: hypothetical protein
Protein accession: YP_001020386
Protein GI: 124266382
COG category:
COG ID:
TIGRFAM ID:
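
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1283549
END_BP = 1286617
GENE_LENGTH_BP = 3069
PROTEIN_LENGTH_AA = 1022


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3069
print(expected_protein_length(GENE_LENGTH_BP))  # 1022
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.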


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTACC CGATGCCACC TGTGATCAGT CCGCCGATCA AGACGGAGAT CAGCGTGGTC 
AAAGGTGTCG TCCACCACGC TGACGGCAGC TTTCCTTCGA TCCGGGCGAC CGCTGATTCA
AGGCCAGTCA GCGACTTCGT TAACGCAGCT TCATCTGGTT GCGATCCCTT GCCGTGGCCT
GTCGCTGCGG GCGCTGGTGG AGGACTGACC TGGGTCGTTA TCACTGGCGC CACCGCCGCG
GAGTCAACGG CTTTCGGTGG GGGCGTCTGC TGCGCGCTGG CCAGCGTTGT CACGGTGGCC
AAGGCAAACG CGATGAGCCA TGTCTTCATG TTCCCTCCGG AGCGCGTTGA TCGCGAATTG
TGCCGCCATC GGAAGGCAGC CACCGCTCGT GCGGCCGACG TGAAGAAGGG CGCTAACATG
ATGCTGCCGT GCTGGAATGT TGCGCACGGG CAGACCGCCT GGAGCGACGC AGCGGTCGAG
TTAAGCGGTG CACTTGGAGA GGCCTCGATG CGCAACGGCG CGGGGAAGCG GGTCGGCGAC
GACCTCTACG TCCATCTCGA GTTCGTTGAC GATCTCGCGG ATTCGTCGCA ACGCGCGATC
ATTCGGAATG CGCTTCAGCG GCTCGACGAT GGAGGGCGAG CGTTGGCCAA CGTCGCGAAG
ATCAACACCC GCTCACAGCG CGTTTCACTT CTCGCCTACC CGGAGTTCGA TGACGAGCCG
TTTCCAGCCC TTGCTTTGAG CTGGTCCCCA GGGAGGAGCG GCAACAGCGA GCCAGTGCTA
CGGTCCTATG TCGAATCGCT GAACCCACCG ATCCTGCACA GGAAGGAGCT GCTGGTGTCG
GATCGGCATC CTGCGCGGGA ACGTTGGTGC GCGATGACCT CACAGGCCGA AGCGCTCGGC
TTGTTCGACG ATCCGCTGTC AATCGGCTTC CGAATGAACT GGGAGCGACA CATTGCGGCA
AAGGGATATC AGCTGCTCGG TGGCCAGCTC GCGCCGTTGG GCAACGCCAT CGACGATGTT
GATGGGCGGG CGAACCGGGT AGGGCAATGT GTTCAGCGTC ACCTGACCGC GCTCGGGCGG
TCGGCGCTCT CGGCACCGGT CCAGACTTTG CTGCGGTTGG GTTTGCTAAG CCGCGAGATG
TCATTCTTCG ACTATGGCTG CGGACGAGGC GACGACCTGT CAACGCTCAG TGGCGAAGGT
TTCGCTGCCC AAGGCTGGGA TCCTCACTAC GCGCCGAACA GGCCGCTCTC CACTGCCGAA
GTGGTCAACG TCGGCTTCGT GATCAACGTC ATCGAAGATC CCGCGGAGCG CGTCGACGTG
CTCCACCGCG CTTTCTCCCT CGCCCAGCGC GTGATGTCCG TGGCAGTGAT GCTGTACGGA
CCGGAGAACG CTGGAAAGCC ATTCGGCGAT GGCTTCATGA CGTCGAGGGG CACGTTCCAG
AAGTACTTTC AGCAGGCGGA ACTGAAAGAC TACCTCGAGC AGGCGCTGCA CCAGGAGGCA
ATCCTTGTTG GGCCAGGCAT GGCATTGGTC TTCAAGGACA AGGATTGGGA GCAGCGATAC
CTCGCCGGCC GATACAGACG GCGCGACGTG ACGGAGCGCC TGCTTGCTGT CCGGCCGAGG
CCGCCCAAGC CAGTCAGGGA GCAACCTGTC CGTGAGATCG CCGCGCCACG GACGCCTGAA
CCGCCACATC CACTGCTCAC GGAGCTGTGG CGTGCAACCC TTGACCTGGG CCGATACCCA
GAAGAGGCTG AGATTGACAG GCTGCCGGAA CTCATCGACG TCTTCGGCAG CTTGGGCCGA
GCCATCAGGA AGATGGTCCG CAGCTTCGAT GGGGCTATGC TGGCGAAGGC GCAGGCGGCG
CGCGCCGACG ACTTGCTGCT CTACTTCGCG ATCCAGCAGT TCAGCAAGCG CCCTCGGTAT
CGTCAGCTGG AGGTCCGCCT ACAGCGAGAC GTCAAGGCGT TCTTTGGTGA CTATGCGAGT
GCCCAGGCGG CCGGTATGGA ACTGCTCACT CGTGCGGCTG ATGGAGATGC GCTGCTCACG
GCTTGCCGAG AAGCGGTGAC TAGCGGGCTC GGCTGGCTGG ACGCCGACAA GCTGCAGCTT
CATGTCTCGC TGGTCGAGCG GCTCCCGATT GTCTTGCGCG CCTTCGTTTC GTGCGGCCTC
CTGGTCTACG GTGACCTGGG CAAAGTGGAT CTCGTCAAGG TGCACTGCGG CTCAGGGAAA
CTGACGCTGA TGCAGTTCGA AAACTTCGAT GCACAGCCAC TGCCGCTGAT GACCAGGCGC
ATCAAGGTGA ACGTACGCCG CGCCGACTAC GACCTCTTCG TGTACGGGGC CGAATACCCC
AAGCCGCCTC TCTATCTCAA GGGTCGCTAC ATGCACGAAG AAATGGCGCA CTACGAAGAG
CAGGAAGCCT TCGATCGGGC ACTGGAGGAA GCTGGTGTCC TGGGCAGCGC CGAACATGGA
CCGACCTATG AGCAGCTGGA GAAGAGCTTG GCAAGGCGCA GGCTGGAGGT CAAAGGCTTT
TCCCTTCGCC GCAGCACGAC CATCCCATCG CTCGACGAAG CGTGTGGCGC GACGCTTTCC
TACCGCAGCT TCATCGAGTG CGGCGAAACC CAGGCGCGGC TGCGGCTGCC CAACACCCCG
CTGAATCCGG AGAGCTATAA CGCCTTGTAC GACTTGGCCG TGAAGCTGCT CGATCCGATC
GTGGAGTACT TCGGCGCGAT CCGCTTGACC TATGGCTTCT GTTCCCCTGG TCTGGGCGCG
CACATCAAGA GGCGGGTGGC ACCCGACTTG GACCAGCATG CGGCACATGA ACTGAACCGG
CGCGGCCAGC CGCTTTGCGA GCGCGGTGGC GCCGCATGCG ACTTCATCGT CGACGATGAG
AACATGGAGG AGGTCGCCGA TTGGATTGTC GAGAACCTCC CTTTCGATCG GCTCTACTAC
TACGGGAAGG ACAGGCCGAT CCACCTCAGC TATTCGCCTA CCGAGTTGGG AGAGGCGATC
GAGTTGCGGG CTGGACCTTC CGGCCGCCTG GTGCCCCGGC GCTACAAGTC GGCAGGAACT
CCCAAGTGA
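
The GC content field is a summary of the gene sequence shown here. The following is a minimal sketch of how such a percentage might be computed from the 10-base block layout; the helper names are hypothetical, and the record does not say how its 63% figure was rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the block layout and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Demonstration on the first two blocks only; paste the full gene
# sequence into this string to evaluate the whole CDS.
first_blocks = "ATGAAGTACC CGATGCCACC"
seq = clean_sequence(first_blocks)
print(len(seq), round(100 * gc_fraction(seq)))
```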
 
Protein sequence
MKYPMPPVIS PPIKTEISVV KGVVHHADGS FPSIRATADS RPVSDFVNAA SSGCDPLPWP 
VAAGAGGGLT WVVITGATAA ESTAFGGGVC CALASVVTVA KANAMSHVFM FPPERVDREL
CRHRKAATAR AADVKKGANM MLPCWNVAHG QTAWSDAAVE LSGALGEASM RNGAGKRVGD
DLYVHLEFVD DLADSSQRAI IRNALQRLDD GGRALANVAK INTRSQRVSL LAYPEFDDEP
FPALALSWSP GRSGNSEPVL RSYVESLNPP ILHRKELLVS DRHPARERWC AMTSQAEALG
LFDDPLSIGF RMNWERHIAA KGYQLLGGQL APLGNAIDDV DGRANRVGQC VQRHLTALGR
SALSAPVQTL LRLGLLSREM SFFDYGCGRG DDLSTLSGEG FAAQGWDPHY APNRPLSTAE
VVNVGFVINV IEDPAERVDV LHRAFSLAQR VMSVAVMLYG PENAGKPFGD GFMTSRGTFQ
KYFQQAELKD YLEQALHQEA ILVGPGMALV FKDKDWEQRY LAGRYRRRDV TERLLAVRPR
PPKPVREQPV REIAAPRTPE PPHPLLTELW RATLDLGRYP EEAEIDRLPE LIDVFGSLGR
AIRKMVRSFD GAMLAKAQAA RADDLLLYFA IQQFSKRPRY RQLEVRLQRD VKAFFGDYAS
AQAAGMELLT RAADGDALLT ACREAVTSGL GWLDADKLQL HVSLVERLPI VLRAFVSCGL
LVYGDLGKVD LVKVHCGSGK LTLMQFENFD AQPLPLMTRR IKVNVRRADY DLFVYGAEYP
KPPLYLKGRY MHEEMAHYEE QEAFDRALEE AGVLGSAEHG PTYEQLEKSL ARRRLEVKGF
SLRRSTTIPS LDEACGATLS YRSFIECGET QARLRLPNTP LNPESYNALY DLAVKLLDPI
VEYFGAIRLT YGFCSPGLGA HIKRRVAPDL DQHAAHELNR RGQPLCERGG AACDFIVDDE
NMEEVADWIV ENLPFDRLYY YGKDRPIHLS YSPTELGEAI ELRAGPSGRL VPRRYKSAGT
PK
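
The protein sequence corresponds to the gene sequence translated under translation table 11. The following is a minimal sketch of that translation, assuming the table 11 amino acid assignments are the same as those of the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace from the block layout is ignored; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence in this record.
print(translate("ATGAAGTACC CGATGCCACC TGTGATCAGT"))  # MKYPMPPVIS
print(translate("CCCAAGTGA"))                         # PK
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed in this record.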