Gene Mpe_A2491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2491 
Symbol 
ID4784887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2650948 
End bp2653188 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content65% 
IMG OID640091061 
Productputative RNA polymerase sigma(sigma-70) factor transcription regulator protein 
Protein accessionYP_001021681 
Protein GI124267677 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.046137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.87444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGA AGAAGAGCGC TCCCGCCCCT GCGAGCAAGC CATCGAAGAC CGCCGCGCCG 
GCGCCGGCCG CCAAGGCCGC AGCCAAGTCG GCCGGCAAGG CCGAGATCGA CCCGAAGAAG
GCCAAGGCGG CTGCCGCCCC GGCCAAAGCC GCGGGGAAGA CGGCCAAAGT CGCCGTGCCC
GAGGCCCCCA AGGGCAAGCC GGGTCGCAAG CCGGCCGCCA AGCAAGCGCT GCCGGTGCTC
GACGAGGACC TGGGCGACAT CGAGGCCGAC CTCGAAGGCG AAGCCAGCGC CGAGGACACC
GGTGGCGACA AGCCCAAGGC CAAGCCGCTG CGCATGAAGG TCTCGCGCGC CAAGGAGCGC
GCGCTGATGC GCGAATTCGG TCTCGATGAG ACGGCGCTGA CCGAGGACGA GGTGGCCAAG
CGCCGCCAGG AGCTCAAGAC GCTCATCAAG ATGGGCAAGA CCCGGGGTTT CCTGACACAC
CAGGAAATCA ACGACCACCT GCCCGAAAAG CTGATCGAGG CCGAGATCCT CGAGGCCATC
GTGTCGATGC TCAACGACAT GGGCATCGCC GTCTACGAGC AAGCACCCGA CGCGGCCACG
CTGCTGATCG CCGGCGGCTC GACCGCCACC TCGACCGATG AGGAGGCCGA GGAAGAGGCC
GAGGCGGCGC TGTCGACCGT CGACTCCGAG TTCGGCCGCA CCACCGACCC GGTGCGCATG
TACATGCGCG AGATGGGCAC GGTCGAGCTG CTGACGCGCG AGGGTGAGAT CGAGATCGCC
AAACGCATCG AGGGCGGGCT GCAGGCGATG ATGCTGGCGA TCAGCGAATC GCCCACCACG
ATCGCCGAGA TCCTGGTGCT GGCCGACAAG ATCCGCGTCG GCGAGATGCA GATCTCGGAG
GCGGTCGACG GCTTCGTGTC GAACGAGGAG GCCGACGACT ACGTCGCCGA AGAGGACTTC
GACGAGTTCG ACGAGGAAGA CGACGACGAC GGCAACGGCG GCTCGAAGGC GCTGACCAAG
AAGCTCGAGG AACTGAAGAC GCAGGCGCTG GTGCGTTTCG ACGAACTGCG CACGCACTTC
GACCGCATGC GCAAGGCCTA CGAGAAGGAA GGCTACAAGT CGCCGGCCTA CAACCGCGCG
CAGATGGGCG TGAGTTCCGA GATCATGAGC CTGCGCTTCA CGGTCAAGAC CATCGAGCGG
CTGTGCCAGA TCCTGCGCTC GCAGGTCGAC GACATCCGCC GCTACGAGCG CGAACTGCGC
AAGATCGTGG TCGACAAGTG CGGCATGCCG CAGGACCACT TCATCAAGAC CTTCCCGCCA
AACTCGCTGA ACCTCAAGTG GGCCGAGAAG GAGATCGCAG CCAACAAGTC GTACAGCGCG
GTGATGGCGC GCAACCTGCC GCCGATCCAG GACCTGCAGC AGAAGCTGAT CGACCTGCAG
AGCCGTGCGG TGGTGCCGAT CGACGACCTG AAGCTCATCA ACAAGAAGAT GAACCAGGGC
GAGAAGGCCT CGCGAGACGC GAAGAAGGAG ATGATCGAGG CCAACCTGCG CCTGGTGATC
TCGATCGCGA AGAAGTACAC CAACCGCGGC CTGCAGTTCC TCGATCTGAT CCAGGAAGGC
AACATCGGCC TGATGAAGGC GGTCGACAAG TTCGAATACC GCCGCGGCTA CAAGTTCTCG
ACCTACGCGA CGTGGTGGAT CCGCCAGGCC ATCACGCGCT CGATCGCCGA CCAGGCGCGC
ACCATTCGCA TTCCGGTGCA CATGATCGAG ACGATCAACA AGATGAACCG CCTGTCGCGC
CAGCACCTGC AGGAGTTCGG CTTCGAGCCG GACGCCCCGA CGCTGGCCGA AAAGATGGAG
ATGCCCGAGG ACAAGATCCG CAAGATCATG AAGATCGCCA AGGAGCCGAT CTCCATGGAG
ACGCCGATCG GCGACGACGA CGATTCGCAC CTGGGCGACT TCATCGAGGA CACCAACAAC
ACCGCGCCGA TCGAGGCGGC GATGCAGGCA GGTCTGCGCG ACGTGGTGAA GGACATCCTC
GACTCGCTGA CGCCGCGCGA GGCCAAGGTG CTGCGCATGC GCTTCGGCAT CGAGATGTCG
ACCGACCACA CGCTGGAGGA AGTGGGCAAG CAGTTCGACG TGACGCGCGA GCGCATCCGC
CAGATCGAAG CCAAGGCGAT CCGCAAGCTC AAGCACCCGA GCCGTTCCGA CAAGCTGAGG
ACCTACCTGG ACAATCTCTG A
 
Protein sequence
MTAKKSAPAP ASKPSKTAAP APAAKAAAKS AGKAEIDPKK AKAAAAPAKA AGKTAKVAVP 
EAPKGKPGRK PAAKQALPVL DEDLGDIEAD LEGEASAEDT GGDKPKAKPL RMKVSRAKER
ALMREFGLDE TALTEDEVAK RRQELKTLIK MGKTRGFLTH QEINDHLPEK LIEAEILEAI
VSMLNDMGIA VYEQAPDAAT LLIAGGSTAT STDEEAEEEA EAALSTVDSE FGRTTDPVRM
YMREMGTVEL LTREGEIEIA KRIEGGLQAM MLAISESPTT IAEILVLADK IRVGEMQISE
AVDGFVSNEE ADDYVAEEDF DEFDEEDDDD GNGGSKALTK KLEELKTQAL VRFDELRTHF
DRMRKAYEKE GYKSPAYNRA QMGVSSEIMS LRFTVKTIER LCQILRSQVD DIRRYERELR
KIVVDKCGMP QDHFIKTFPP NSLNLKWAEK EIAANKSYSA VMARNLPPIQ DLQQKLIDLQ
SRAVVPIDDL KLINKKMNQG EKASRDAKKE MIEANLRLVI SIAKKYTNRG LQFLDLIQEG
NIGLMKAVDK FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKMNRLSR
QHLQEFGFEP DAPTLAEKME MPEDKIRKIM KIAKEPISME TPIGDDDDSH LGDFIEDTNN
TAPIEAAMQA GLRDVVKDIL DSLTPREAKV LRMRFGIEMS TDHTLEEVGK QFDVTRERIR
QIEAKAIRKL KHPSRSDKLR TYLDNL