Gene Mpe_A3450 details


Gene Information

Locus tag: Mpe_A3450
Symbol:
ID: 4786324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3654893
End bp: 3659131
Gene Length: 4239 bp
Protein Length: 1412 aa
Translation table: 11
GC content: 65%
IMG OID: 640092028
Product: DNA-directed RNA polymerase subunit beta'
Protein accession: YP_001022638
Protein GI: 124268634
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID: [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form
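
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end positions are inclusive and that the final codon of the CDS is a stop codon that contributes no residue.

```python
# Values copied from the Gene Information fields above.
start_bp = 3654893
end_bp = 3659131
gene_length_bp = 4239
protein_length_aa = 1412

# Assumption: coordinates are inclusive, so both end bases belong to the gene.
span_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop equals Protein Length
```

Under those assumptions the three fields are mutually consistent: 4239 bp is 1413 codons, and dropping the stop codon leaves 1412 residues.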


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.889451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.177974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGAC TACTCGACCT TTTCAAGCAG TTCACCCCGG ATGAGCATTT CGATGCCATC 
AAGATCGGCC TGGCGTCCCC GGAGAAGATC CGGAGCTGGT CGTTCGGCGA GGTGAAGAAG
CCCGAGACCA TCAACTACCG TACCTTCAAG CCGGAACGCG ACGGCTTGTT CTGCGCCAAG
ATCTTCGGCC CGATCAAGGA CTACGAGTGC CTGTGCGGCA AGTACAAGCG CCTGAAGCAC
CGCGGCGTGA TCTGCGAGAA ATGCGGCGTC GAGGTCACCC AGACCAAGGT GCGCCGTGAC
CGCATGGGCC ACATCGATCT GGCCGCGCCC TGCGCGCACA TCTGGTTTCT GAAGTCGCTG
CCGTCGCGCC TGGGCCTGGT GCTCGACATG ACGCTGCGTG ACATCGAACG CGTGCTGTAC
TTCGAGGCCT ACGTGGTCGT CGACCCGGGC ATGACCGAGC TGAAGAAGTT CTCGATCATG
ACCGAGGACG ACTACGACGC GAAGAAACAG CAGCACGGCG ACGAATTCGT CGCGCTGATG
GGCGCCGAGG GCATCCAGAA GTTGCTGGCC GAGATCGACC TCGACGTCGA GATCGAGCGC
CTGCGCGGCG ACATGACTGG CTCCGAGCTC AAGGTCAAGA AGAACTCGCG TCGCCTGAAG
GTGATGGAGG CCTTCAAGAA GTCGGGCATC AAGCCGCAGT GGATGGTGAT GAACGTGCTG
CCGGTGCTGC CGCCGGACCT GCGTCCGCTG GTGCCGCTGG ACGGCGGGCG CTTCGCCACG
TCCGACCTGA ACGATCTGTA CCGCCGCGTC ATCAACCGCA ACAACCGCCT GGCCCGCCTG
CTCGAACTGA AGGCGCCGGA GATCATCGTG CGCAACGAGA AGCGCATGCT GCAGGAGGCG
GTGGACTCGC TGCTCGACAA CGGCCGCCGC GGCAAGGCGA TGACGGGCGC GAACAAGCGC
GCCCTGAAGT CGCTGGCCGA CATGATCAAG GGCAAGAGCG GTCGCTTCCG CCAGAACCTG
CTGGGCAAGC GCGTCGACTA CTCGGGCCGT TCGGTCATCG TGGTCGGCCC GACGCTCAAG
CTGCACCAGT GCGGCCTGCC CAAGCTGATG GCGCTCGAGC TGTTCAAGCC CTTCATCTTC
TCGCGCCTGG AAGCCATGGG CATCGCCACC ACCATCAAGG CGGCGAAGAA GGAAGTGGAA
TCGGGCACGC CGGTGGTCTG GGACATCCTT GAAGAAGTCA TCAAGGAACA CCCGGTCATG
CTGAACCGCG CGCCGACGCT GCACCGCCTG GGCATCCAGG CGTTCGAGCC GGTGCTGATC
GAGGGCAAGG CGATCCAGCT GCACCCGCTG GTCTGCGCGG CGTTCAACGC CGACTTCGAC
GGTGACCAGA TGGCCGTCCA CGTGCCGCTG TCGATCGAGG CGCAGATGGA AGCCCGCACG
CTGATGCTGG CCTCCAACAA CGTGCTGTTC CCGGCCAACG GCGAACCGTC GATCGTGCCG
TCGCAGGACG TCGTGCTGGG TCTGTACTAC GCGACCCGCG ACCGCATCAA CGCCAAGGGC
GAGGGCCTGA TCTTCTCCGA CGTGGTCGAG GTGCAGCGCG CGCTCGACAA CGGCCAGGTC
GAGATCACCG CGAAGATCGC CGTGCGCCTG ACCGAGTGGA CCAAGGACAA GGAGAGCGGC
GAGTTCGTGC CGGAGTCCAA GCTGGTCGAC ACTACGGTCG GCCGTGCTCT GCTGTCGGAG
ATCCTGCCCA AGGGCCTGCC GTTCGCCAAC ATCAACAAGG CGCTGAAGAA GAAGGAGATC
TCGCGGCTGA TCAACACCTC CTTCCGCAAG TGCGGTCTGA AGGAGACGGT GGTGCTGGCC
GACAAGCTGC TGCAGAGCGG CTTCCGTCTG GCCACGCGCG CCGGTATTTC GATCTCGATC
GACGACATGC TGGTGCCCAA GCAGAAGCAC GACCTGATCG AGCGCGCCGA GAAGGAAGTC
AAGGAGATCG AACAGCAGTA CGTCTCGGGT CTCGTGACGG CCGGGGAGCG CTACAACAAG
GTCGTCGACA TCTGGGGCAA GACCGGCGAC GAGGTCGGCA AGGTCATGAT GGCCCAGCTG
TCCAAGCAGA AGGTCGAGGA CCGTCACGGC AAGCTGGTGG ACCAGGAGTC GTTCAACTCC
ATCTACATGA TGGCCGACTC CGGTGCGCGC GGTTCCGCCG CGCAGATCCG CCAGTTGGCC
GGCATGCGCG GCCTGATGGC CAAGCCGGAC GGCTCGATCA TCGAGACGCC CATCACGGCG
AACTTCCGTG AAGGCCTGAA CGTGCTGCAG TACTTCATCT CCACCCACGG CGCCCGCAAG
GGCTTGGCGG ACACGGCGCT GAAGACCGCG AACTCCGGCT ACCTGACGCG CCGCCTGGTC
GACGTCACAC AGGATCTGGT GGTGACCGAG GACGATTGCG GCACTGACGC CGGCATCGCG
ATGCGCGCGC TGGTCGAGGG CGGCGAGGTC ATCGAGTCGC TGCGCGACCG CATTCTCGGC
CGCGTGACGG CCATCGAGGT GCTGCATCCA GAGACCCAGC AGGTGGTCGT GCCGGCCGGA
CTGATGCTCG ACGAGGACAC GCTCGACATC GTCGAGGCGG CTGCCGTCGA CGAGGTGAAG
GTCCGCACGC CGTTGACCTG CCATACGCGC TTCGGCCTGT GCGCCAAGTG CTACGGCCGC
GACCTCGGTC GCGGCGGGCT GGTGAACGCC GGCGAGGCGG TGGGCGTGAT CGCCGCGCAG
TCGATCGGCG AGCCCGGCAC GCAGCTGACG ATGCGCACCT TCCACATCGG TGGCGCGGCG
TCGCGTGCCG CGGTGGCCTC CAGCGTCGAG GCGAAGTCCG ATGGTCACAT CGGCTTCAAT
GCGACGATGC GCTACGTGAC CAACGGCAAG GGCGAATTGG TGGTGATCTC GCGTTCCGGC
GAGATCATCA TCTCCGATCA GCACGGCCGA GAGCGCGAAC GCCACAAGGT ACCGTACGGC
GCGACGCTCA ACATCAAGGC CGACCAGCAG GTCAAGGCCG GCACCGTGCT GGCCAACTGG
GATCCGCTGA CCCGGCCGAT CATCACCGAG TTCGCCGGCA AGGCGAAGTT CGAGAACGTG
GAAGAGGGTG TCACCGTCGC CAAGCAGGTG GACGAGGTGA CGGGCTTGTC GACGCTGGTG
GTCATCGACC CGAAGCGCCG TGGCGCAGCC AAGGTGGTTC GCCCGCAGGT GAAGCTGCTC
GACGCAGCCG GGAACGAGGT GAAGATCCCC GGTACCGACC ACTCGGTGAC GATCGGTTTC
CCGATCGGCT CGCTGGTGCA GATCCGCGAC GGTCAGGACC TCGCCCCGGG CGAAGTGCTG
GCCCGCATCC CGGTCGAAGG CCAGAAGACG CGCGACATCA CCGGCGGTCT GCCGCGGGTG
GCCGAGCTGT TCGAGGCCCG CACGCCCAAG GACAAGGGCA CCCTGGCCGA GATGACCGGG
ACCGTGTCGT TCGGCAAGGA GACCAAGGGC AAGGTGCGCC TGCAGATCAC CGACCCGGAC
GGCAAGGTCT ACGAAGAGCT GGTGCCGAAG GAGAAGAACA TCCTGGTGCA CGAAGGCCAG
GTGGTCAACA AGGGCGAGTC CATCGTCGAC GGCCCGGCCG ATCCGCAGGA CATCCTGCGA
CTGCTGGGGA TCGAGGAACT GGCGCGCTAC ATCGTCGACG AGGTGCAGGA CGTCTACCGC
CTGCAGGGTG TGAAGATCAA CGACAAGCAC ATCGAGGTGA TCGTTCGCCA GATGCTGCGC
CGTGTGCAGA TCGTCAACCC GGGCGACACG CACTACATCC TGGGTGAGCA GGTCGAGCGC
GCCTCGATGC TGGACACCAA CGACAAGATG CGCGCCGAAG GCAAGATGAT CGCGACGCAT
GCCGACGTGC TGCTGGGTAT CACCAAGGCC TCGCTGTCGA CCGACTCGTT CATCTCGGCG
GCGTCGTTCC AGGAGACCAC GCGCGTGCTG ACCGAAGCGG CGATCATGGG CAAGCGCGAC
GAACTCCGTG GTCTGAAGGA GAACGTCATC GTCGGTCGTC TGATCCCGGC CGGGACCGGC
CTGGCTTTCC ACCGGGCCCG CAAGGCCAAG GAGGAAATGG ACGACGCCGA ACGCCGCTCG
ATCGCTCTGC AGGAGGCTGA GGAGCAGGCC CTGCTGACGC CAGCGACGAC CGCCGAGGCT
GTGGTGGGCG AGGAGCCGGC GCCGCCCCCG GCGCAGTAG
 
Protein sequence
MKGLLDLFKQ FTPDEHFDAI KIGLASPEKI RSWSFGEVKK PETINYRTFK PERDGLFCAK 
IFGPIKDYEC LCGKYKRLKH RGVICEKCGV EVTQTKVRRD RMGHIDLAAP CAHIWFLKSL
PSRLGLVLDM TLRDIERVLY FEAYVVVDPG MTELKKFSIM TEDDYDAKKQ QHGDEFVALM
GAEGIQKLLA EIDLDVEIER LRGDMTGSEL KVKKNSRRLK VMEAFKKSGI KPQWMVMNVL
PVLPPDLRPL VPLDGGRFAT SDLNDLYRRV INRNNRLARL LELKAPEIIV RNEKRMLQEA
VDSLLDNGRR GKAMTGANKR ALKSLADMIK GKSGRFRQNL LGKRVDYSGR SVIVVGPTLK
LHQCGLPKLM ALELFKPFIF SRLEAMGIAT TIKAAKKEVE SGTPVVWDIL EEVIKEHPVM
LNRAPTLHRL GIQAFEPVLI EGKAIQLHPL VCAAFNADFD GDQMAVHVPL SIEAQMEART
LMLASNNVLF PANGEPSIVP SQDVVLGLYY ATRDRINAKG EGLIFSDVVE VQRALDNGQV
EITAKIAVRL TEWTKDKESG EFVPESKLVD TTVGRALLSE ILPKGLPFAN INKALKKKEI
SRLINTSFRK CGLKETVVLA DKLLQSGFRL ATRAGISISI DDMLVPKQKH DLIERAEKEV
KEIEQQYVSG LVTAGERYNK VVDIWGKTGD EVGKVMMAQL SKQKVEDRHG KLVDQESFNS
IYMMADSGAR GSAAQIRQLA GMRGLMAKPD GSIIETPITA NFREGLNVLQ YFISTHGARK
GLADTALKTA NSGYLTRRLV DVTQDLVVTE DDCGTDAGIA MRALVEGGEV IESLRDRILG
RVTAIEVLHP ETQQVVVPAG LMLDEDTLDI VEAAAVDEVK VRTPLTCHTR FGLCAKCYGR
DLGRGGLVNA GEAVGVIAAQ SIGEPGTQLT MRTFHIGGAA SRAAVASSVE AKSDGHIGFN
ATMRYVTNGK GELVVISRSG EIIISDQHGR ERERHKVPYG ATLNIKADQQ VKAGTVLANW
DPLTRPIITE FAGKAKFENV EEGVTVAKQV DEVTGLSTLV VIDPKRRGAA KVVRPQVKLL
DAAGNEVKIP GTDHSVTIGF PIGSLVQIRD GQDLAPGEVL ARIPVEGQKT RDITGGLPRV
AELFEARTPK DKGTLAEMTG TVSFGKETKG KVRLQITDPD GKVYEELVPK EKNILVHEGQ
VVNKGESIVD GPADPQDILR LLGIEELARY IVDEVQDVYR LQGVKINDKH IEVIVRQMLR
RVQIVNPGDT HYILGEQVER ASMLDTNDKM RAEGKMIATH ADVLLGITKA SLSTDSFISA
ASFQETTRVL TEAAIMGKRD ELRGLKENVI VGRLIPAGTG LAFHRARKAK EEMDDAERRS
IALQEAEEQA LLTPATTAEA VVGEEPAPPP AQ
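
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: the `translate` helper and `CODON_TABLE` are hypothetical names, and the table is the standard amino-acid assignment, which translation table 11 shares for sense codons (table 11 differs mainly in which codons may act as starts). Only the first and last rows of the gene sequence are used; every row before the last holds 60 bases, so each row begins in frame.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First and last rows of the gene sequence listed above.
first_row = "ATGAAGGGAC TACTCGACCT TTTCAAGCAG TTCACCCCGG ATGAGCATTT CGATGCCATC"
last_row = "GTGGTGGGCG AGGAGCCGGC GCCGCCCCCG GCGCAGTAG"

print(translate(first_row))  # MKGLLDLFKQFTPDEHFDAI
print(translate(last_row))   # VVGEEPAPPPAQ
```

Both results match the start and end of the protein sequence shown above, and the final TAG codon is the stop that is not counted in the protein length.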