Gene Mpe_A2558 details

Gene Information

Locus tag: Mpe_A2558
Symbol:
ID: 4785220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2730682
End bp: 2732991
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 69%
IMG OID: 640091127
Product: DNA topoisomerase IV subunit A
Protein accession: YP_001021746
Protein GI: 124267742
COG category: [L] Replication, recombination and repair
COG ID: [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit
TIGRFAM ID: [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial
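
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length). The variable names are hypothetical; the numbers are copied from the record.

```python
# Values copied from the record above.
start_bp = 2730682
end_bp = 2732991
gene_length_bp = 2310
protein_length_aa = 769

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is made of whole codons; the final codon is a stop and encodes no residue.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)  # prints: 2310 0 769
```

Under that assumption the span matches the listed gene length, divides evenly into codons, and leaves 769 residues once the stop codon is excluded.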


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCAAC TGGAACTGAA TGCTGCCGGC GGCGACAACG GCGATGCCAT CACGCTCGCC 
CACTATGCCG AGCGGGCCTA CCTCGAGTAC GCGCTGAGCG TCGTCAAAGG CCGCGCGCTG
CCCGACGTGT GCGACGGCCA GAAGCCGGTG CAACGTCGCA TCCTGTACGC GATGGAGCGG
CTGGGCCTGG CCTTCACCAC CGCCGGCGGC CCCAAGGCGA TGAAGAGCGC GCGCGTGGTC
GGCGACGTGC TGGGCCGCTA CCACCCGCAC GGCGACACCG CGGCCTACGA CGCGCTGGTG
CGGATGGCGC AGGACTTCTC GCAGCGCTAT CCGCTGATCG ACGGCCAGGG CAACTTCGGC
TCGCGCGACG GTGACGGCGC GGCGGCGATG CGCTACACCG AAGCGCGTCT CGCGCCGATC
ACGCGGCTGC TGCTCGACGA GATCGACGAG GGCACGGTCG ACTTCATCCC CAACTACGAC
GGCTCCACGC AGGAGCCGAA GCAGTTGCCC GCGCGCCTGC CCTTCGTGCT GCTGAACGGC
GCGAGCGGCA TCGCGGTGGG CCTGGCGACC GAGATCCCCA GCCACAACCT GCGCGAGGTG
GCCGCCGCCG CGGTGGCGCT GATCAAGCAG GAAAAGCTGT CCGACGACGA GCTGTTCGCC
TTGCTGCCCG GCCCCGACTA CCCGGGCGGC GGCCAGATCA TCAGCGCCGA GGCCGACATC
CGCGACGCCT ATCGCAGCGG CCGCGGTTCG CTGAAGGTGC GGGCGAAATG GAAGATCGAG
GATCTGGCGC GCGGTCAATG GAACCTCGTG GTCACCGAAC TGCCGCCCGG CACCAGCGCG
CAGAAGGTGC TGGAGGAGAT CGAGGAGCTG ACCAACCCCA AGGTCAAGGC CGGCAAGAAG
GCACTGAGCG CCGAGCAGAC CCAGCTCAAG ACCACCGTGC TCGCGGTGCT GGACGCGGTG
CGCGACGAAT CCGGCAAGGA AGCCGCGGTG CGTCTGGTGT TCGAGCCCAA GAGCCGCACC
GTCGAGCAGC AGGAGCTGAT CAACGTGCTG CTGGCGCACA CCAGCCTGGA GACCTCGGCG
TCGATCAACC TGACGATGGT GGGGGCCGAC GGCCGGCCGA CGCAGAAGTC GATGCGTCAG
ATGCTGACCG AGTGGATCGG TTTCCGGCTC GAGACGGTGC AGCGCCGCAC CCGCCACCGT
CTTGGCAAGG TGCTGGACCG CATCCACATC CTCGAAGGCC GGCAGCTTGT GTTGCTGAAC
ATCGACGAGG TGATCCGCAT CATCCGCAAT GCCGATGAGC CCAAGCCGGC GCTGATCGAG
CGCTTCCGGC TCAGCGACCG GCAGGCCGAG GACATCCTCG AGATCCGGCT GCGCCAATTG
GCGCGGCTGG AAGCCATCAA GATCGAGCAG GAGCTGAAGA GCCTGCGCGA GGAGCAGGGT
CGGCTCGAGG AGATTCTCAA CAGCCCGGCC GCGCTGAAGC GCACGGTCGT GAAGGAGATC
GAGGCCGATG CCAAGACGCA CGGCGACGAA CGCCGCACGC TGATCCAGGC CGAGAAGAAG
GCGGTCGCCG AGGTCAAGGT GGTCGACGAG CCGGTCACGG TGGTCATCAG CAGCAAGGGC
TGGGTGCGCG CGCGCCAGGG TCATGGCCAC GACGCCGCGG CCTTTGCCTT CAAGGCCGGC
GACACGCTCT ACGGCACCTT CGAGTGCCGC ACCGTCGACA CTTTGCTGGT GTTCGGATCG
AACGGGCGCG TGTACTCGGT CGCCGTCAGC GGCCTGCCCG GCGCGCGTGG CGACGGGCAG
CCGATCACGT CGATGATCGA GCTCGAATCG GGCACCCAGC CGCAGCACTA CTTCGCCGGC
CCGGCCGACG CGACGCTGCT GCTCGCCAAC ACCGGCGGCT ACGGTCTGCT GGCGAAGGCC
GGCGACCTGC AGTCGCGCCA GCGCGGCGGC AAGGGCTTCC TCACGCTGGC CGAGGGCGAG
AAGCCGTTGC CGCCCAGCCG TGCCGATGCC GCGGGGCAGA TCGCCTGCCT CAGCCTCGGC
GGTCGCCTGC TGGTGTTCGC GCTCGATGAT CTGAAGCTGC AGCCCAAGGG CGGCCGCGGC
CTGACGCTGA TGGACCTGGA GGCCAAGGAC GCTCTGGTGA GCGTCGCTGC CTTCGACAAG
ACCCTGCGCG TGCTGGGCAG CGGCCGTGGC GGCAAGGCCA AGGACGAGAC GCTCGGCGCG
GCGGCGGTGG CCGTCTACAA GGGTGCGCGG GCGCGCAAGG GCAAGCCGGC CGACATCGGC
TTCAAACCGA CCTGGCTGCT GCGCGCCTGA
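
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGAATCAAC TGGAACTGAA TGCTGCCGGC GGCGACAACG GCGATGCCAT CACGCTCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole sequence block to `clean_sequence` instead of one line should yield a 2310-base string whose GC percentage can be compared with the 69% listed under Gene Information.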
 
Protein sequence
MNQLELNAAG GDNGDAITLA HYAERAYLEY ALSVVKGRAL PDVCDGQKPV QRRILYAMER 
LGLAFTTAGG PKAMKSARVV GDVLGRYHPH GDTAAYDALV RMAQDFSQRY PLIDGQGNFG
SRDGDGAAAM RYTEARLAPI TRLLLDEIDE GTVDFIPNYD GSTQEPKQLP ARLPFVLLNG
ASGIAVGLAT EIPSHNLREV AAAAVALIKQ EKLSDDELFA LLPGPDYPGG GQIISAEADI
RDAYRSGRGS LKVRAKWKIE DLARGQWNLV VTELPPGTSA QKVLEEIEEL TNPKVKAGKK
ALSAEQTQLK TTVLAVLDAV RDESGKEAAV RLVFEPKSRT VEQQELINVL LAHTSLETSA
SINLTMVGAD GRPTQKSMRQ MLTEWIGFRL ETVQRRTRHR LGKVLDRIHI LEGRQLVLLN
IDEVIRIIRN ADEPKPALIE RFRLSDRQAE DILEIRLRQL ARLEAIKIEQ ELKSLREEQG
RLEEILNSPA ALKRTVVKEI EADAKTHGDE RRTLIQAEKK AVAEVKVVDE PVTVVISSKG
WVRARQGHGH DAAAFAFKAG DTLYGTFECR TVDTLLVFGS NGRVYSVAVS GLPGARGDGQ
PITSMIELES GTQPQHYFAG PADATLLLAN TGGYGLLAKA GDLQSRQRGG KGFLTLAEGE
KPLPPSRADA AGQIACLSLG GRLLVFALDD LKLQPKGGRG LTLMDLEAKD ALVSVAAFDK
TLRVLGSGRG GKAKDETLGA AAVAVYKGAR ARKGKPADIG FKPTWLLRA
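
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds a codon table in plain Python rather than relying on a library. It uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in which codons may act as starts, and this gene begins with ATG). The `translate` function is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) up to, not including, the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as printed in the record.
print(translate("ATGAATCAAC TGGAACTGAA TGCTGCCGGC"))  # prints: MNQLELNAAG
```

The output matches the first ten residues of the protein sequence, and the last 30 bases of the gene translate to the final nine residues followed by a TGA stop.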