Gene Mpe_A2342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2342 
Symbol 
ID4784562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2510537 
End bp2512552 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content68% 
IMG OID640090911 
ProductDNA topoisomerase III 
Protein accessionYP_001021533 
Protein GI124267529 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTGT TCCTGTGCGA GAAGCCGTCC CAGGGCAAGG ACATCGCCCG TGTGCTGGGT 
GCCGGTCAAC GCGGCAACGG CTGCTACAGC GGCGCGGGTG TCGTCGTGAC CTGGTGCATC
GGTCATTTGG TGGAGGCGGT TCCGCCCGAA GGCTACGGCG AGCAATACAA GCGCTGGGCC
ATCGAACAAC TGCCTATTCT TCCTGAGCGT TGGCGTGTCG AGCCCAAGGC GGCGACCGCA
GCGCAATTCA AGGTCGTGCA GCAGCTCGTC GCCAAGGCGG GCGAGCTGGT GATCGCGACT
GACGCCGACC GCGAGGGCGA GATGATCGCC CGCGAGATCA TCGACCTATG CGGCTACCGC
GGGCCGATTC AGCGCCTGTG GCTGTCGGCG CTCAACGATG CGTCGATCCG CAAAGCGCTG
GGTGCGCTCA AGCCGTCCGA CGAGACGCTG CCGCTGTATT TCTCCGCACT CGCCCGATCG
CGCGCCGACT GGCTGATTGG GATGAACCTG AGCCGCTTGT TCACACTGCT GGGGCGCCAG
GCCGGCTATA CCGGCGTGCT GTCGGTGGGG CGCGTGCAGA CGCCGACGCT GAAGTTGGTC
GTGGACCGCG ATCGCGAGAT CGCGCGATTC GTCTGCGTAC CGTTCTGGGC CATCGAGGTT
GCGCTTTCGC ATGCAGGCCA GTCCTTCGTC GCAAGCTGGA CGCCGCCGCA AGGCAGCGCC
GACGACGCCG ACCGCTGCTT GCAGCAGCCG GTGGCGCAGC AGGCAGCGGA ATTCCTGCGC
GCGGCCGGCA CCGCCCAGGT GCTGTCGGTG GAGACCGAGC GCGTGCGCGA AGGGCCGCCG
CTGCCGTTCG ACCTGGGCAC GCTGCAGGAG GTGTGCTCCA AGCAGTTGGG CCTCGACGTG
CAGGAGACGC TGGACATTGC CCAGGCGCTG TACGAGACGC ACAAGGCGAC AACGTATCCG
CGCTCGGATT CGGGCTACCT GCCCGAGAGC ATGCTGGCCG AGGTGCCGAC GGTACTCAAC
AGCCTGGTCA AGACCGACCC CAGCTTGCGG CCGCTGATCG AGCGCCTGGA TCGCCAACAG
CGTTCGCGTG CATGGAACGA CGGCAAGGTG TCGGCTCACC ACGGCATCAT CCCGACGCTG
GAGCCCGCCA ACCTGTCGGC CATGAACGAG AAGGAACTGG CCGTCTACCG GCTGATCCGC
GCTCATTACC TCGCGCAGTT CCTCCCACAC CATGAGTTCG ACCGGACGGT GGCGCAGTTC
TCGTGCGGCA GTCAGTCGCT GGCGGCCGTG GGCAAGCAGA TCGCCGTCAT CGGCTGGCGT
GAGGTGCTGG CGACGCCGGG GCCGGACGAT GCCGATGGCG AGGATGCGCA GCGCAGCCAG
GTGCTGCCCG CCCTGCATGC GGGCCTGTCC TGCCCGGTCG GAAAGGTGGA TCTCAAGGCG
CTGAAGACGC TGCCGCCCAA ACCCTACACG CAGGGCGAGC TGATCAAGGC CATGAAGACC
GTCGCCAAGC TCGTGACCGA CCCGCGCCTG AAGCAGAAGC TGCGAGATAC CACCGGCATC
GGCACCGAGG CGACACGCGC CAACATCATC AACGGTCTGA TCGGTCGCGG CTACCTGGTC
AAGAAAGGCC GCGCCGTCCG CGCTTCCGAC GCGGCATTCA CGCTCATCGA CGCGGTGCCC
TCAGCCATCG CCGACCCCGG CACCACGGCG GTGTGGGAGC AGGCGCTCGA CATGATCGAG
GCCGGCCAGA TGACGCTGGA CACCTTCATC GAGAAGCAGT CCGTGTGGGT CGGCCAGCTC
GTGCAGCAGT ACCGCGGCGC AACGCTCTCG CTCAAGCTGC CGCCGGCGCC GGCCTGCCCG
CAGTGCGCCG CACCGATGCA GCAGCGCACG GGCAAGAGCG GCGCGTTCTG GTCCTGCTCG
CGCTACCCGG ACTGCAAGGG CACGTTGCCG ATCGAGTCCC CGACGGGCCG GCGCAGCGCA
CCGCGCAAGC GGCGCGCTGC CTCCAAGGCG TCCTGA
 
Protein sequence
MRVFLCEKPS QGKDIARVLG AGQRGNGCYS GAGVVVTWCI GHLVEAVPPE GYGEQYKRWA 
IEQLPILPER WRVEPKAATA AQFKVVQQLV AKAGELVIAT DADREGEMIA REIIDLCGYR
GPIQRLWLSA LNDASIRKAL GALKPSDETL PLYFSALARS RADWLIGMNL SRLFTLLGRQ
AGYTGVLSVG RVQTPTLKLV VDRDREIARF VCVPFWAIEV ALSHAGQSFV ASWTPPQGSA
DDADRCLQQP VAQQAAEFLR AAGTAQVLSV ETERVREGPP LPFDLGTLQE VCSKQLGLDV
QETLDIAQAL YETHKATTYP RSDSGYLPES MLAEVPTVLN SLVKTDPSLR PLIERLDRQQ
RSRAWNDGKV SAHHGIIPTL EPANLSAMNE KELAVYRLIR AHYLAQFLPH HEFDRTVAQF
SCGSQSLAAV GKQIAVIGWR EVLATPGPDD ADGEDAQRSQ VLPALHAGLS CPVGKVDLKA
LKTLPPKPYT QGELIKAMKT VAKLVTDPRL KQKLRDTTGI GTEATRANII NGLIGRGYLV
KKGRAVRASD AAFTLIDAVP SAIADPGTTA VWEQALDMIE AGQMTLDTFI EKQSVWVGQL
VQQYRGATLS LKLPPAPACP QCAAPMQQRT GKSGAFWSCS RYPDCKGTLP IESPTGRRSA
PRKRRAASKA S