Gene Mext_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1052 
Symbol 
ID5832070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1149666 
End bp1151912 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content68% 
IMG OID641366847 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_001638528 
Protein GI163850485 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID[TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0774521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCAGC CCGTCCTGCC GCCGCCGAGC GACGGCATCG AGAGCGTCGA GCTGAAGACG 
GCGCTGGAGG AGCGCTACTA TGCCTATGCG CTCTCCACGA TCATGCAGCG CGCGCTGCCC
GATGCCCGCG ACGGCCTGAA ACCCGTGCAC CGGCGCATCC TCTACGGCAT GCGCCTGCTG
CGGCTCGACC CGACCTCGGC CTTCAAGAAA TGCGCGAAGA TCGTCGGCGA CGTGATGGGT
GACTTCCACC CGCATGGCGA CCAAGCGATC TACGACGCGC TGGTGCGCCT CTCCCAGGAC
TTCGCCCAGC GCTACCCGCT GGTCGATGGC CAGGGGAACT TCGGCAACAT CGACGGCGAC
GGTCCGGCGG CCTACCGCTA CACCGAGGCG CGGCTGACGG AAGTCGCCCG CCTGCTGCTC
GACGGGATCG ACGAGGACAC GGTCGATTTC CGCGCCTCCT ACAATGGCGA GAAGGAGGAG
CCGATCGTCC TGCCGGCGGC CTTCCCGAAC CTGCTCGCCA ACGGCAGCCA GGGCATCGCG
GTCGGCATGG CGACCTCGAT CCCGCCGCAC AACGCGGCCG AACTCTGCGA CGCGGCGCTC
TACCTGATCC AGAACCGCGA GGCGACCTCC GAGCAGCTCT GCACCTTCGT GCAGGGGCCG
GACTTCCCCA CCGGCGGCAT CCTGATCGAC TCAGCCGAGA GCATCCGCGA GGCCTACCGC
ACCGGCCGCG GCGGGTTCCG CGTGCGGGCG CGCTGGGCCA AGGAGGATCT CGGCCGCGGC
ACGTGGAACA TCGTCGTCAC CGAGATTCCC TACGGCGTGC CGAAGGCCCG CCTCATCGAG
AAGCTCGCCG ACCTGCTTCA GGAGAAGAAG CTGCCGCTGC TGGCCGATGT GCGCGACGAA
TCGGCGGAGG ATGTGCGCGT CGTGCTGGAG CCGCGCTCGC GCTCGGTCGA TCCGGTGATG
CTGATGGAGT CGCTGTTCCG GCTCTCCGAG TTGGAATCGC GGATTCCGCT GAACCTCAAC
GTGCTCGTCG GCGGCGTCGT GCCCCGGGTC ATCGGTCTCA CCGAGTGTCT GCGCGAGTGG
GTCGATCACC GCCGCGTCGT GCTCCAGCGA CGCTCGAGCT ACCGCCTCGG CCAGATCGAG
CGCCGCCTCG AAATCCTCGG CGGCCTGCTG ATCGTTTATC TCGACCTCGA CGAGGTGATC
CGCATCATCC GTGAGGAGGA CGAGCCGAAG GCCGCCTTGA TGGCGCGGTT CGAACTCACC
GAGGTCCAGG CCAACGCGAT CCTCGACACC CGCCTGCGCT CCCTGCGCAA GCTCGAAGAG
ATGGAGCTGA AGCGCGAGTT CGAAGCGCTG ACCGCGGAGA AGGAGGGGAT CGAGGGATTG
CTGGCCTCCG AGAAGCTCCA GTGGACCGAG ATCACCAAGC AGATCCGTGC GGTGAAGAAG
ACGTTCGGGC CCGAGACCAA ACTCGGCCGC CGCCGCACCA CCCTCGAGAA CCCGCCCGAC
ACCGCCGGAA TCGACTTCAC CGCCGCCATG GTCGAGCGCG AGCCGATCAC GGTGATCCTG
TCCGAGAAGG GCTGGATCCG GGCCCTCAAG GGGCATGTGA CCGAACTGGC GGGGGTCGCC
TTCAAGGGCG ACGACACGCT GAAGGTCGCC TTCCTCAGCG AAACGACGGC CAAGATCCTG
CTGCTCGCCT CGAACGGCAA GGTCTTCACC ATCGAGGCCT CGAAGCTGCC CGGCGGGCGC
GGCTTCGGCG ATCCGGTGCG GCTGATGGTC GATCTCGACG ACGGCACCGA GATCGTCGCG
GCGTTGCCCT ACAAGCCGGA GAGCAAGCTG CTCGTCGGCG GCTCGGACGG GCGCGGCTTC
ATCGCGCCCT CCGATGCGCT GGTCGCCAAC ACCCGGAAGG GCAAGGCGAT CCTCGGCCTC
GACGAGGGGA CGCGCGCGGT GCTGCTGGTG CCGGCCGAGG GCGACCACGT CGCCGTCTGT
TCATCCGACA AGCTGATGCT GGTCTTCCCG GCCTCCGAAG TCACGGAACT CGGCCGCGGC
AAGGGCGTGC GCCTGCAGCG CTGCCGCCAG AGCCAGCTTG CGGATGCCTG CGTCTTTACG
CTGGCGGAAG GCCTGCCCTG GCGCGACGGC TCGGGTCAGG CGCGGCTGGC CAATGCGGGC
ATGCTGGAGA AGTGGATGGG CCACCGGTCC GATGCCGGCA CGCTGATGAC CCGTAGCTTC
CCGAAATTCG AGCGGTTCGG GAAGTAA
 
Protein sequence
MGQPVLPPPS DGIESVELKT ALEERYYAYA LSTIMQRALP DARDGLKPVH RRILYGMRLL 
RLDPTSAFKK CAKIVGDVMG DFHPHGDQAI YDALVRLSQD FAQRYPLVDG QGNFGNIDGD
GPAAYRYTEA RLTEVARLLL DGIDEDTVDF RASYNGEKEE PIVLPAAFPN LLANGSQGIA
VGMATSIPPH NAAELCDAAL YLIQNREATS EQLCTFVQGP DFPTGGILID SAESIREAYR
TGRGGFRVRA RWAKEDLGRG TWNIVVTEIP YGVPKARLIE KLADLLQEKK LPLLADVRDE
SAEDVRVVLE PRSRSVDPVM LMESLFRLSE LESRIPLNLN VLVGGVVPRV IGLTECLREW
VDHRRVVLQR RSSYRLGQIE RRLEILGGLL IVYLDLDEVI RIIREEDEPK AALMARFELT
EVQANAILDT RLRSLRKLEE MELKREFEAL TAEKEGIEGL LASEKLQWTE ITKQIRAVKK
TFGPETKLGR RRTTLENPPD TAGIDFTAAM VEREPITVIL SEKGWIRALK GHVTELAGVA
FKGDDTLKVA FLSETTAKIL LLASNGKVFT IEASKLPGGR GFGDPVRLMV DLDDGTEIVA
ALPYKPESKL LVGGSDGRGF IAPSDALVAN TRKGKAILGL DEGTRAVLLV PAEGDHVAVC
SSDKLMLVFP ASEVTELGRG KGVRLQRCRQ SQLADACVFT LAEGLPWRDG SGQARLANAG
MLEKWMGHRS DAGTLMTRSF PKFERFGK