Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1052 |
Symbol | |
ID | 5832070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1149666 |
End bp | 1151912 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641366847 |
Product | DNA topoisomerase IV subunit A |
Protein accession | YP_001638528 |
Protein GI | 163850485 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit |
TIGRFAM ID | [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0774521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCAGC CCGTCCTGCC GCCGCCGAGC GACGGCATCG AGAGCGTCGA GCTGAAGACG GCGCTGGAGG AGCGCTACTA TGCCTATGCG CTCTCCACGA TCATGCAGCG CGCGCTGCCC GATGCCCGCG ACGGCCTGAA ACCCGTGCAC CGGCGCATCC TCTACGGCAT GCGCCTGCTG CGGCTCGACC CGACCTCGGC CTTCAAGAAA TGCGCGAAGA TCGTCGGCGA CGTGATGGGT GACTTCCACC CGCATGGCGA CCAAGCGATC TACGACGCGC TGGTGCGCCT CTCCCAGGAC TTCGCCCAGC GCTACCCGCT GGTCGATGGC CAGGGGAACT TCGGCAACAT CGACGGCGAC GGTCCGGCGG CCTACCGCTA CACCGAGGCG CGGCTGACGG AAGTCGCCCG CCTGCTGCTC GACGGGATCG ACGAGGACAC GGTCGATTTC CGCGCCTCCT ACAATGGCGA GAAGGAGGAG CCGATCGTCC TGCCGGCGGC CTTCCCGAAC CTGCTCGCCA ACGGCAGCCA GGGCATCGCG GTCGGCATGG CGACCTCGAT CCCGCCGCAC AACGCGGCCG AACTCTGCGA CGCGGCGCTC TACCTGATCC AGAACCGCGA GGCGACCTCC GAGCAGCTCT GCACCTTCGT GCAGGGGCCG GACTTCCCCA CCGGCGGCAT CCTGATCGAC TCAGCCGAGA GCATCCGCGA GGCCTACCGC ACCGGCCGCG GCGGGTTCCG CGTGCGGGCG CGCTGGGCCA AGGAGGATCT CGGCCGCGGC ACGTGGAACA TCGTCGTCAC CGAGATTCCC TACGGCGTGC CGAAGGCCCG CCTCATCGAG AAGCTCGCCG ACCTGCTTCA GGAGAAGAAG CTGCCGCTGC TGGCCGATGT GCGCGACGAA TCGGCGGAGG ATGTGCGCGT CGTGCTGGAG CCGCGCTCGC GCTCGGTCGA TCCGGTGATG CTGATGGAGT CGCTGTTCCG GCTCTCCGAG TTGGAATCGC GGATTCCGCT GAACCTCAAC GTGCTCGTCG GCGGCGTCGT GCCCCGGGTC ATCGGTCTCA CCGAGTGTCT GCGCGAGTGG GTCGATCACC GCCGCGTCGT GCTCCAGCGA CGCTCGAGCT ACCGCCTCGG CCAGATCGAG CGCCGCCTCG AAATCCTCGG CGGCCTGCTG ATCGTTTATC TCGACCTCGA CGAGGTGATC CGCATCATCC GTGAGGAGGA CGAGCCGAAG GCCGCCTTGA TGGCGCGGTT CGAACTCACC GAGGTCCAGG CCAACGCGAT CCTCGACACC CGCCTGCGCT CCCTGCGCAA GCTCGAAGAG ATGGAGCTGA AGCGCGAGTT CGAAGCGCTG ACCGCGGAGA AGGAGGGGAT CGAGGGATTG CTGGCCTCCG AGAAGCTCCA GTGGACCGAG ATCACCAAGC AGATCCGTGC GGTGAAGAAG ACGTTCGGGC CCGAGACCAA ACTCGGCCGC CGCCGCACCA CCCTCGAGAA CCCGCCCGAC ACCGCCGGAA TCGACTTCAC CGCCGCCATG GTCGAGCGCG AGCCGATCAC GGTGATCCTG TCCGAGAAGG GCTGGATCCG GGCCCTCAAG GGGCATGTGA CCGAACTGGC GGGGGTCGCC TTCAAGGGCG ACGACACGCT GAAGGTCGCC TTCCTCAGCG AAACGACGGC CAAGATCCTG CTGCTCGCCT CGAACGGCAA GGTCTTCACC ATCGAGGCCT CGAAGCTGCC CGGCGGGCGC GGCTTCGGCG ATCCGGTGCG GCTGATGGTC GATCTCGACG ACGGCACCGA GATCGTCGCG GCGTTGCCCT ACAAGCCGGA GAGCAAGCTG CTCGTCGGCG GCTCGGACGG GCGCGGCTTC ATCGCGCCCT CCGATGCGCT GGTCGCCAAC ACCCGGAAGG GCAAGGCGAT CCTCGGCCTC GACGAGGGGA CGCGCGCGGT GCTGCTGGTG CCGGCCGAGG GCGACCACGT CGCCGTCTGT TCATCCGACA AGCTGATGCT GGTCTTCCCG GCCTCCGAAG TCACGGAACT CGGCCGCGGC AAGGGCGTGC GCCTGCAGCG CTGCCGCCAG AGCCAGCTTG CGGATGCCTG CGTCTTTACG CTGGCGGAAG GCCTGCCCTG GCGCGACGGC TCGGGTCAGG CGCGGCTGGC CAATGCGGGC ATGCTGGAGA AGTGGATGGG CCACCGGTCC GATGCCGGCA CGCTGATGAC CCGTAGCTTC CCGAAATTCG AGCGGTTCGG GAAGTAA
|
Protein sequence | MGQPVLPPPS DGIESVELKT ALEERYYAYA LSTIMQRALP DARDGLKPVH RRILYGMRLL RLDPTSAFKK CAKIVGDVMG DFHPHGDQAI YDALVRLSQD FAQRYPLVDG QGNFGNIDGD GPAAYRYTEA RLTEVARLLL DGIDEDTVDF RASYNGEKEE PIVLPAAFPN LLANGSQGIA VGMATSIPPH NAAELCDAAL YLIQNREATS EQLCTFVQGP DFPTGGILID SAESIREAYR TGRGGFRVRA RWAKEDLGRG TWNIVVTEIP YGVPKARLIE KLADLLQEKK LPLLADVRDE SAEDVRVVLE PRSRSVDPVM LMESLFRLSE LESRIPLNLN VLVGGVVPRV IGLTECLREW VDHRRVVLQR RSSYRLGQIE RRLEILGGLL IVYLDLDEVI RIIREEDEPK AALMARFELT EVQANAILDT RLRSLRKLEE MELKREFEAL TAEKEGIEGL LASEKLQWTE ITKQIRAVKK TFGPETKLGR RRTTLENPPD TAGIDFTAAM VEREPITVIL SEKGWIRALK GHVTELAGVA FKGDDTLKVA FLSETTAKIL LLASNGKVFT IEASKLPGGR GFGDPVRLMV DLDDGTEIVA ALPYKPESKL LVGGSDGRGF IAPSDALVAN TRKGKAILGL DEGTRAVLLV PAEGDHVAVC SSDKLMLVFP ASEVTELGRG KGVRLQRCRQ SQLADACVFT LAEGLPWRDG SGQARLANAG MLEKWMGHRS DAGTLMTRSF PKFERFGK
|
| |