Gene Information |
Locus tag | M446_4317 |
Symbol | |
ID | 6130603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 4757202 |
End bp | 4759448 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641644456 |
Product | DNA topoisomerase IV subunit A |
Protein accession | YP_001771094 |
Protein GI | 170742439 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit |
TIGRFAM ID | [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial |
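The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4757202
end_bp = 4759448

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon
# that is not part of the translated protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (2247 bp) and Protein Length (748 aa).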
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.175098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCAAGC CGTTCGAGCC GCCATCCGAC CGCGACGGGA TCGAGAACGT CGAGCTGAAA ACGGCGCTGG AGGAGCGCTA CCTCGCCTAC GCGCTCTCGA CGATCATGCA CCGCGCGCTG CCGGACGCGC GCGACGGGCT CAAGCCCGTC CACCGGCGCA TCCTCTACGC GATGCGGATG CTGCGCCTCG ACCCGGGCAC CGCGCACAAG AAGTGCGCCC GCGTGGTCGG CGACGTGATC GGCAAGTACC ACCCGCACGG CGACACGGCG GTCTACGACG CGCTGGTGCG CCTCGCGCAG GATTTCGCGC AGCGCTACCC GCTGGTCGAC GGCCAGGGCA ATTTCGGCAA CATCGACGGC GACAACCCGG CGGCCCAGCG CTACACCGAG TGCCGGATGA CCGAGGTGGC GCGGCTCCTC CTGGACGGGA TGGAGGAGGA CGCGGTCGAT TTCCGGCCGA ACTACGACGG CCAGGAGGAG GAGCCGGTCG TGCTCCCGGC CGCCTTCCCG AACCTGCTCG CCAACGGCGC CCAGGGCATT GCGGTCGGCA TGGCGACCTC GATCCCGCCC CACAACGCCG CCGAACTCTG CGACGCGGCG CTCTACCTCA TTACCCACCC GACCGCGACC TCCGCGCAGC TGACGAGCTT CGTGAAGGGC CCGGACTTCC CGACCGGGGG CGTGGTGATC GACTCGGCCG AGGCCATCGC GGAGGCCTAC CGGACCGGCC GCGGCGCCTT CCGGGTCCGC GCCCGCTGGA CGCGGGAGGA TCTCGGCCGC GGCCTCTGGC AGGTGGTGGT CACCGAGATC CCCTACGGCG TGCCGAAGGC GCGGCTCATC GAGAAGATGG CCGAGCTCCT GACCGAGAAG AAGCTGCCGC TCCTCGCCGA TGTCCGGGAC GAATCGGCGG AGGATGTCCG CATCGTGCTG GAGCCGCGCT CGCGCACGGT GGATCCGGTC GTGCTGATGG AATCGCTGTT CCGGCTCACC GAACTGGAGA GCCGCTTCTC CCTCAACATG AACGTGCTGA TGAGCGGGCA GGTGCCCCGG GTCGTCGGCC TCGCCGAGGT GCTGCGCGAG TGGCTCGACC ACCGCCGCAC CGTGCTGCAG CGCCGCTCGC GCCACCGCCT CGGCCAGATC GAGCGGCGCC TGGAGATCCT GGGCGGCCTG CTCATCGTCT ATCTCGACCT CGACCAGGTG ATCCGCATCA TCCGCGAGGA GGACGAGCCC AAGGCCGAGC TGATGCGCCA CTTCCAGCTC ACCGAGGTGC AGGCGAACGC GATCCTCGAC ACCCGCCTGC GCAGCCTGCG CAAGCTGGAG GAGATGGAGC TCAAGCGCGA GCAGGCCGCC CTGACCGCCG AGAAGGCGGA TCTCGACGCG CTGCTCGACT CCGAGCCGCT GCAGTGGAAG TCGATCGCCG GCCAGGTGCG GGCGGTGAAG AAGACCTTCG GGCCCGAGAC GGCGCTCGGC CGCCGCCGCA CCACCCTGGA GAACCCGCCC GACACCTCCG ACCTCGACCT CGCCGAGGCG ATGATCGAGC GCGAGCCGAT CACCGTGATC GTGTCGCAGA AGGGCTGGAT CCGGGCGCTC AAGGGCCACG TCGCCGATCT CTCGACCGTG CAGTTCAAGG GCGACGACGC GCTCAAGCTG AGCTTCCCGA CCGAGACGAC CGCGAAGCTC CTGGTTCTCG CCACCAACGG CAAGGTCTTC ACCCTGGACG CGGCCAAGCT CCCGGGCGGG CGCGGCTTCG GCGACCCGAT CCGCCTGATG GCCGATCTCG ACGAGGGCAG CGACATCGTC 
ACGGTCTTCC CCTTCCGGGC GGGGGCGAAG CTCTTGTTCG GCACGAGCGA CGGGCGCGGC TTCACCACGC TCGCGGACGG GCTGGTGGCG AATACCCGCA AGGGCAAGCA GGTCGTGGCC CTCGACGGCG CCGCCACGGT GACCCACTGC GTCGAGGCCG GGGCGGGCGA CCACGTGGCG GTCTGCGGCG AGAACCGCCT GCTGGTGGTC TTCCCGCTCA CCGAGATCCC CGACATGGCC CGCGGCAAGG GCGTCCGGCT GCAGCGCTAC CGGGAGAGCA CGCTCGCCTA TCTGAGCGTC TTCAGGCTCG CCGAGGGCCT GAGCTGGCCC GACAGCGCTG GGCGCACGCG CACCGTCGTC GGCGAGGAAC TGGCGAAGTG GGTGGGCCAC CGCGGCACCG TCGGCGCGAT GGCGCCGCGG GGCTTCCCGC GCAGCAACCG GCCGTAG
Protein sequence | MGKPFEPPSD RDGIENVELK TALEERYLAY ALSTIMHRAL PDARDGLKPV HRRILYAMRM LRLDPGTAHK KCARVVGDVI GKYHPHGDTA VYDALVRLAQ DFAQRYPLVD GQGNFGNIDG DNPAAQRYTE CRMTEVARLL LDGMEEDAVD FRPNYDGQEE EPVVLPAAFP NLLANGAQGI AVGMATSIPP HNAAELCDAA LYLITHPTAT SAQLTSFVKG PDFPTGGVVI DSAEAIAEAY RTGRGAFRVR ARWTREDLGR GLWQVVVTEI PYGVPKARLI EKMAELLTEK KLPLLADVRD ESAEDVRIVL EPRSRTVDPV VLMESLFRLT ELESRFSLNM NVLMSGQVPR VVGLAEVLRE WLDHRRTVLQ RRSRHRLGQI ERRLEILGGL LIVYLDLDQV IRIIREEDEP KAELMRHFQL TEVQANAILD TRLRSLRKLE EMELKREQAA LTAEKADLDA LLDSEPLQWK SIAGQVRAVK KTFGPETALG RRRTTLENPP DTSDLDLAEA MIEREPITVI VSQKGWIRAL KGHVADLSTV QFKGDDALKL SFPTETTAKL LVLATNGKVF TLDAAKLPGG RGFGDPIRLM ADLDEGSDIV TVFPFRAGAK LLFGTSDGRG FTTLADGLVA NTRKGKQVVA LDGAATVTHC VEAGAGDHVA VCGENRLLVV FPLTEIPDMA RGKGVRLQRY RESTLAYLSV FRLAEGLSWP DSAGRTRTVV GEELAKWVGH RGTVGAMAPR GFPRSNRP
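The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last few codons. This is a minimal sketch, not the method used to produce the record: it builds the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first 30 and last 27 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codon order T, C, A, G in each position, with the matching amino-acid string
# for the standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 27 nucleotides, copied from the Gene sequence field.
gene_start = "ATGGGCAAGC CGTTCGAGCC GCCATCCGAC"
gene_end = "GGCTTCCCGC GCAGCAACCG GCCGTAG"

print(translate(gene_start))
print(translate(gene_end))
```

The two translated fragments should match the first ten residues (MGKPFEPPSD) and the last eight residues (GFPRSNRP) of the protein sequence, with the terminal TAG codon read as the stop.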