Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0947 |
Symbol | |
ID | 4269681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1071703 |
End bp | 1073595 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125699 |
Product | DNA topoisomerase IV subunit B |
Protein accession | YP_741791 |
Protein GI | 114320108 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit |
TIGRFAM ID | [TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.798803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACC GTTACGACGC CGCGGATATC GAAGTCCTCA CCGGGCTGGA GCCGGTGCGC AAGCGCCCGG GCATGTACAC CGACACCAGC CGGCCCAACC ACCTGGCCCA GGAAGTCATC GACAACAGCG TCGACGAGGT GATGGCCGGC CACGCCACCC GCGTGGACGT CACCCTTTTC CGCGACGGCA GCCTGGAGGT GCGGGACGAC GGCCGGGGCA TGCCGGTGGA CGTCCACCCG GGGCAGGGGC GCCCGGGGGT GGAGGTCATC CTCGGCACCC TGCACGCCGG CGGCAAGTTC TCCGGCAAGA ACTACCAGTA CTCCGGCGGC CTGCACGGCG TGGGGGTGTC GGTGGTGAAT GCCCTCTCCC GGCGGCTGGA GGTGCGGGTG CGCCGCGGTG GCATCGAGTA CATGATGAGC TTCGCCCACG GCGAAAAGAC CTCAGAGCTG ACCGAGGTGG GCAAGGTGGC CAAGAAGGAC ACCGGCACCC TGCTGCGCTT TTGGCCCGAC ACCAAGTACT TCGACTCACC GAAATTCTCC ATCCCGCGGA TGCGGCATGT GCTGCGCGCC AAGGCGGTGC TCTGCCCGGG GCTGGTGGTG CGTTTTTATG ACGAAGCCGC GGAGGAGGAG ACCGTCTGGT GTTACGAGGA CGGGCTGAAG GACTACCTCA GCGGCGCCCT GCAGGAGTGG CAGACCCTGC CCACCGAGCC CTTCATCGGG CGGATGAGCT CGGACCACGA GGCCGCCGAA TGGGCCGTCA CCTGGCTGCC GGAGGGCGGC GAGGCCATCA CCGAGAGCTA CGTCAACCTC ATCCCCACCG CCCAGGGCGG CACCCACGTC AACGGCCTGC GCTCCGGGCT CACCGAGGCC ATCCGCGAGT TCTGCGAGTT CCGCAATCTG TTGCCGCGCG GCGTGCGTAT CACCCCGGAG GACGTCTGGG AGCGGGTCAG TTATGTGTTG TCGGTCAAGC TGGAGGACCC GCAGTTCTCC GGTCAGACCA AGGAGCGGCT CTCCTCGCGC GAGTGCGCCA CCTTCGTCTC GGGGGTGGTT AAGGATGCCT TCAGCCTGTG GCTGAACCAG CACGTGGAGG ACGCCGAGGC CATCGTCCAG CTCATCATCT CCGCCGCCCA GCGGCGGATG CGCGCGTCCC GCAAGGTGGC CCGCAAGCGC GTCACCCAGG GCCCGGCACT GCCCGGCAAG CTGGCCGACT GCGCCGCCCA GGACCCGGCG CGCACCGAGC TCTTCCTGGT GGAGGGCGAC TCCGCCGGCG GCTCCGCCAA ACAGGCCCGC GACCGCGAGT TCCAGGCCGT CATGCCCCTG CGTGGCAAGA TCCTGAACAC CTGGGAGGTC GCGCCCGACG AGGTGATGGC CTCGCAGGAG GTGCACGATA TCGCCGTGGC CCTGGGCACC GACCCAGGCT CAGAGCAGCT CGATGGCCTG CGCTACGGCA AGATCTGCAT TCTCGCCGAC GCCGACCCCG ACGGCGCCCA CATCGCCACC CTGCTGTGCG CCCTCTTTCT CAAGCACTTC CCGGCGCTGG TGCGCGCCGG CCATGTCTTC GTGGCCATGC CGCCGCTCTA CCGCATCGAC GTGGGCAAAC AGACCTTCTA CGCCCTGGAC GAGCACGAGC GCCAGGGCGT GCTCGACCGC ATCGCCGCCG AGAAGCTGAA GGGCAAGGTG GCCGTCACCC GGTTCAAGGG CCTGGGCGAG ATGAACCCGC TGCAATTGCG CGAGACCACC ATGGCCCCCG ACACCCGGCG ACTGGTGCAA TTGATGGTGG ACGACGCCGA GGCCACCGAG GCGTTGATGG CCCAGTTGCT GGGCAAGCGC AATGCGTCGC AGCGGCGGCG GTGGTTGGAG GATAAGGGGA ATATGGCGGA GGCGATGGTT TGA
|
Protein sequence | MSNRYDAADI EVLTGLEPVR KRPGMYTDTS RPNHLAQEVI DNSVDEVMAG HATRVDVTLF RDGSLEVRDD GRGMPVDVHP GQGRPGVEVI LGTLHAGGKF SGKNYQYSGG LHGVGVSVVN ALSRRLEVRV RRGGIEYMMS FAHGEKTSEL TEVGKVAKKD TGTLLRFWPD TKYFDSPKFS IPRMRHVLRA KAVLCPGLVV RFYDEAAEEE TVWCYEDGLK DYLSGALQEW QTLPTEPFIG RMSSDHEAAE WAVTWLPEGG EAITESYVNL IPTAQGGTHV NGLRSGLTEA IREFCEFRNL LPRGVRITPE DVWERVSYVL SVKLEDPQFS GQTKERLSSR ECATFVSGVV KDAFSLWLNQ HVEDAEAIVQ LIISAAQRRM RASRKVARKR VTQGPALPGK LADCAAQDPA RTELFLVEGD SAGGSAKQAR DREFQAVMPL RGKILNTWEV APDEVMASQE VHDIAVALGT DPGSEQLDGL RYGKICILAD ADPDGAHIAT LLCALFLKHF PALVRAGHVF VAMPPLYRID VGKQTFYALD EHERQGVLDR IAAEKLKGKV AVTRFKGLGE MNPLQLRETT MAPDTRRLVQ LMVDDAEATE ALMAQLLGKR NASQRRRWLE DKGNMAEAMV
|
| |