Gene Mlg_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0947 
Symbol 
ID4269681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1071703 
End bp1073595 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content68% 
IMG OID638125699 
ProductDNA topoisomerase IV subunit B 
Protein accessionYP_741791 
Protein GI114320108 
COG category[L] Replication, recombination and repair 
COG ID[COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit 
TIGRFAM ID[TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.798803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACC GTTACGACGC CGCGGATATC GAAGTCCTCA CCGGGCTGGA GCCGGTGCGC 
AAGCGCCCGG GCATGTACAC CGACACCAGC CGGCCCAACC ACCTGGCCCA GGAAGTCATC
GACAACAGCG TCGACGAGGT GATGGCCGGC CACGCCACCC GCGTGGACGT CACCCTTTTC
CGCGACGGCA GCCTGGAGGT GCGGGACGAC GGCCGGGGCA TGCCGGTGGA CGTCCACCCG
GGGCAGGGGC GCCCGGGGGT GGAGGTCATC CTCGGCACCC TGCACGCCGG CGGCAAGTTC
TCCGGCAAGA ACTACCAGTA CTCCGGCGGC CTGCACGGCG TGGGGGTGTC GGTGGTGAAT
GCCCTCTCCC GGCGGCTGGA GGTGCGGGTG CGCCGCGGTG GCATCGAGTA CATGATGAGC
TTCGCCCACG GCGAAAAGAC CTCAGAGCTG ACCGAGGTGG GCAAGGTGGC CAAGAAGGAC
ACCGGCACCC TGCTGCGCTT TTGGCCCGAC ACCAAGTACT TCGACTCACC GAAATTCTCC
ATCCCGCGGA TGCGGCATGT GCTGCGCGCC AAGGCGGTGC TCTGCCCGGG GCTGGTGGTG
CGTTTTTATG ACGAAGCCGC GGAGGAGGAG ACCGTCTGGT GTTACGAGGA CGGGCTGAAG
GACTACCTCA GCGGCGCCCT GCAGGAGTGG CAGACCCTGC CCACCGAGCC CTTCATCGGG
CGGATGAGCT CGGACCACGA GGCCGCCGAA TGGGCCGTCA CCTGGCTGCC GGAGGGCGGC
GAGGCCATCA CCGAGAGCTA CGTCAACCTC ATCCCCACCG CCCAGGGCGG CACCCACGTC
AACGGCCTGC GCTCCGGGCT CACCGAGGCC ATCCGCGAGT TCTGCGAGTT CCGCAATCTG
TTGCCGCGCG GCGTGCGTAT CACCCCGGAG GACGTCTGGG AGCGGGTCAG TTATGTGTTG
TCGGTCAAGC TGGAGGACCC GCAGTTCTCC GGTCAGACCA AGGAGCGGCT CTCCTCGCGC
GAGTGCGCCA CCTTCGTCTC GGGGGTGGTT AAGGATGCCT TCAGCCTGTG GCTGAACCAG
CACGTGGAGG ACGCCGAGGC CATCGTCCAG CTCATCATCT CCGCCGCCCA GCGGCGGATG
CGCGCGTCCC GCAAGGTGGC CCGCAAGCGC GTCACCCAGG GCCCGGCACT GCCCGGCAAG
CTGGCCGACT GCGCCGCCCA GGACCCGGCG CGCACCGAGC TCTTCCTGGT GGAGGGCGAC
TCCGCCGGCG GCTCCGCCAA ACAGGCCCGC GACCGCGAGT TCCAGGCCGT CATGCCCCTG
CGTGGCAAGA TCCTGAACAC CTGGGAGGTC GCGCCCGACG AGGTGATGGC CTCGCAGGAG
GTGCACGATA TCGCCGTGGC CCTGGGCACC GACCCAGGCT CAGAGCAGCT CGATGGCCTG
CGCTACGGCA AGATCTGCAT TCTCGCCGAC GCCGACCCCG ACGGCGCCCA CATCGCCACC
CTGCTGTGCG CCCTCTTTCT CAAGCACTTC CCGGCGCTGG TGCGCGCCGG CCATGTCTTC
GTGGCCATGC CGCCGCTCTA CCGCATCGAC GTGGGCAAAC AGACCTTCTA CGCCCTGGAC
GAGCACGAGC GCCAGGGCGT GCTCGACCGC ATCGCCGCCG AGAAGCTGAA GGGCAAGGTG
GCCGTCACCC GGTTCAAGGG CCTGGGCGAG ATGAACCCGC TGCAATTGCG CGAGACCACC
ATGGCCCCCG ACACCCGGCG ACTGGTGCAA TTGATGGTGG ACGACGCCGA GGCCACCGAG
GCGTTGATGG CCCAGTTGCT GGGCAAGCGC AATGCGTCGC AGCGGCGGCG GTGGTTGGAG
GATAAGGGGA ATATGGCGGA GGCGATGGTT TGA
 
Protein sequence
MSNRYDAADI EVLTGLEPVR KRPGMYTDTS RPNHLAQEVI DNSVDEVMAG HATRVDVTLF 
RDGSLEVRDD GRGMPVDVHP GQGRPGVEVI LGTLHAGGKF SGKNYQYSGG LHGVGVSVVN
ALSRRLEVRV RRGGIEYMMS FAHGEKTSEL TEVGKVAKKD TGTLLRFWPD TKYFDSPKFS
IPRMRHVLRA KAVLCPGLVV RFYDEAAEEE TVWCYEDGLK DYLSGALQEW QTLPTEPFIG
RMSSDHEAAE WAVTWLPEGG EAITESYVNL IPTAQGGTHV NGLRSGLTEA IREFCEFRNL
LPRGVRITPE DVWERVSYVL SVKLEDPQFS GQTKERLSSR ECATFVSGVV KDAFSLWLNQ
HVEDAEAIVQ LIISAAQRRM RASRKVARKR VTQGPALPGK LADCAAQDPA RTELFLVEGD
SAGGSAKQAR DREFQAVMPL RGKILNTWEV APDEVMASQE VHDIAVALGT DPGSEQLDGL
RYGKICILAD ADPDGAHIAT LLCALFLKHF PALVRAGHVF VAMPPLYRID VGKQTFYALD
EHERQGVLDR IAAEKLKGKV AVTRFKGLGE MNPLQLRETT MAPDTRRLVQ LMVDDAEATE
ALMAQLLGKR NASQRRRWLE DKGNMAEAMV