Gene Mlg_0969 details

Gene Information

Locus tag: Mlg_0969
Symbol:
ID: 4270439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1105176
End bp: 1107425
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 69%
IMG OID: 638125720
Product: DNA topoisomerase IV subunit A
Protein accession: YP_741812
Protein GI: 114320129
COG category: [L] Replication, recombination and repair
COG ID: [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit
TIGRFAM ID: [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial
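
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1105176
END_BP = 1107425
GENE_LENGTH_BP = 2250
PROTEIN_LENGTH_AA = 749


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2250
print(expected_protein_length(GENE_LENGTH_BP))  # 749
```

Under these assumptions the listed gene length and protein length agree with the coordinates.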


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.139272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.119804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAGTA GCGCCGATAG CCTGGAGTTT GAGACCCGAC CGCTGCGAGA GTTCACCGAG 
AAGGCGTACC TGGACTATTC CATGTACGTC ATCCTGGACC GCGCCCTGCC CAACGTGGGG
GACGGCCTGA AGCCGGTGCA GCGGCGCATC GTCTACGCCA TGTCCGAGCT CGGGCTGTCC
AATCTCGCCA AGTACAAAAA GAGCGCGCGC ACGGTGGGTG ACGTGCTGGG CAAGTACCAC
CCCCATGGCG ATTCGGCCTG TTACGAGGCC ATGGTGCTCA TGGCCCAGCC CTTCTCCTAC
CGCTATCCGC TGGTGGACGG GCAGGGCAAT TGGGGCAGCG CCGACGACCC CAAGTCCTTC
GCCGCCATGC GCTATACCGA GGCGCGGCTG GCGCCGTACG CCAAGCTGCT GTTGCAGGAG
CTGGGGCAGG GCACGGTGGA TTGGGTGCCC AACTTCGACG GCACCATGGA GGAGCCGGGG
CTGCTGCCCG CCCGTGTCCC CAATGTGCTG CTGAACGGCG GTACCGGGAT CGCCGTGGGC
ATGGCCACGG ACATCCCGCC CCACAACCTG CGCGAGGTGG TCAGCGCCTG TGTGCACCTG
CTGGATGAAC CGGAGGCCGA TACCGTCGCC CTGATGGCCC ACGTGCCCGC CCCGGACTTC
CCCACCGAGG CGGAGATCAT CACGGCCAAG GACGACATCC GGCGCATCTA CGAGACCGGC
AACGGCACCC TGCGCATGCG CGCCCGTTAC GAGCGCGAGA ACGGCGACAT CATCGTTACC
GCGCTGCCCT ATCAGGTCTC CGGCAGCAAG GTGCTGGAGC AGATTGCCGG CCAGATGCAG
TCGAAGAAGC TGCCGATGGT CGAGGACCTG CGCGATGAGT CGGACCACGA GAACCCCACC
CGCCTGGTGA TCACGCCACG CTCCAACCGG GTGGATATCC ACCGGGTGAT GGAGCACCTG
TTTGCCACCA CCGACCTGGA GAAGAACTAC CGGGTCAACC TCAACGTCAT CGCCCTGGAC
GGCCGGCCGC GGGTGCTGGG GCTGCGCGAA CTGCTGCTGG AGTGGCTGAC CTTCCGGACC
GATACCGTGC GCCGGCGGCT GAACTGGCGG CTGCAGAAGG TGCAGGACCG GCTGCACATC
CTCGAGGGCC TGCTGATCGC CTACCTCAAT ATCGACGAGG TGATCGCCAT CATCCGCGAG
GAGGATGAGC CCAAGCCGGT GCTTATGGCC CGTTTCGGGC TCAGTGAGCG TCAGGCCGAG
GCCATCCTGG AGCTCAAGCT GCGCCACCTG GCCAAGCTCG AGGAGATGAA GATCCGCGGC
GAGCAGGGGG ACCTGGAGAG GGAGCGCGAC GAGTTGCAGA CCATCCTGGG TTCGGATGAG
CGGCTGCGCG AGCTTATCAA AGAGGAGCTG CGGGCCGACG CCGAGCAGTA CGGCGATGAG
CGCCGCTCCC CGCTGGTGAC CCGCTCCGCC GCCCGGGCCC TGGACGAGAC CGACCTGATG
CCCAGCGAGC CGGTCACCGT GGTGCTCTCC GAGAAGGGTT GGGTGCGCGC GGCCAAGGGC
CATGAAGTGG ACGCCCCCGG GCTCAACTAC AAGGCGGGCG ACCAGTACCG CGATCACGCC
CCGGGGCGGA GTAACCAGCA GGCGGTCTTC CTGGACCACA CCGGGCGCAG CTACTCGCTG
ACGGCACACA CCCTGCCCTC GGCCCGGGGC CAGGGCGAGC CGCTGACCGG ACGGCTGTCG
CCGGCCCCGG GCGCGCGCTT CGAGCACGTG CTCTGCGGCG ATCCGGCCAG CCTCTGGGTG
CTGGCCACCG ACGCCGGCTA CGGTTTTGTC TGCGCGCTCT CCGACATGTA CGCCAAGAAC
CGCTCCGGCA AGGCACTGCT CACCGTGCCC CAGGGCGCGC GGGTACTGGC CCCGACCCCG
GCCACCGCGG ACGAGGGCGC GGAGCTGGCG GCCGTCTCCA GCGGCGGCCG GTTGCTGGTC
TTCCCGCTTT CCGAGCTGCC GCGACTGGCC AAGGGCAAGG GCAACAAGAT CATCGGTATC
CCGGCGGCGG CGGTGAAGGC GCGCGAGGAG CTGCTGACCG GGCTGGCGGT GATCGCCCCG
GGCCAGGGGC TGAGCCTGAC GGTGGGGCGG CGGGGCATGA CCCTGAAGCC CGACGACCTG
GCCGCCTACC GCGCCCCCCG CGGCCGTCGT GGCGCGCTGT TGCCGCGTGG GCTGCGCCGG
GTGGATGCCA TCGAGCCGGT GGACCTCTAA
 
Protein sequence
MASSADSLEF ETRPLREFTE KAYLDYSMYV ILDRALPNVG DGLKPVQRRI VYAMSELGLS 
NLAKYKKSAR TVGDVLGKYH PHGDSACYEA MVLMAQPFSY RYPLVDGQGN WGSADDPKSF
AAMRYTEARL APYAKLLLQE LGQGTVDWVP NFDGTMEEPG LLPARVPNVL LNGGTGIAVG
MATDIPPHNL REVVSACVHL LDEPEADTVA LMAHVPAPDF PTEAEIITAK DDIRRIYETG
NGTLRMRARY ERENGDIIVT ALPYQVSGSK VLEQIAGQMQ SKKLPMVEDL RDESDHENPT
RLVITPRSNR VDIHRVMEHL FATTDLEKNY RVNLNVIALD GRPRVLGLRE LLLEWLTFRT
DTVRRRLNWR LQKVQDRLHI LEGLLIAYLN IDEVIAIIRE EDEPKPVLMA RFGLSERQAE
AILELKLRHL AKLEEMKIRG EQGDLERERD ELQTILGSDE RLRELIKEEL RADAEQYGDE
RRSPLVTRSA ARALDETDLM PSEPVTVVLS EKGWVRAAKG HEVDAPGLNY KAGDQYRDHA
PGRSNQQAVF LDHTGRSYSL TAHTLPSARG QGEPLTGRLS PAPGARFEHV LCGDPASLWV
LATDAGYGFV CALSDMYAKN RSGKALLTVP QGARVLAPTP ATADEGAELA AVSSGGRLLV
FPLSELPRLA KGKGNKIIGI PAAAVKAREE LLTGLAVIAP GQGLSLTVGR RGMTLKPDDL
AAYRAPRGRR GALLPRGLRR VDAIEPVDL
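
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard code for every codon (the tables differ only in permitted start codons), laid out in the usual T, C, A, G base order. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last three 10-base blocks of the gene sequence are used here.

```python
# Codon table built from the NCBI-style amino-acid string (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGGCGAGTA GCGCCGATAG CCTGGAGTTT"  # start of the gene sequence
last_blocks = "GTGGATGCCA TCGAGCCGGT GGACCTCTAA"   # end of the gene sequence

print(translate(first_blocks))  # MASSADSLEF
print(translate(last_blocks))   # VDAIEPVDL
```

The two outputs match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.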