Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0969 |
Symbol | |
ID | 4270439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1105176 |
End bp | 1107425 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125720 |
Product | DNA topoisomerase IV subunit A |
Protein accession | YP_741812 |
Protein GI | 114320129 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit |
TIGRFAM ID | [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.139272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.119804 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGTA GCGCCGATAG CCTGGAGTTT GAGACCCGAC CGCTGCGAGA GTTCACCGAG AAGGCGTACC TGGACTATTC CATGTACGTC ATCCTGGACC GCGCCCTGCC CAACGTGGGG GACGGCCTGA AGCCGGTGCA GCGGCGCATC GTCTACGCCA TGTCCGAGCT CGGGCTGTCC AATCTCGCCA AGTACAAAAA GAGCGCGCGC ACGGTGGGTG ACGTGCTGGG CAAGTACCAC CCCCATGGCG ATTCGGCCTG TTACGAGGCC ATGGTGCTCA TGGCCCAGCC CTTCTCCTAC CGCTATCCGC TGGTGGACGG GCAGGGCAAT TGGGGCAGCG CCGACGACCC CAAGTCCTTC GCCGCCATGC GCTATACCGA GGCGCGGCTG GCGCCGTACG CCAAGCTGCT GTTGCAGGAG CTGGGGCAGG GCACGGTGGA TTGGGTGCCC AACTTCGACG GCACCATGGA GGAGCCGGGG CTGCTGCCCG CCCGTGTCCC CAATGTGCTG CTGAACGGCG GTACCGGGAT CGCCGTGGGC ATGGCCACGG ACATCCCGCC CCACAACCTG CGCGAGGTGG TCAGCGCCTG TGTGCACCTG CTGGATGAAC CGGAGGCCGA TACCGTCGCC CTGATGGCCC ACGTGCCCGC CCCGGACTTC CCCACCGAGG CGGAGATCAT CACGGCCAAG GACGACATCC GGCGCATCTA CGAGACCGGC AACGGCACCC TGCGCATGCG CGCCCGTTAC GAGCGCGAGA ACGGCGACAT CATCGTTACC GCGCTGCCCT ATCAGGTCTC CGGCAGCAAG GTGCTGGAGC AGATTGCCGG CCAGATGCAG TCGAAGAAGC TGCCGATGGT CGAGGACCTG CGCGATGAGT CGGACCACGA GAACCCCACC CGCCTGGTGA TCACGCCACG CTCCAACCGG GTGGATATCC ACCGGGTGAT GGAGCACCTG TTTGCCACCA CCGACCTGGA GAAGAACTAC CGGGTCAACC TCAACGTCAT CGCCCTGGAC GGCCGGCCGC GGGTGCTGGG GCTGCGCGAA CTGCTGCTGG AGTGGCTGAC CTTCCGGACC GATACCGTGC GCCGGCGGCT GAACTGGCGG CTGCAGAAGG TGCAGGACCG GCTGCACATC CTCGAGGGCC TGCTGATCGC CTACCTCAAT ATCGACGAGG TGATCGCCAT CATCCGCGAG GAGGATGAGC CCAAGCCGGT GCTTATGGCC CGTTTCGGGC TCAGTGAGCG TCAGGCCGAG GCCATCCTGG AGCTCAAGCT GCGCCACCTG GCCAAGCTCG AGGAGATGAA GATCCGCGGC GAGCAGGGGG ACCTGGAGAG GGAGCGCGAC GAGTTGCAGA CCATCCTGGG TTCGGATGAG CGGCTGCGCG AGCTTATCAA AGAGGAGCTG CGGGCCGACG CCGAGCAGTA CGGCGATGAG CGCCGCTCCC CGCTGGTGAC CCGCTCCGCC GCCCGGGCCC TGGACGAGAC CGACCTGATG CCCAGCGAGC CGGTCACCGT GGTGCTCTCC GAGAAGGGTT GGGTGCGCGC GGCCAAGGGC CATGAAGTGG ACGCCCCCGG GCTCAACTAC AAGGCGGGCG ACCAGTACCG CGATCACGCC CCGGGGCGGA GTAACCAGCA GGCGGTCTTC CTGGACCACA CCGGGCGCAG CTACTCGCTG ACGGCACACA CCCTGCCCTC GGCCCGGGGC CAGGGCGAGC CGCTGACCGG ACGGCTGTCG CCGGCCCCGG GCGCGCGCTT CGAGCACGTG CTCTGCGGCG ATCCGGCCAG CCTCTGGGTG CTGGCCACCG ACGCCGGCTA CGGTTTTGTC TGCGCGCTCT CCGACATGTA CGCCAAGAAC CGCTCCGGCA AGGCACTGCT CACCGTGCCC CAGGGCGCGC GGGTACTGGC CCCGACCCCG GCCACCGCGG ACGAGGGCGC GGAGCTGGCG GCCGTCTCCA GCGGCGGCCG GTTGCTGGTC TTCCCGCTTT CCGAGCTGCC GCGACTGGCC AAGGGCAAGG GCAACAAGAT CATCGGTATC CCGGCGGCGG CGGTGAAGGC GCGCGAGGAG CTGCTGACCG GGCTGGCGGT GATCGCCCCG GGCCAGGGGC TGAGCCTGAC GGTGGGGCGG CGGGGCATGA CCCTGAAGCC CGACGACCTG GCCGCCTACC GCGCCCCCCG CGGCCGTCGT GGCGCGCTGT TGCCGCGTGG GCTGCGCCGG GTGGATGCCA TCGAGCCGGT GGACCTCTAA
|
Protein sequence | MASSADSLEF ETRPLREFTE KAYLDYSMYV ILDRALPNVG DGLKPVQRRI VYAMSELGLS NLAKYKKSAR TVGDVLGKYH PHGDSACYEA MVLMAQPFSY RYPLVDGQGN WGSADDPKSF AAMRYTEARL APYAKLLLQE LGQGTVDWVP NFDGTMEEPG LLPARVPNVL LNGGTGIAVG MATDIPPHNL REVVSACVHL LDEPEADTVA LMAHVPAPDF PTEAEIITAK DDIRRIYETG NGTLRMRARY ERENGDIIVT ALPYQVSGSK VLEQIAGQMQ SKKLPMVEDL RDESDHENPT RLVITPRSNR VDIHRVMEHL FATTDLEKNY RVNLNVIALD GRPRVLGLRE LLLEWLTFRT DTVRRRLNWR LQKVQDRLHI LEGLLIAYLN IDEVIAIIRE EDEPKPVLMA RFGLSERQAE AILELKLRHL AKLEEMKIRG EQGDLERERD ELQTILGSDE RLRELIKEEL RADAEQYGDE RRSPLVTRSA ARALDETDLM PSEPVTVVLS EKGWVRAAKG HEVDAPGLNY KAGDQYRDHA PGRSNQQAVF LDHTGRSYSL TAHTLPSARG QGEPLTGRLS PAPGARFEHV LCGDPASLWV LATDAGYGFV CALSDMYAKN RSGKALLTVP QGARVLAPTP ATADEGAELA AVSSGGRLLV FPLSELPRLA KGKGNKIIGI PAAAVKAREE LLTGLAVIAP GQGLSLTVGR RGMTLKPDDL AAYRAPRGRR GALLPRGLRR VDAIEPVDL
|
| |