Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1538 |
Symbol | |
ID | 4270543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1752206 |
End bp | 1755097 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638126296 |
Product | helicase domain-containing protein |
Protein accession | YP_742377 |
Protein GI | 114320694 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCA CGGCCCACCA CGCCAAATAC TATGCCTATG ACCTGACGCG TCAGGCCCCT CCCGGCGCGG CGGACCAGCT CTCCATGTCG CTGTTCGATG CGTCGGTCGA TCTGAACCCC CATCAGATCG AGGCAGCGAT GTTCGCCCTG CAGTCCCCGC TCTCCGAGGG AGTCGTGCTT GCGGACGAGG TGGGCCTGGG CAAGACCATC GAGGCGGGCA TCGTGCTGTG CCAGCGGTGG GCGGAGCGCC GCCGGCGCCT GCTGGTGATT TGTCCCGCTG CGATACGCAA GCAGTGGGCC ACCGAGTTGC AGGAGAAATT CAACCTGCCC GCTGTGGTAA TGGATTCCCG CGCCTACCGG CAATTCCAGC AACAGGGCCA ACCACGGCCA TTCGAGCAGC GGGCCGTGAT CCTGGTCTCC TACCACTATG CGGCTCGTAT GCAGGACGAC CTGCGTATGG TGCGCTGGGA GCTGGTGGTC ATTGATGAAG CCCACAAGCT GCGCAATGCC TACCGGCCCA GCAACAAAGT GGGCCAAGCC ATTCGCTGGG CCACCGAAGG CCGCCGCAAA CTGCTTTTGA CCGCTACGCC ATTGCAGAAC AGCCTGATGG AGCTGTACGG GCTGGCGAGC CTGATCGATG ATCACATCTT CGGCGATCCC AACGCCTTCC GTTCCCAGTA CGTCAATGCC GGTGGCGATC TGGCCGGGCT GCGCCAGCGT CTGGCGGGCT TTTGCAAACG CACCTTGCGC CAGGAGGTGC TTGAGTATGT GCGCTACACT GAGCGCCGGG CGCTGACCCA ACCCTTCCGG CCAACTGATG CCGAGCAGAA ACTCTACGAC CGCATCTCCG AGTTTCTGCA GCGGGAAAAC ACCTACGCCA TTCCCGACCG TCAGCGCCAC CTGATCGTGC TGATCCTGCG CAAGCTGTTG GCCTCCAGCT CAAACGCGAT CGCCGGCACT CTCGAGAGCC TTCGCGACCG GCTGATCGAC CTGCGCGACA GTGAGCAACT GGAGCTGGAC TGGAGCAGCC AACTGATCGA AAGCGAGGAA ATGGAAACCG AGCTGCTGGA CGAGTGGCTG CCGCCGGAGG AGAACGGCGA GGCCAATGGT AAGACCGAAG ACAATCGTCC GGAACCGAAG AAGCTGCGCG ATGAAATCAG CGAGATCGAC AGTCTGGCCA AGGCTGCCCG CGCCATTGGT GTGGATGCCA AATCCTGCGC ACTGCTCACC GCGCTCGATG TCGGTTTCCA GGAGATGGGT CGAATGGGGG CGCGGGAAAA GGCGTTGATC TTCACCGAAT CCCGCCGAAC CCAGGACTAT CTGAAAGACT ATCTGGAGCA GAACGGTTAT GCCGGGGAGG TGCTGCTGTT CAACGGCACC AATGCCGGCC CTGAAGCCAA GGCCATCTAC GAGAACTGGG TCGGCAAAAA CCAGGAAAGC GGCCGCGCCA CCGGTTCCCG CGCCATCGAT GCCCGGCTGG CGCTGGTGGA GCATTTCCGC GACAACGCCA AAATCATGAT TGCCACGGAA GCAGCGGCGG AAGGCGTCAA CCTGCAGTTC TGCTCGCTGG TGATCAATTA CGACCTGCCC TGGAACCCCC AGCGTATCGA ACAGCGCATT GGTCGCTGTC ACCGTTACGG CCAGCAGCAC GATGTGATCG TGGTCAACTT CCTCAACGAG CGCAACGAGG CAGACCAGCG CGTGCATGAG TTGTTGACGG AGAAGTTCAG TCTCTTCAAC GGTCTGTTTG GTGCATCCGA TGAAGTGCTG GGCCAGCTTG AGTCTGGCGT CGATTTCGAG AAGCGCATCC TCGCCATCTA CCAGCAGTGC CGCACCTCGG ATGAAATCGA AACCGCCTTT CGCCAGCTCC AAGAAGTACT GGAGGAGCAG ATCAAGAGCC GCATCCAGGA AACCCGCCAG ACCCTGCTGG AGCATTTCGA CGAAGACGTG CACGCCCGGC TGCGCTTGCG GCTGGAAGAC GCCCAGGCCC AGCTCGACCG CATCGGCCAG CGTTTCTGGC AAGTCACCGG GCACATACTG GATGGCCGCG CACGCTTTGA TGATGCCAGG CTGGTCTTCG ATCTCCACGA GCCGCCAAAG GCCGACATCC GGCCTGGCCG CTACCACCTG ATCTCCAAGA CGCGCAGTGG TGAGCAGGGC GAATCGCCGG ACTACGGCTA CTACCTGTAC CGCCTCTCCC ATCCGCTGGG CGAGCACGTC ATCGAATCGG CCAAGCAGCA GCCGACGCCC ACAGCGCATC TGCGCTTCAA CATCACCGTC CATCCGGGGC GCATTGCCCT GGTGGAAGCA CTCAAGGGTA AGTCCGGTTG GCTGACATTG GCTCGGCTGA CCATCGAATC TTTCGAAACC GAGGAATACC TGCTGTTCTC CGGCATCGAC GACGAAGGCC GCTCGCTGGA CAATGAAACC TGCGCCAAGC TGTTTAACGT CGCCGCTGAC AATCGTGGCC CGGCGGACTT GCCCGCCGAG GTGGAGCAGC GCCTTGCCGC TGAGCTTGCC CAACACCAGC GCGCCACTGC GAACCGCTCG CTGGAGGCCA ACAACCAGCA CTTCAACGCC GCCCGCGAAC GCCTGGAACA ATGGGCGGAA GACAAGATGC TGGCCACCGA GAAAACGCTC AAGGACACAA AAGAACAGAT CAAGGCCCTG CGCCGTCAGG CCCGACAGGC CGAAACCCTG GAAGATCAGC ACGACATCCA GGAACGTCTG AAACAACTCG AGCAGCGCCA GCGCAAGCAG CGCCGGGACA TCTTCAAGGT GGAAGACGAA ATCGAAGAAC AGCGCGACAG CCTCATTGCC GCGCTGGAAA AACGCCTTGC CCGAGACCAG CGTAGTGAGC GCCTGTTTAC GCTGCGCTGG TCCGTCGTAT GA
|
Protein sequence | MTITAHHAKY YAYDLTRQAP PGAADQLSMS LFDASVDLNP HQIEAAMFAL QSPLSEGVVL ADEVGLGKTI EAGIVLCQRW AERRRRLLVI CPAAIRKQWA TELQEKFNLP AVVMDSRAYR QFQQQGQPRP FEQRAVILVS YHYAARMQDD LRMVRWELVV IDEAHKLRNA YRPSNKVGQA IRWATEGRRK LLLTATPLQN SLMELYGLAS LIDDHIFGDP NAFRSQYVNA GGDLAGLRQR LAGFCKRTLR QEVLEYVRYT ERRALTQPFR PTDAEQKLYD RISEFLQREN TYAIPDRQRH LIVLILRKLL ASSSNAIAGT LESLRDRLID LRDSEQLELD WSSQLIESEE METELLDEWL PPEENGEANG KTEDNRPEPK KLRDEISEID SLAKAARAIG VDAKSCALLT ALDVGFQEMG RMGAREKALI FTESRRTQDY LKDYLEQNGY AGEVLLFNGT NAGPEAKAIY ENWVGKNQES GRATGSRAID ARLALVEHFR DNAKIMIATE AAAEGVNLQF CSLVINYDLP WNPQRIEQRI GRCHRYGQQH DVIVVNFLNE RNEADQRVHE LLTEKFSLFN GLFGASDEVL GQLESGVDFE KRILAIYQQC RTSDEIETAF RQLQEVLEEQ IKSRIQETRQ TLLEHFDEDV HARLRLRLED AQAQLDRIGQ RFWQVTGHIL DGRARFDDAR LVFDLHEPPK ADIRPGRYHL ISKTRSGEQG ESPDYGYYLY RLSHPLGEHV IESAKQQPTP TAHLRFNITV HPGRIALVEA LKGKSGWLTL ARLTIESFET EEYLLFSGID DEGRSLDNET CAKLFNVAAD NRGPADLPAE VEQRLAAELA QHQRATANRS LEANNQHFNA ARERLEQWAE DKMLATEKTL KDTKEQIKAL RRQARQAETL EDQHDIQERL KQLEQRQRKQ RRDIFKVEDE IEEQRDSLIA ALEKRLARDQ RSERLFTLRW SVV
|
| |