Gene Information |
Locus tag | Mlg_1542 |
Symbol | |
ID | 4270547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1762498 |
End bp | 1765812 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638126300 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_742381 |
Protein GI | 114320698 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
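The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1762498
END_BP = 1765812
REPORTED_GENE_LENGTH_BP = 3315
REPORTED_PROTEIN_LENGTH_AA = 1104


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length_from_cds(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH_BP)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the interval spans 3315 bp, which corresponds to 1105 codons, or 1104 amino acids once the stop codon is excluded.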
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.546825 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCTC GTAGACGCAA CAACCGGCCG CAGGTGCCGT TCGCCTACAA GCTGGTGCTG AACCAGTGGC TGTTCAGCCT GTTTGGGCTG GCGTCCACTG ACGGCTTCTT TACCTGGAAC GGAAAGCGCC TGCCGTTACT GGAAGCCTTC AAGCAGAAGT TTCAGTTGAG CGAAGACAGC GCCGGCGGGT TGGACGAGAA CAACATCCAC CGCTTCCACA CGGCCCTGAC CAACCAGACC GACCCGCTGC CCGAACTGCC GGCCGATCTA TTGCTCGAGT ATGACCAGAA TATCGTCCGT CATACCCAGC GTTTGAACGA GCGCCGGCTG GCGCGCGGCG AAGAGGCTAT CGCCTGGAAG TACTTCCAGT ACCTGTCGCT GCTGTTCACC GAAATCTATC TGGACCGTTA TTTCTCCGAT GCGGATGCCT TGCTGGCCTC GATCAACGAA CAGATTGAGC GCTACAACGC GGACAAGCCG GAACCGGACC AGGTCCCAAA TCTGGATCCG GAAGGCGATG TCACCGGGCA GTTGAACAAG CTGGCCTTCT GGAGCGCCAC CGGCTCCGGT AAGACGCTGA TCATGCACGC CAATATCCTG CAATATCAGC ACTACCTGAC CAAGCACCGC CGCCGCCGCG AGCTCAACCG GATCATCCTG TTGACGCCCA ACGAAGGGTT GAGCCAGCAA CACCTGCGAG ATTTCCAGGC TGCCGGGATC GAGGCGGAGT TGTTCGACAA GAACGGTCGC GGGCTGTTCG CCGGGCAGGC TGTCGAGATC ATCGACATCA ATAAACTGCG CGACGAGATG GGCGATAAGA CGGTTGCCGT CGATGCCTTC GAGAACAACA ACCTCGTGCT GATCGATGAA GGGCATCGGG GTGCCAGTTC CGGTGAGACC GGCACCTGGA TGCGCTTTCG CAATGCATTG TGCGAGCAGG GTTTTTCCTT CGAATACTCC GCCACTTTCG GCCAGGCGGT AAAGCGCAGC TCCCAACTGG CGGCCCAGTA CGAGCGTTGC GTGCTTTTCG ACTATTCCTA CAAGTATTTC TATGGCGACG GCTATGGCAA GGACTACCAG ATCCTGAACC TGGACCCGCA GACCCAGGAA AACACTATGG AGGTTTACCT GGTGGCCTGC TTGCTGTCTT TCTTCCAGCA GCTGCGCTTG TATCGCGAAC AGGCTGTGGC TTGCCGGCCG TTCAATATTG AGAAGCCGTT GTGGATCTTC GTCGGCGGCC GGGTCACCGC GTCTCTTTCG ACCAAGGATG CCTCCGACAT TGTTGAGATT CTGCGCTTTT TGGCGCGCTT TCTCGGTGAT CGTGCCGGGA GTACGGCGCG CATCCGGCGG GTGCTGGAGG AAGGCTTGAT TGCTGCCGAT GGGCGCAATC TATTTGCCCG GCGTTTTGCC TACCTGAACG GATTGGGGCT TTCGGCCGAG CAACTGTTCG AGGAAGTACA GGCCAGCCTC TTCAATGCCG CAGGGGGCGG CGCGCTGCAC GTGGAGAACC TGAAGGGTGT CCCGGGGGAA ATCGCTTTGC GCGTTGGCGA CAATGAGCCG TTCGGCGTGA TCAATGTCGG CAATGACAGC AGACTGTGCT CGCTCTGTGA GAACGAAGAC GGGCTGCTGG TCGGTGATCG GGATTTCAAG GGCTCCCTGT TCCAGTCCAT CAATGCTCCC GATTCGTCTG TCAATGTACT GATCGGCTCC AAGAAGTTCA CCGAAGGCTG GAACAGTTGG CGCGTCAGTA CCATGGGGCT GATGAATGTG GGCAGCACGG AGGGTTCGCA GATCATCCAG 
CTTTTTGGCC GCGGGGTGCG CCTGAAGGGT TATGACGGCA GCCTGAAGCG CAGTGCCAGG GTGAACCTCC CGGAGGATGT CGCGCGGCCA GGACACCTGC CGACACTCGA AACCCTGAAT ATCTTCGGCA TCAAGGCCGA CTACATGGCC CAGTTCCGCG AGTTTCTCGA AGAAGAGGGG CTGCCGTCTA ACGAGGAGCG TATCGAGTTC CTGCTGCCCG TGATTCGCAA CCTGGGTCAG AAGCAGCTCA AGACCATTCG CTTGAAGAAG ACCATCAACG GGGTAAGTAC CCAGTTCGGT GACGCCTTCA AAAAGCTCGG GCCAGTGCCG ACACTGGCCC GGCCCGATCC CGCGCAGGAG GCGGCCACCC GGTATCTGCA AACCAATCCG GTTGTGCTGA ATTGGTATCC CAAGATCCAG GCGATGCGCT CCAGTGGCGT CGGTGCCGTG GAGGAGGATC GTCAGCCGGA TCAGGGTGCC CTGAGCAAGA GACACATCGC CTTTCTCGAC CTGGACGCGC TGTATTTCGA GCTACAACGC TTCAAGGCTG AGCGGGCCTG GCACAACCTG AACCTGCCAC GGGATGTGTT GCCCGAACTG CTGGCCGACC AGAGCTGGTA TCGGTTGCTA ATCCCGGAGT CCGAGCTGGC CTTCGATTCC TTCGAGAAGA TTCACCTGTG GCAGGAACTC GCCGAGGCGC TGCTCAAGAA GTACTGTGAG CGCTACTACT CGTTCCGCAA AAAGGAATGG GAGCTGCCAC ACCTGGAGTA TCGCTATCTG GATGCCAACG ATCCGAACTT TCCTCAGGTC AATGAGGATT TCCCGGAGGG ATACCACCGC ATCCTGGTGG AAGAGTCTCA GACCGAGATT GTCGCCAAGC TCAAGGAGCT CAAAGCACAG ATTGATGCGG GCACGCTGAA ACCCTGGCAG TTTGGCGGCC TGCAAGCTAT TCCATTTGGT CGCCACCTGT ACGAGCCACT GTTGCATCTG GCGGGCTCAA CCGTCGAGAT CAGTCCCGCA CCCCTGAACC AGGGCGAGAG GCGCTTCGTG GAAGACCTGA GGGACTACTG TGACTCGAAA CCGGCATTAC TGCAGGACAA GGAACTCTAT CTGCTACGTA ACCTGAGCAA GGGCCGCGGC GTAGGCTTCT TTGAGGCAGG TAACTTCCAT CCCGATTTCA TCGTTTGGTT GATCGCTGAT GGTCAGGAGT ACATCTCGTT CGTCGATCCC AAAGGGATTC GAAACCTCGG TGCCCAGGAC CCGAAAATCC AGTTCCACCA GACCATCAAG GAAATCGAAG ATCGGCTTGG CGATTCAACT GTGACGCTGA ACAGTTTCGT GATTTCCAAT ACGCCAGCGC ATGAGATGCG ATTGCTTTGG GGGGTGGACA AGGCGCAGAT GGAAGCCTGG CATGTTCTTT TCCAGCAGGA GGACAAGGCC AGTTATATCG ATTCGCTACT CGCCAAATCA CTTGAGCCTG TCAAGTCTCC CGCGAAGGGA TCCGTTTCCA AGTAA
|
Protein sequence | MPPRRRNNRP QVPFAYKLVL NQWLFSLFGL ASTDGFFTWN GKRLPLLEAF KQKFQLSEDS AGGLDENNIH RFHTALTNQT DPLPELPADL LLEYDQNIVR HTQRLNERRL ARGEEAIAWK YFQYLSLLFT EIYLDRYFSD ADALLASINE QIERYNADKP EPDQVPNLDP EGDVTGQLNK LAFWSATGSG KTLIMHANIL QYQHYLTKHR RRRELNRIIL LTPNEGLSQQ HLRDFQAAGI EAELFDKNGR GLFAGQAVEI IDINKLRDEM GDKTVAVDAF ENNNLVLIDE GHRGASSGET GTWMRFRNAL CEQGFSFEYS ATFGQAVKRS SQLAAQYERC VLFDYSYKYF YGDGYGKDYQ ILNLDPQTQE NTMEVYLVAC LLSFFQQLRL YREQAVACRP FNIEKPLWIF VGGRVTASLS TKDASDIVEI LRFLARFLGD RAGSTARIRR VLEEGLIAAD GRNLFARRFA YLNGLGLSAE QLFEEVQASL FNAAGGGALH VENLKGVPGE IALRVGDNEP FGVINVGNDS RLCSLCENED GLLVGDRDFK GSLFQSINAP DSSVNVLIGS KKFTEGWNSW RVSTMGLMNV GSTEGSQIIQ LFGRGVRLKG YDGSLKRSAR VNLPEDVARP GHLPTLETLN IFGIKADYMA QFREFLEEEG LPSNEERIEF LLPVIRNLGQ KQLKTIRLKK TINGVSTQFG DAFKKLGPVP TLARPDPAQE AATRYLQTNP VVLNWYPKIQ AMRSSGVGAV EEDRQPDQGA LSKRHIAFLD LDALYFELQR FKAERAWHNL NLPRDVLPEL LADQSWYRLL IPESELAFDS FEKIHLWQEL AEALLKKYCE RYYSFRKKEW ELPHLEYRYL DANDPNFPQV NEDFPEGYHR ILVEESQTEI VAKLKELKAQ IDAGTLKPWQ FGGLQAIPFG RHLYEPLLHL AGSTVEISPA PLNQGERRFV EDLRDYCDSK PALLQDKELY LLRNLSKGRG VGFFEAGNFH PDFIVWLIAD GQEYISFVDP KGIRNLGAQD PKIQFHQTIK EIEDRLGDST VTLNSFVISN TPAHEMRLLW GVDKAQMEAW HVLFQQEDKA SYIDSLLAKS LEPVKSPAKG SVSK