Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0504 |
Symbol | |
ID | 4268440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 551304 |
End bp | 554639 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125245 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_741348 |
Protein GI | 114319665 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGCC CGGAGTACAC GGACGTTGAA AAGCCCCTGA TCGACCAACT GGTCGGCCTG GGGTGGACGC ACGTGACCGG CTCGCCGGAG GACCCCGCCG CCACCCACCG GGAGAGCTTC GCCCAGGTGG TCATGGAACC ACTGCTCCGG GAACGCCTGC AGGCCATCAA CCTGCGCGAC GGCCATCCCT GGCTGGACGC CCGCCGGCTC GACCAGGCCG TCTCCGCCAT CACCCGCCTG CCCGCCAGCA AGGTCATGGA GGCCAACCGC CAGGCCACCG AACGGCTGGT CGGGGGGCTG ACCGTGGAGG GGCTGCCCGA CTGGGACGGG GGCCGGGGCC AGACCATCCG CTTCATCGAC TGGGACCGCC CGGCCAACAA CACCTTCACC GTGGTCAACC AGTTCAAGGT GAAGTGCCCG CCTGGCCACG ACACCGGCAA GGGGCACGTC ATCCCCGACC TGGTGCTGTT CGTGAACGGC ATCCCCCTGG TGGTCATCGA GTGCAAGAGC CGCACCGCCC CCGAGGGCCT CAGCGACGCG GTGGACCAGC TCCGCCGCTA CCACGACCAG CGCTTTCAGG ACTTCGAGGT GGAAGAGCAC GAGGGCGCCC CGGCCCTGTT CGCTACCAAC CAGGTGCTGG TGGCCAGCAA CTTCGACGAG GCGCGGGCGG GCACCGTGGG CGCGGCCTTC GCCCACTACC TGAGCTGGAA GACGGTGGTG CCCCGGCGGG AGTCGGAGGT GGCCGAGGCG CTGGGGGTGG CGACGCTCTC CGCCCAGCAG CGGCTGGTGG CCGGCCTGTT GGCCCCCGAC ACCCTGCTGG ATATCGCCCG CCACTACACC CTGTTCATGA ACGCCGGCGG CCAGACCATC AAGGTGGTCT GCCGCTACCA GCAGTATCGC GGCGTCACCC GCGCCATCCA CCGGCTGAAG AGCGGTAAGA CACGTGCGCA GGATGGCGAG ATCGACCGCC GGGGCGGGAT CATCTGGCAC ACCCAGGGCA GCGGTAAGAG CCTGTCCATG GTCTTTCTGG TGCGCAAACT GCGCACCGAC CCCGACCTGC GCCGGTTCAA AGTGGTGGTC ATCACCGACC GCAAGGACCT GCAGGCCCAG CTCTCCGACA CCGCCGAACT CACCGGCGAG ACGGTGGAGA CCGCCCCTGA CACCACCCGC CTGAAACGGC TGCTGGCGCG GGAGGGGCCG GGGCTGGTCT TCGGCACCAT CCAGAAGTAC CGCGACCCGG ACACCGCCGA CGAGGACCCC GACGCCACTC GGGAGGAGGG CAGGGCGGCC GATGGCCGGC AGGCCGCCGA CACCCCCGGC CGCTACACCG TCCCCGCCCG TCCCAAACGC AGCGAGCCCT TCGAGGTGCT GAACACCAGC GAGGACATCC TGGTCCTGGT GGACGAGGCC CACCGGACCC AGGCGGGGGA CCTGCACGCC AACCTGATGC GGGCCCTGCC CAACGCTGCC CGCATCGGCT TCACCGGCAC GCCCATCATC ATGGGCGACA AGAAGCGCAC CCACGATATC TTCGGCGACT ACATCGACCG CTACACCATC AAGGAGGCGG AGCAGGACGG CGCCACCGTG CCCATCCTCT ACGAAGGGCG CACCGCCAAG GGGGCCATCA AGGACGGCGC CAGCCTGGAC GGCCTGTTCG AGGACCTCTT CCGGGACCAC ACCAAGGACG AGCTGGAGGC CATCAAGAAG AAGTACGCCA CCAAGGGGCA GATCTTCGAG GCCCCGCAGC TCATCCGCGA GAAGGCCGCC GACATCCTGC GCCACTACGT CACCCACATC CTTCCCAACG GCTACAAGGC GCAGTTGGTG GCCTACAGCC GCCGGGCCGC CGTGCGCTAC CAGGAGGCCC TGAACGAGGC CCGGGAGGCG CTGCTCCAGG AGGCCGTGGC GCTCCCCGTG GAGGACAAGC AATTGGACGA CTCCGGGCTG TTGCAACGGC CCGCGCGGGT GCAGGCGGCC ATCCAGGCCT GGCGCTACCG GGACACCCTC CGCCGGCTGG AATTCGCCGT GGTCATCAGC GGCGGCAACA ACGACGACTC CGCCTGGGCC CGGTGGAGCG ACCGGGCGGC GGTGGAGAGC CACATCCAGC GGTTCAAAAA GCCGCTGACC CACGACGACC CGGACAAGGC CGATCCTCTT GCCTTCCTGA TCGTCAAATC CATGCTGCTC ACCGGCTTCG ACGCCCCCAT CGAGGGGGTG ATGTACCTGG ATCGCTCCAT CCGCGAGGCG GAACTGCTCC AGGCCGTCGC CCGGGTCAAC CGCACCGGCC ACGGCAAGAC CCACGGCCGG GTGGTGGACT ACTTCGGGGT GGCCAACCAC CTCAAGGACG CCCTGGCGGC CTACAGCGAG GAGGACATCG ACGGCGCCCT GCAAAGCCTC TCCGACGAAA TCCCCCTGCT GCGCGACCGC CACCTGCGCA CGGTGGACGT GCTGCGCCAG CGGGGCGTGG ACAGCCTGGA GGACGTGGAG GAGGCGGTGC AGGCGCTGGC GGACGAGCGG GTGCGGGCCG AGTTCACCGT CAAACTCAAG GAGTTCAGCC GCAGCCTGGA TGACGTGCTG CCCCGGCCGG AGGCGCTGGA GTTCGTCAAC GACGCCAAGC AACTGGCCTA CATCCACGCC CTGGCCCGCA ACCGCTACAA GGACACCCCC TCCCTGGGGC GCGACGTGGG CAACAAGGTG CGCAAACTCA TCGACGAGTA CGTCATCTCC CTGGGCATCG ACCCGCGCAT CCCCCCGGTG CAGCTCACCG ACGCCGACTT CGAAAAGCAC CTGGGCCGCC AGGTGGGGGA CCGGGCCAAG GCCTCCGAAA TGGAGCACGC CATCCGCTCC CACGTCCGCA AACACATGGA CGAAGACCCG GTGAAGTACG GTCGGCTCAG CGAGCGGCTG GAGGAACTGC TGCAGCAACT GGACGGCCAG TGGAAGGACC AGGTGGAGGC CCTGGAGGGG CTGATCGACC AACTGCAGGA AGGCGCCGCC GTAGCGGGCG AGGACCTGGC CGACCTCCCC ACCCACGCCG TCCCCTTCTG GCGGGAGTTG GTGGAGACCA CCGGCGCGGC ATCCGGCCCG GCCGAGGCGG ACGAGCAGGC GCGGCTATTG AAGGCCACCG AGGAACTGGT GGGCATCATC CAGGACGAGA TCGTCGTGCC CGATTTCTGG AAACCCTCCC ACATCCCCGA CCAGGAGCGG TTGCGGGGCC ACCTCTTCCA GCGCCTGATG GAGATGGATC TGGTGTCGGT CGAGGAGGTG GAGGCCCTGG TGGAGCGGCT CTTCGACCTG GCCCGGGCCA ACCACGACCG GCTGGTGGAC GCATGA
|
Protein sequence | MAGPEYTDVE KPLIDQLVGL GWTHVTGSPE DPAATHRESF AQVVMEPLLR ERLQAINLRD GHPWLDARRL DQAVSAITRL PASKVMEANR QATERLVGGL TVEGLPDWDG GRGQTIRFID WDRPANNTFT VVNQFKVKCP PGHDTGKGHV IPDLVLFVNG IPLVVIECKS RTAPEGLSDA VDQLRRYHDQ RFQDFEVEEH EGAPALFATN QVLVASNFDE ARAGTVGAAF AHYLSWKTVV PRRESEVAEA LGVATLSAQQ RLVAGLLAPD TLLDIARHYT LFMNAGGQTI KVVCRYQQYR GVTRAIHRLK SGKTRAQDGE IDRRGGIIWH TQGSGKSLSM VFLVRKLRTD PDLRRFKVVV ITDRKDLQAQ LSDTAELTGE TVETAPDTTR LKRLLAREGP GLVFGTIQKY RDPDTADEDP DATREEGRAA DGRQAADTPG RYTVPARPKR SEPFEVLNTS EDILVLVDEA HRTQAGDLHA NLMRALPNAA RIGFTGTPII MGDKKRTHDI FGDYIDRYTI KEAEQDGATV PILYEGRTAK GAIKDGASLD GLFEDLFRDH TKDELEAIKK KYATKGQIFE APQLIREKAA DILRHYVTHI LPNGYKAQLV AYSRRAAVRY QEALNEAREA LLQEAVALPV EDKQLDDSGL LQRPARVQAA IQAWRYRDTL RRLEFAVVIS GGNNDDSAWA RWSDRAAVES HIQRFKKPLT HDDPDKADPL AFLIVKSMLL TGFDAPIEGV MYLDRSIREA ELLQAVARVN RTGHGKTHGR VVDYFGVANH LKDALAAYSE EDIDGALQSL SDEIPLLRDR HLRTVDVLRQ RGVDSLEDVE EAVQALADER VRAEFTVKLK EFSRSLDDVL PRPEALEFVN DAKQLAYIHA LARNRYKDTP SLGRDVGNKV RKLIDEYVIS LGIDPRIPPV QLTDADFEKH LGRQVGDRAK ASEMEHAIRS HVRKHMDEDP VKYGRLSERL EELLQQLDGQ WKDQVEALEG LIDQLQEGAA VAGEDLADLP THAVPFWREL VETTGAASGP AEADEQARLL KATEELVGII QDEIVVPDFW KPSHIPDQER LRGHLFQRLM EMDLVSVEEV EALVERLFDL ARANHDRLVD A
|
| |