Gene Information |
Locus tag | Mlg_1789 |
Symbol | |
ID | 4268708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2041304 |
End bp | 2044393 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638126545 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_742623 |
Protein GI | 114320940 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
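The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2041304, 2044393)
print(length)                           # 3090
print(expected_protein_length(length))  # 1029
```

Both values agree with the Gene Length (3090 bp) and Protein Length (1029 aa) rows. The strand field does not affect this calculation, since the coordinates span the same interval on either strand.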
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGCC ACTCCGAACT CGCTTTTGAA ACCGCCATAG AAGCCAGCCT GACCAGTGCG GGCGGCTATG AGAAACGCAA TGCCTCAGCC TACGACGAGG CGTTGGCGAT ATTCCCAGAG GATGTCACCG GTTTTTTGCA GGACAGCCAA GCGGCCAAGT GGGGCCAACT GGAAGCCTTG CTGGGCGAAA AGACTCGCGC CACGGTGCTG CACAGCCTGG CCAAGGAGTT GGAGATCAAG GGCACCCTGC ATGTGCTGCG CTACGGCTTC AAATGTTACG GCAAGACCTT CCGGCTGGCG CATTTCCGGC CCAATTCCGG CATGAACCCG GAGGCGGCGG CCGCCTACGC ACTGAACCGG CTAACGGTCA CCCGGCAGGT GGCTTTTACT TCGGTGATGA AGAAGGCCGA TGGTCCAAAA TCAACCAAAA ACCGCCGCTG CATTATCGAT GTCACTCTGA GCTTGAACGG CCTGCCGGTG GTCACCGCAG AATTGAAAAA CTCGCTCACC GGCCAGCGGG CGGCAGATGC GGTCAAGCAA TACAAGCAGG ACCGCGATGA ACGCGACCTG TTGTTTGCGT TCAAAAAGCG CGCCTTGGTG CATTTTGCCG TGGACCCGGA CGAGGCATGG ATGACCACGC GCCTGAAGGG CAAGGATACC GTCTTTCTGC CCTTCAACCG CGGGCATGAT CATGGCGCAG GCAATCCACC AGTGGAGAAC AACTGGAAGA CCCACTACCT GTGGGATGAG GTGCTGGAGA AGGACAGCCT GATGGACATC CTGCAACGCT TCATGCACCT GGAGGTGAAG GAGCGGCAGG TCAAGACCGA CAAGGGCGTG CGCACCATCC GCAAGGAAAC CATGATCTTC CCCCGCTATC ACCAGCTCGA CGCGGTGCGC AGGCTGGTGG TCCATGCACG GGCCAATGGC TCGGGGCACA ATTATCTGGT CCAGCACTCG GCCGGTTCCG GCAAGTCCAA TTCCATCGCC TGGCTGGCGC ACCGGCTGGC CAGCCTGCAC GATGCCGAGG ATCAGAAGGT GTTCCACTCG GTGGTGGTGG TCACTGACCG GCGCGTGCTG GACCAGCAGT TGCAGAACAC CATCTACCAG TTCGAGCACA AGACCGGGGT GGTGGAGAAG ATCGACGAGA ACACCCAGCA ATTGGCTCAG GCCCTGTCCG GCGGCACGCC GATCATTATC ACCACCTTGC AGAAATTCCC CTTCATTTCC CAGGCCCTTT CTACGCTGGA GAAAAAGGGC ACAGGGGTGA AAATCAGCAC CGCCGGCAAG CGTTTTGCTG TGATTGTGGA CGAGGCGCAT TCCTCCCAGA GCGGCGAGAC AGCTTCTGAA TTGCGGGGCA TGCTGAACAA GGATGGCATC GAGGCGGCCA TAGCCGCCCA GTTGTCGGAT GAGGAAGACG ATGCGCTGTC GGATGAAGCC AAGGCCGCTA TTCTTCGCGA TTCGCTGAAG CGGGCACGGC AACCAAACCT TAGTTTCTTT GCCTTTACCG CCACGCCGAA ATTCAAGACC AAGGCCCTTT TCGATGAACC GGGCCCCTCC GGCAGCTCTC CGTTTCACGA ATACTCCATG CGTCAGGCCA TCGAAGAAGG CTTCATCATG GACGTGCTGC AGAACTACAC CACCTACAAG CGCTTCTTCG GCCTGATCAA GCAGGTTGAG AACGACCCGG AAGTACCGCG CAAGAAAGCC GCCAAGGCCC TGAGCCGCTA CCTGGAATTG CACCCGGTCA ATATCGAGCA GGTGGTCTCG GTGATCGTCG AGCACTTCCG GTTGTATGTG 
ATGAGCGAAA TGGGCGGGCG TGCCAAAGCC ATGGTAGTGA CTGGATCGCG GCTGGCTGCG GTGAAATACA AATTGGCCTT TGACCGCTAT ATCAAGGACA ACGGCTATGG CGGCATACGC TCGCTGGTCG CTTTCTCCGG TAGCGTTGAA GATCCAGACG ACCCCGGTTC CGCTTACACC GAGGTCGCGA TGAATGACGG TCTGGCGGAA AGCGAGCTGC CGGAAACCTT CGAGCGCGAG GATTACCGTG TGCTGCTGGT CGCGGAGAAA TACCAGACCG GTTTCGACCA GCCGCTGCTG CAGACCATGT ATGTAGTCAA ACGCCTGGCC GGTGTCCAGG CAGTGCAGAC CCTGTCTCGC CTCAATCGTG TGGCACCGGG CAAGGCCCGT ACCTTCGTGC TGGACTTCGC CAACGAGGAA GACGATATCT ATCAGTCCTT CAAGCCCTAC TACGAGGCTA CACCTGTCGG CGAGAATGCC GATCCGCACC AGTTGTCCGA GCTGCAACAC AAACTTATGG GTTGGGCCAT CTTTGCGCCC GATGATGTCA ATGCTTTCGC GGATGTATGG TATCGGAGCA AACGGGATCA CTCTGCTTCC GACCACCGGG TAATGAATGC AGTGTTAGAT GCGGTGGTTG CGCGCTTCAG TGACAGGGAG GAAGCGGAGC AAGAAGAGTT TCGCGGCCAG TTGACCGCCT ACCGTAATCT CTACGCTTTT CTATCTCAGA TCATTCCGTA TCAGGACAGC GAACTGGAAA AATTTTACGC CTTCGTCCGC AACTTACTTT CCAAGCTTCC CCCTCCTGGC GATGGTCAGG CCTTCGCACT CGACGACGAA GTGGCCTTGC GCTATTTCCG CCTGCAACAA ATGACGGAGG GCTCCATCGA CCTGGGCACG GGCGAGGCCT ACCCTCTGAA AGGGCCCACC GATGTCGGCA CTAGTGGCGT GAAGGAGGAA GCCGTGCCGC TATCATCACT GGTCGAAAAG CTAAACGAAC GTTTTGGCAC AGACTTTACC GAAGCCGACC AACTGTTCTT TGATCAGATC ACCGCGAGCG CCGAGGAAAA TGAAAAAATC GTTGAAGCGG CAAGGGCAAA CAACCTGCCA AACTTCTCCG CATTTCTGGA GCGAATGCTG GATGAACTGT TCATTGACCG AATGGAAAAC AACGAAGACA TATTTTCGCG TGTGATGACC GACAAGGAGT TTCGTTCAGC AGCCCATGAG CACCTTGCCG AGGAGATTTT TAAGCGAGTG CGTGAGGAAA AGCGCACCGG CGGCGGATAG
|
Protein sequence | MAGHSELAFE TAIEASLTSA GGYEKRNASA YDEALAIFPE DVTGFLQDSQ AAKWGQLEAL LGEKTRATVL HSLAKELEIK GTLHVLRYGF KCYGKTFRLA HFRPNSGMNP EAAAAYALNR LTVTRQVAFT SVMKKADGPK STKNRRCIID VTLSLNGLPV VTAELKNSLT GQRAADAVKQ YKQDRDERDL LFAFKKRALV HFAVDPDEAW MTTRLKGKDT VFLPFNRGHD HGAGNPPVEN NWKTHYLWDE VLEKDSLMDI LQRFMHLEVK ERQVKTDKGV RTIRKETMIF PRYHQLDAVR RLVVHARANG SGHNYLVQHS AGSGKSNSIA WLAHRLASLH DAEDQKVFHS VVVVTDRRVL DQQLQNTIYQ FEHKTGVVEK IDENTQQLAQ ALSGGTPIII TTLQKFPFIS QALSTLEKKG TGVKISTAGK RFAVIVDEAH SSQSGETASE LRGMLNKDGI EAAIAAQLSD EEDDALSDEA KAAILRDSLK RARQPNLSFF AFTATPKFKT KALFDEPGPS GSSPFHEYSM RQAIEEGFIM DVLQNYTTYK RFFGLIKQVE NDPEVPRKKA AKALSRYLEL HPVNIEQVVS VIVEHFRLYV MSEMGGRAKA MVVTGSRLAA VKYKLAFDRY IKDNGYGGIR SLVAFSGSVE DPDDPGSAYT EVAMNDGLAE SELPETFERE DYRVLLVAEK YQTGFDQPLL QTMYVVKRLA GVQAVQTLSR LNRVAPGKAR TFVLDFANEE DDIYQSFKPY YEATPVGENA DPHQLSELQH KLMGWAIFAP DDVNAFADVW YRSKRDHSAS DHRVMNAVLD AVVARFSDRE EAEQEEFRGQ LTAYRNLYAF LSQIIPYQDS ELEKFYAFVR NLLSKLPPPG DGQAFALDDE VALRYFRLQQ MTEGSIDLGT GEAYPLKGPT DVGTSGVKEE AVPLSSLVEK LNERFGTDFT EADQLFFDQI TASAEENEKI VEAARANNLP NFSAFLERML DELFIDRMEN NEDIFSRVMT DKEFRSAAHE HLAEEIFKRV REEKRTGGG
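The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, assuming the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), the blocks can be joined, the GC fraction computed, and the reading frame translated. The codon assignments used here are those of the standard genetic code, which translation table 11 shares; only the set of permitted start codons differs. All function names are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence above.
prefix = join_blocks("ATGGCTGGCC ACTCCGAACT CGCTTTTGAA")
print(translate(prefix))  # MAGHSELAFE
```

The translated prefix matches the first block of the protein sequence. Running the same functions on the full gene sequence should allow the length and GC content rows to be cross-checked, provided the displayed sequence is pasted in without loss.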