Gene Mlg_1789 details

Gene Information

Locus tag: Mlg_1789
Symbol:
ID: 4268708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2041304
End bp: 2044393
Gene Length: 3090 bp
Protein Length: 1029 aa
Translation table: 11
GC content: 58%
IMG OID: 638126545
Product: type III restriction enzyme, res subunit
Protein accession: YP_742623
Protein GI: 114320940
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
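
The coordinate and length fields above are consistent with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2041304
end_bp = 2044393

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported 3090 bp and 1029 aa.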


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGCC ACTCCGAACT CGCTTTTGAA ACCGCCATAG AAGCCAGCCT GACCAGTGCG 
GGCGGCTATG AGAAACGCAA TGCCTCAGCC TACGACGAGG CGTTGGCGAT ATTCCCAGAG
GATGTCACCG GTTTTTTGCA GGACAGCCAA GCGGCCAAGT GGGGCCAACT GGAAGCCTTG
CTGGGCGAAA AGACTCGCGC CACGGTGCTG CACAGCCTGG CCAAGGAGTT GGAGATCAAG
GGCACCCTGC ATGTGCTGCG CTACGGCTTC AAATGTTACG GCAAGACCTT CCGGCTGGCG
CATTTCCGGC CCAATTCCGG CATGAACCCG GAGGCGGCGG CCGCCTACGC ACTGAACCGG
CTAACGGTCA CCCGGCAGGT GGCTTTTACT TCGGTGATGA AGAAGGCCGA TGGTCCAAAA
TCAACCAAAA ACCGCCGCTG CATTATCGAT GTCACTCTGA GCTTGAACGG CCTGCCGGTG
GTCACCGCAG AATTGAAAAA CTCGCTCACC GGCCAGCGGG CGGCAGATGC GGTCAAGCAA
TACAAGCAGG ACCGCGATGA ACGCGACCTG TTGTTTGCGT TCAAAAAGCG CGCCTTGGTG
CATTTTGCCG TGGACCCGGA CGAGGCATGG ATGACCACGC GCCTGAAGGG CAAGGATACC
GTCTTTCTGC CCTTCAACCG CGGGCATGAT CATGGCGCAG GCAATCCACC AGTGGAGAAC
AACTGGAAGA CCCACTACCT GTGGGATGAG GTGCTGGAGA AGGACAGCCT GATGGACATC
CTGCAACGCT TCATGCACCT GGAGGTGAAG GAGCGGCAGG TCAAGACCGA CAAGGGCGTG
CGCACCATCC GCAAGGAAAC CATGATCTTC CCCCGCTATC ACCAGCTCGA CGCGGTGCGC
AGGCTGGTGG TCCATGCACG GGCCAATGGC TCGGGGCACA ATTATCTGGT CCAGCACTCG
GCCGGTTCCG GCAAGTCCAA TTCCATCGCC TGGCTGGCGC ACCGGCTGGC CAGCCTGCAC
GATGCCGAGG ATCAGAAGGT GTTCCACTCG GTGGTGGTGG TCACTGACCG GCGCGTGCTG
GACCAGCAGT TGCAGAACAC CATCTACCAG TTCGAGCACA AGACCGGGGT GGTGGAGAAG
ATCGACGAGA ACACCCAGCA ATTGGCTCAG GCCCTGTCCG GCGGCACGCC GATCATTATC
ACCACCTTGC AGAAATTCCC CTTCATTTCC CAGGCCCTTT CTACGCTGGA GAAAAAGGGC
ACAGGGGTGA AAATCAGCAC CGCCGGCAAG CGTTTTGCTG TGATTGTGGA CGAGGCGCAT
TCCTCCCAGA GCGGCGAGAC AGCTTCTGAA TTGCGGGGCA TGCTGAACAA GGATGGCATC
GAGGCGGCCA TAGCCGCCCA GTTGTCGGAT GAGGAAGACG ATGCGCTGTC GGATGAAGCC
AAGGCCGCTA TTCTTCGCGA TTCGCTGAAG CGGGCACGGC AACCAAACCT TAGTTTCTTT
GCCTTTACCG CCACGCCGAA ATTCAAGACC AAGGCCCTTT TCGATGAACC GGGCCCCTCC
GGCAGCTCTC CGTTTCACGA ATACTCCATG CGTCAGGCCA TCGAAGAAGG CTTCATCATG
GACGTGCTGC AGAACTACAC CACCTACAAG CGCTTCTTCG GCCTGATCAA GCAGGTTGAG
AACGACCCGG AAGTACCGCG CAAGAAAGCC GCCAAGGCCC TGAGCCGCTA CCTGGAATTG
CACCCGGTCA ATATCGAGCA GGTGGTCTCG GTGATCGTCG AGCACTTCCG GTTGTATGTG
ATGAGCGAAA TGGGCGGGCG TGCCAAAGCC ATGGTAGTGA CTGGATCGCG GCTGGCTGCG
GTGAAATACA AATTGGCCTT TGACCGCTAT ATCAAGGACA ACGGCTATGG CGGCATACGC
TCGCTGGTCG CTTTCTCCGG TAGCGTTGAA GATCCAGACG ACCCCGGTTC CGCTTACACC
GAGGTCGCGA TGAATGACGG TCTGGCGGAA AGCGAGCTGC CGGAAACCTT CGAGCGCGAG
GATTACCGTG TGCTGCTGGT CGCGGAGAAA TACCAGACCG GTTTCGACCA GCCGCTGCTG
CAGACCATGT ATGTAGTCAA ACGCCTGGCC GGTGTCCAGG CAGTGCAGAC CCTGTCTCGC
CTCAATCGTG TGGCACCGGG CAAGGCCCGT ACCTTCGTGC TGGACTTCGC CAACGAGGAA
GACGATATCT ATCAGTCCTT CAAGCCCTAC TACGAGGCTA CACCTGTCGG CGAGAATGCC
GATCCGCACC AGTTGTCCGA GCTGCAACAC AAACTTATGG GTTGGGCCAT CTTTGCGCCC
GATGATGTCA ATGCTTTCGC GGATGTATGG TATCGGAGCA AACGGGATCA CTCTGCTTCC
GACCACCGGG TAATGAATGC AGTGTTAGAT GCGGTGGTTG CGCGCTTCAG TGACAGGGAG
GAAGCGGAGC AAGAAGAGTT TCGCGGCCAG TTGACCGCCT ACCGTAATCT CTACGCTTTT
CTATCTCAGA TCATTCCGTA TCAGGACAGC GAACTGGAAA AATTTTACGC CTTCGTCCGC
AACTTACTTT CCAAGCTTCC CCCTCCTGGC GATGGTCAGG CCTTCGCACT CGACGACGAA
GTGGCCTTGC GCTATTTCCG CCTGCAACAA ATGACGGAGG GCTCCATCGA CCTGGGCACG
GGCGAGGCCT ACCCTCTGAA AGGGCCCACC GATGTCGGCA CTAGTGGCGT GAAGGAGGAA
GCCGTGCCGC TATCATCACT GGTCGAAAAG CTAAACGAAC GTTTTGGCAC AGACTTTACC
GAAGCCGACC AACTGTTCTT TGATCAGATC ACCGCGAGCG CCGAGGAAAA TGAAAAAATC
GTTGAAGCGG CAAGGGCAAA CAACCTGCCA AACTTCTCCG CATTTCTGGA GCGAATGCTG
GATGAACTGT TCATTGACCG AATGGAAAAC AACGAAGACA TATTTTCGCG TGTGATGACC
GACAAGGAGT TTCGTTCAGC AGCCCATGAG CACCTTGCCG AGGAGATTTT TAAGCGAGTG
CGTGAGGAAA AGCGCACCGG CGGCGGATAG
 
Protein sequence
MAGHSELAFE TAIEASLTSA GGYEKRNASA YDEALAIFPE DVTGFLQDSQ AAKWGQLEAL 
LGEKTRATVL HSLAKELEIK GTLHVLRYGF KCYGKTFRLA HFRPNSGMNP EAAAAYALNR
LTVTRQVAFT SVMKKADGPK STKNRRCIID VTLSLNGLPV VTAELKNSLT GQRAADAVKQ
YKQDRDERDL LFAFKKRALV HFAVDPDEAW MTTRLKGKDT VFLPFNRGHD HGAGNPPVEN
NWKTHYLWDE VLEKDSLMDI LQRFMHLEVK ERQVKTDKGV RTIRKETMIF PRYHQLDAVR
RLVVHARANG SGHNYLVQHS AGSGKSNSIA WLAHRLASLH DAEDQKVFHS VVVVTDRRVL
DQQLQNTIYQ FEHKTGVVEK IDENTQQLAQ ALSGGTPIII TTLQKFPFIS QALSTLEKKG
TGVKISTAGK RFAVIVDEAH SSQSGETASE LRGMLNKDGI EAAIAAQLSD EEDDALSDEA
KAAILRDSLK RARQPNLSFF AFTATPKFKT KALFDEPGPS GSSPFHEYSM RQAIEEGFIM
DVLQNYTTYK RFFGLIKQVE NDPEVPRKKA AKALSRYLEL HPVNIEQVVS VIVEHFRLYV
MSEMGGRAKA MVVTGSRLAA VKYKLAFDRY IKDNGYGGIR SLVAFSGSVE DPDDPGSAYT
EVAMNDGLAE SELPETFERE DYRVLLVAEK YQTGFDQPLL QTMYVVKRLA GVQAVQTLSR
LNRVAPGKAR TFVLDFANEE DDIYQSFKPY YEATPVGENA DPHQLSELQH KLMGWAIFAP
DDVNAFADVW YRSKRDHSAS DHRVMNAVLD AVVARFSDRE EAEQEEFRGQ LTAYRNLYAF
LSQIIPYQDS ELEKFYAFVR NLLSKLPPPG DGQAFALDDE VALRYFRLQQ MTEGSIDLGT
GEAYPLKGPT DVGTSGVKEE AVPLSSLVEK LNERFGTDFT EADQLFFDQI TASAEENEKI
VEAARANNLP NFSAFLERML DELFIDRMEN NEDIFSRVMT DKEFRSAAHE HLAEEIFKRV
REEKRTGGG
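
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene sequence. The following is a rough sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted, which this sketch ignores). `CODON_TABLE` and `translate` are hypothetical helper names.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional T, C, A, G base order.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence shown above.
first_block = "ATGGCTGGCC ACTCCGAACT CGCTTTTGAA"
last_block = "CGTGAGGAAA AGCGCACCGG CGGCGGATAG"

print(translate(first_block))
print(translate(last_block))
```

The first block translates to the opening residues of the protein sequence, and the last block translates to its final residues followed by a stop codon.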