Gene Mlg_0504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0504 
Symbol 
ID4268440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp551304 
End bp554639 
Gene Length3336 bp 
Protein Length1111 aa 
Translation table11 
GC content69% 
IMG OID638125245 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_741348 
Protein GI114319665 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGCC CGGAGTACAC GGACGTTGAA AAGCCCCTGA TCGACCAACT GGTCGGCCTG 
GGGTGGACGC ACGTGACCGG CTCGCCGGAG GACCCCGCCG CCACCCACCG GGAGAGCTTC
GCCCAGGTGG TCATGGAACC ACTGCTCCGG GAACGCCTGC AGGCCATCAA CCTGCGCGAC
GGCCATCCCT GGCTGGACGC CCGCCGGCTC GACCAGGCCG TCTCCGCCAT CACCCGCCTG
CCCGCCAGCA AGGTCATGGA GGCCAACCGC CAGGCCACCG AACGGCTGGT CGGGGGGCTG
ACCGTGGAGG GGCTGCCCGA CTGGGACGGG GGCCGGGGCC AGACCATCCG CTTCATCGAC
TGGGACCGCC CGGCCAACAA CACCTTCACC GTGGTCAACC AGTTCAAGGT GAAGTGCCCG
CCTGGCCACG ACACCGGCAA GGGGCACGTC ATCCCCGACC TGGTGCTGTT CGTGAACGGC
ATCCCCCTGG TGGTCATCGA GTGCAAGAGC CGCACCGCCC CCGAGGGCCT CAGCGACGCG
GTGGACCAGC TCCGCCGCTA CCACGACCAG CGCTTTCAGG ACTTCGAGGT GGAAGAGCAC
GAGGGCGCCC CGGCCCTGTT CGCTACCAAC CAGGTGCTGG TGGCCAGCAA CTTCGACGAG
GCGCGGGCGG GCACCGTGGG CGCGGCCTTC GCCCACTACC TGAGCTGGAA GACGGTGGTG
CCCCGGCGGG AGTCGGAGGT GGCCGAGGCG CTGGGGGTGG CGACGCTCTC CGCCCAGCAG
CGGCTGGTGG CCGGCCTGTT GGCCCCCGAC ACCCTGCTGG ATATCGCCCG CCACTACACC
CTGTTCATGA ACGCCGGCGG CCAGACCATC AAGGTGGTCT GCCGCTACCA GCAGTATCGC
GGCGTCACCC GCGCCATCCA CCGGCTGAAG AGCGGTAAGA CACGTGCGCA GGATGGCGAG
ATCGACCGCC GGGGCGGGAT CATCTGGCAC ACCCAGGGCA GCGGTAAGAG CCTGTCCATG
GTCTTTCTGG TGCGCAAACT GCGCACCGAC CCCGACCTGC GCCGGTTCAA AGTGGTGGTC
ATCACCGACC GCAAGGACCT GCAGGCCCAG CTCTCCGACA CCGCCGAACT CACCGGCGAG
ACGGTGGAGA CCGCCCCTGA CACCACCCGC CTGAAACGGC TGCTGGCGCG GGAGGGGCCG
GGGCTGGTCT TCGGCACCAT CCAGAAGTAC CGCGACCCGG ACACCGCCGA CGAGGACCCC
GACGCCACTC GGGAGGAGGG CAGGGCGGCC GATGGCCGGC AGGCCGCCGA CACCCCCGGC
CGCTACACCG TCCCCGCCCG TCCCAAACGC AGCGAGCCCT TCGAGGTGCT GAACACCAGC
GAGGACATCC TGGTCCTGGT GGACGAGGCC CACCGGACCC AGGCGGGGGA CCTGCACGCC
AACCTGATGC GGGCCCTGCC CAACGCTGCC CGCATCGGCT TCACCGGCAC GCCCATCATC
ATGGGCGACA AGAAGCGCAC CCACGATATC TTCGGCGACT ACATCGACCG CTACACCATC
AAGGAGGCGG AGCAGGACGG CGCCACCGTG CCCATCCTCT ACGAAGGGCG CACCGCCAAG
GGGGCCATCA AGGACGGCGC CAGCCTGGAC GGCCTGTTCG AGGACCTCTT CCGGGACCAC
ACCAAGGACG AGCTGGAGGC CATCAAGAAG AAGTACGCCA CCAAGGGGCA GATCTTCGAG
GCCCCGCAGC TCATCCGCGA GAAGGCCGCC GACATCCTGC GCCACTACGT CACCCACATC
CTTCCCAACG GCTACAAGGC GCAGTTGGTG GCCTACAGCC GCCGGGCCGC CGTGCGCTAC
CAGGAGGCCC TGAACGAGGC CCGGGAGGCG CTGCTCCAGG AGGCCGTGGC GCTCCCCGTG
GAGGACAAGC AATTGGACGA CTCCGGGCTG TTGCAACGGC CCGCGCGGGT GCAGGCGGCC
ATCCAGGCCT GGCGCTACCG GGACACCCTC CGCCGGCTGG AATTCGCCGT GGTCATCAGC
GGCGGCAACA ACGACGACTC CGCCTGGGCC CGGTGGAGCG ACCGGGCGGC GGTGGAGAGC
CACATCCAGC GGTTCAAAAA GCCGCTGACC CACGACGACC CGGACAAGGC CGATCCTCTT
GCCTTCCTGA TCGTCAAATC CATGCTGCTC ACCGGCTTCG ACGCCCCCAT CGAGGGGGTG
ATGTACCTGG ATCGCTCCAT CCGCGAGGCG GAACTGCTCC AGGCCGTCGC CCGGGTCAAC
CGCACCGGCC ACGGCAAGAC CCACGGCCGG GTGGTGGACT ACTTCGGGGT GGCCAACCAC
CTCAAGGACG CCCTGGCGGC CTACAGCGAG GAGGACATCG ACGGCGCCCT GCAAAGCCTC
TCCGACGAAA TCCCCCTGCT GCGCGACCGC CACCTGCGCA CGGTGGACGT GCTGCGCCAG
CGGGGCGTGG ACAGCCTGGA GGACGTGGAG GAGGCGGTGC AGGCGCTGGC GGACGAGCGG
GTGCGGGCCG AGTTCACCGT CAAACTCAAG GAGTTCAGCC GCAGCCTGGA TGACGTGCTG
CCCCGGCCGG AGGCGCTGGA GTTCGTCAAC GACGCCAAGC AACTGGCCTA CATCCACGCC
CTGGCCCGCA ACCGCTACAA GGACACCCCC TCCCTGGGGC GCGACGTGGG CAACAAGGTG
CGCAAACTCA TCGACGAGTA CGTCATCTCC CTGGGCATCG ACCCGCGCAT CCCCCCGGTG
CAGCTCACCG ACGCCGACTT CGAAAAGCAC CTGGGCCGCC AGGTGGGGGA CCGGGCCAAG
GCCTCCGAAA TGGAGCACGC CATCCGCTCC CACGTCCGCA AACACATGGA CGAAGACCCG
GTGAAGTACG GTCGGCTCAG CGAGCGGCTG GAGGAACTGC TGCAGCAACT GGACGGCCAG
TGGAAGGACC AGGTGGAGGC CCTGGAGGGG CTGATCGACC AACTGCAGGA AGGCGCCGCC
GTAGCGGGCG AGGACCTGGC CGACCTCCCC ACCCACGCCG TCCCCTTCTG GCGGGAGTTG
GTGGAGACCA CCGGCGCGGC ATCCGGCCCG GCCGAGGCGG ACGAGCAGGC GCGGCTATTG
AAGGCCACCG AGGAACTGGT GGGCATCATC CAGGACGAGA TCGTCGTGCC CGATTTCTGG
AAACCCTCCC ACATCCCCGA CCAGGAGCGG TTGCGGGGCC ACCTCTTCCA GCGCCTGATG
GAGATGGATC TGGTGTCGGT CGAGGAGGTG GAGGCCCTGG TGGAGCGGCT CTTCGACCTG
GCCCGGGCCA ACCACGACCG GCTGGTGGAC GCATGA
 
Protein sequence
MAGPEYTDVE KPLIDQLVGL GWTHVTGSPE DPAATHRESF AQVVMEPLLR ERLQAINLRD 
GHPWLDARRL DQAVSAITRL PASKVMEANR QATERLVGGL TVEGLPDWDG GRGQTIRFID
WDRPANNTFT VVNQFKVKCP PGHDTGKGHV IPDLVLFVNG IPLVVIECKS RTAPEGLSDA
VDQLRRYHDQ RFQDFEVEEH EGAPALFATN QVLVASNFDE ARAGTVGAAF AHYLSWKTVV
PRRESEVAEA LGVATLSAQQ RLVAGLLAPD TLLDIARHYT LFMNAGGQTI KVVCRYQQYR
GVTRAIHRLK SGKTRAQDGE IDRRGGIIWH TQGSGKSLSM VFLVRKLRTD PDLRRFKVVV
ITDRKDLQAQ LSDTAELTGE TVETAPDTTR LKRLLAREGP GLVFGTIQKY RDPDTADEDP
DATREEGRAA DGRQAADTPG RYTVPARPKR SEPFEVLNTS EDILVLVDEA HRTQAGDLHA
NLMRALPNAA RIGFTGTPII MGDKKRTHDI FGDYIDRYTI KEAEQDGATV PILYEGRTAK
GAIKDGASLD GLFEDLFRDH TKDELEAIKK KYATKGQIFE APQLIREKAA DILRHYVTHI
LPNGYKAQLV AYSRRAAVRY QEALNEAREA LLQEAVALPV EDKQLDDSGL LQRPARVQAA
IQAWRYRDTL RRLEFAVVIS GGNNDDSAWA RWSDRAAVES HIQRFKKPLT HDDPDKADPL
AFLIVKSMLL TGFDAPIEGV MYLDRSIREA ELLQAVARVN RTGHGKTHGR VVDYFGVANH
LKDALAAYSE EDIDGALQSL SDEIPLLRDR HLRTVDVLRQ RGVDSLEDVE EAVQALADER
VRAEFTVKLK EFSRSLDDVL PRPEALEFVN DAKQLAYIHA LARNRYKDTP SLGRDVGNKV
RKLIDEYVIS LGIDPRIPPV QLTDADFEKH LGRQVGDRAK ASEMEHAIRS HVRKHMDEDP
VKYGRLSERL EELLQQLDGQ WKDQVEALEG LIDQLQEGAA VAGEDLADLP THAVPFWREL
VETTGAASGP AEADEQARLL KATEELVGII QDEIVVPDFW KPSHIPDQER LRGHLFQRLM
EMDLVSVEEV EALVERLFDL ARANHDRLVD A