Gene Emin_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0226 
Symbol 
ID6263127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp243002 
End bp245665 
Gene Length2664 bp 
Protein Length887 aa 
Translation table11 
GC content39% 
IMG OID642610689 
Producttype III restriction protein res subunit 
Protein accessionYP_001875125 
Protein GI187250643 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0000256753 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACTAC ATTTTGATGC TAAGCAGCCG TACCAGCAAG ACGCTATGCG GGCTATAATA 
GACGTGTTTA AAGGGCAACC TATAAATAAT GGGGACTTTG AAGCGTCTTT TGGTGGCGTG
GATGGGCTGG CACTATCTAT AATGGGTGTT AAGAATAATA TTGTACTTTC AGAAGAACAA
ATTTTAAAAA ATGTGTGTGA AGTGCAAGAA AAAAATGGCC TAGATATTGC CACTAAACTG
GAAGGGCTAA ACTTCACCAT AGAAATGGAA ACAGGTACAG GTAAAACCTA TGTTTACCTT
AGAACCATTT ATGAGCTGAA CAAACAGTAC GGCTTTAAAA AATTTGTGAT TGTGGTGCCC
AGCATAGCCA TAAAAGAAGG TGTGTTAAAA AACCTGGAAA TTACGCATGA GCACTTCCAA
GGTTTGTATG AAAACACTTC AGTAAAATTT CAGGTTTACG ACTCTGCTAA AGTATCTGCC
CTGCGTGGGT TTGCTGATAG TAATAACATT GAAATACTTG TTATAAACAT AGACTCATTT
GCCAAGGATG AAAACGTTGT AAATAATGAA AACGATAAAC TGACTGGCAA AAAACCAATA
GAGTTTATTC AGGCTACTAA CCCCATAGTT ATTGTGGACG AACCGCAAAA TATGGAAACT
GAAATACGTA AGCGTGCCAT AGCAAGACTT AACCCTGCTT GCACGTTAAG ATACTCCGCC
ACGCATAAAA ACATGTATAA CCTGCTTTAT AGTTTGGACC CTGTTAAAGC CTATGACCTA
GGGCTTGTTA AACAGATAGA AGTGGATAGC GTGATAACAG AAAACGACCA TAACAGGGCA
TTTATTGAAC TGGTGGAATT TAAACAAAAT AAAAACAGTG TGTTGGCTAA AGTTAAGATT
GAGTGTGAGA GCCCCTCTGG CGTTAAGTCA AAGGTGGTAA CCATAGGCGC AGATGAAAAG
TATGACCTCT ACCAACTTTC TGGACAGCGT GAAATATACA AAGATAATTT TGTGCCAATA
TCTTTAGACG CTGGGGAAGG CTTTGTGGAG TTTGGCAATG GCTTAAGAGT ATCTCTGGGA
CAAAGCAACA GCCAAGTGAA AGATGACATA ATGAAAACGC AGATAGAGCG CACCATAGCC
GAACATTTAG CCAAAGAAAA ACAACTGCGC CCACAAGGCA TTAAAGTATT GTCTTTGTTT
TTTATTGATA GAGTGGCAAA CTATAGGGCC TGGGATGAAA GTGGAAACAT TGTGGAAGGC
AAAATACACA AATGGTTTGA AGAAATTTAT AAAAATATTG TCAGTAAATC TGGCAATAGC
AACATTATGA GCCAGGATAT AAAAGAGATA CATAATGGCT ATTTCTCCCA AGATAAAAAG
GGACACCTGA AAGACTCCTC AGAGAGCAGG GAAACAAAAG ACGATGCTGA TACATACCAG
CTTATTATGA AAGATAAGGA ACGCTTGCTG GATGTTAATA ATCCCCTGCG CTTTATATTT
AGCCACTCTG CCCTTAGGGA AGGCTGGGAC AACCCTAATG TGTTTCAGAT ATGCACGTTA
AATGAAACCA GGTCTGAAAT GAAAAAGAGG CAAGAAATAG GCCGTGGATT ACGCCTTGCT
GTTAATGCAG ATGGTATGCG AATTTATGAC AAAAGTATAA ATAAACTAAC AGTTGTAGCT
AACGAAACTT ATGCCGATTT TTCCGCAAAC TTACAAAAAG AAATAGAAGA TGATTGCGGT
GTGGCCTTCC AGGGTAGGAT TAAAGATAAA AGAGCAAGAA AGAAGGTGGG CCTTAAGAAA
GGATTTGAAC TGGATGCTAA ATTTATTGAA CTCTGGGACA AGATTAAACA CAAAACTACG
TATAAGGTGG ACTATGACTC CCAGGACCTT ATAAAAGCCG CAGGAAAGGC CTTGAGGGAC
CTGGAAACAG AGATAAAAGC CCCTATCATC AGAACTGTTA AAACTGGCGT CAATATAACT
AGCGAAGGCG TATTTGGGAC CGTTAAAGGC AGCACAACAA AGGCTATGGC AGCTGGTTTT
GAGATACCTA ATATCATTGA CTATATACAA AGCAGGCTTA GCTCCAAATT AACCCGTAAA
ACTATATTGG AAATTATCAA GGCCTCTGGC AGGATGGAGG ATGTATTAAA GAACCCACAA
ATGTTTTTAG ACCTAGCTAT AGGCAAAATA AACAATGTAA TGAATGAGCT TATGGTAGAC
GGCATTAAAT ATGAAAAGAT AGCAGGGCAA GAATGGCAGA TGCTTTTGTT TAAGGAACAG
GAGATAGAGT CTTATATTGA AAACCTGTAC CCTATCAAGA ACCAGGATAA GACAATAGCG
GATAGCATTG TTATTGATTC CTTATCCAGC CCCGAGCGTC AGTTTGCCGA AGATTGCGAG
AATAATGATA ACGTGGACTT TTTTATAAAA CTACCCGACT GGTTTAAAAT AAGAACGCCA
ATGGGCACTT ATAATCCAGA CTGGGCGTTA ATCTACAAAA ATGATAAGAG GATATACTTT
GTAGCAGAAA CAAAATCTAC CCTATCACTG GATAGATTGC GAACGGAAGA GCAGTTAAAA
ATAAAGTGTG GTAAGGCACA TTTTAAAGAC TTTAAAGATG TTGAATTTAA GCACGTTACG
AAAGTGGGTG ATTTAATTTC TTAA
 
Protein sequence
MKLHFDAKQP YQQDAMRAII DVFKGQPINN GDFEASFGGV DGLALSIMGV KNNIVLSEEQ 
ILKNVCEVQE KNGLDIATKL EGLNFTIEME TGTGKTYVYL RTIYELNKQY GFKKFVIVVP
SIAIKEGVLK NLEITHEHFQ GLYENTSVKF QVYDSAKVSA LRGFADSNNI EILVINIDSF
AKDENVVNNE NDKLTGKKPI EFIQATNPIV IVDEPQNMET EIRKRAIARL NPACTLRYSA
THKNMYNLLY SLDPVKAYDL GLVKQIEVDS VITENDHNRA FIELVEFKQN KNSVLAKVKI
ECESPSGVKS KVVTIGADEK YDLYQLSGQR EIYKDNFVPI SLDAGEGFVE FGNGLRVSLG
QSNSQVKDDI MKTQIERTIA EHLAKEKQLR PQGIKVLSLF FIDRVANYRA WDESGNIVEG
KIHKWFEEIY KNIVSKSGNS NIMSQDIKEI HNGYFSQDKK GHLKDSSESR ETKDDADTYQ
LIMKDKERLL DVNNPLRFIF SHSALREGWD NPNVFQICTL NETRSEMKKR QEIGRGLRLA
VNADGMRIYD KSINKLTVVA NETYADFSAN LQKEIEDDCG VAFQGRIKDK RARKKVGLKK
GFELDAKFIE LWDKIKHKTT YKVDYDSQDL IKAAGKALRD LETEIKAPII RTVKTGVNIT
SEGVFGTVKG STTKAMAAGF EIPNIIDYIQ SRLSSKLTRK TILEIIKASG RMEDVLKNPQ
MFLDLAIGKI NNVMNELMVD GIKYEKIAGQ EWQMLLFKEQ EIESYIENLY PIKNQDKTIA
DSIVIDSLSS PERQFAEDCE NNDNVDFFIK LPDWFKIRTP MGTYNPDWAL IYKNDKRIYF
VAETKSTLSL DRLRTEEQLK IKCGKAHFKD FKDVEFKHVT KVGDLIS