Gene Information |
Locus tag | EcSMS35_4829 |
Symbol | |
ID | 6143084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4925727 |
End bp | 4930649 |
Gene Length | 4923 bp |
Protein Length | 1640 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641619633 |
Product | hypothetical protein |
Protein accession | YP_001746740 |
Protein GI | 170679644 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
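The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the Gene Length includes the terminal stop codon; both assumptions are consistent with the values in this record, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4925727
end_bp = 4930649
gene_length_bp = 4923
protein_length_aa = 1640


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Both checks agree with the Gene Length and Protein Length fields.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(protein_length(gene_length_bp) == protein_length_aa)
```

If either comparison printed False for another record, the usual suspects would be a different coordinate convention or a partial CDS.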
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTGG TCGGTATTAA TAACGAAAAC GAATTTTACT CTAACCACTA TTTGGGTGAG GTATTCACCA GTGATATCCG CGATGTGCTG GAACCCTGGA TAGCCCAGGA AAATGCAGCG CGTGAAGCGG AGCGTGCCGC TCGTGAACAG GGCAAAGACG TAGAGCCGGG ATACCGCGCT CCGTGGAACC AGTTTAACAG TCTGGCGACT GAGTTTTTCC GCAAACTTGC CGAGCACGAA AAACAGCGTC AGATCCCGCA GCGTCTGGCC GATCAACGTA ATCGCTGGCA GCCATTGTTA AAGGCGCTGG GCTACGAAAT TACGCCGCAG ATCCAAATGC TGGATGACGA TACACCACTG CCAGTACTGG CGCGTTACAA CAGCACTGAC GGTAGCCCGT GGCTGTGGAT TGTTGAAGCA CACGATCAGG AAGAAGGAAC GCTGGATCCG CTGGCGCTCT CCTTACTGAC CGCGCAATTC CCGGCGGATA CCGACAAACA TAAGCGCGAC AGCCTGCGCA AAAAAGCCAA CGGTGAATAT CGCAGCTGGC AGGATCTGCT CTCTACGGCG GTCTTCACCC AAAATGAACC GCCGCGTTTT GTGCTGCTGC TCGGTAACCG TCAGCTATTG TTGTTGGACC GTACTAAGTG GGCGCAAAAC CGTCTGCTAC GTTTTGATTT TGAAGAGATT TTAAGTCGTC GTGAAACGGA TACGCTGAAA GCGACGGCAG TGTTGCTACA TAAAGATTCG CTGCTGCCGG GCAGTGGGGC ACCATATCTT GACTCGCTGG ATGACAATTC GCACAAACAT GCGTTTGGTG TTTCGGAAGA TCTGAAATAT GCCCTGCGCG AAAGCATAGA GTTGTTGGGC AACGAAGCGA TGCATTATCT GATCGACCGT GGCCTGGCAA ACTATACCGG TAACCGTGCG GTGGACCCGG ATGAACTGAG CCGCGAATGT CTGCGTTACA TGTACCGCCT GCTGTTCCTG TTCTACATTG AAGCGCGCCC GGAGCTGGGT TATGCGCCAA TGACAGCCAA AACCTATCTG CAAGGTTACA GCCTGGAAAC GTTGCGCGAT CTGGAGATGA TCCCGCTGAC CAGCGAAGAA GATCGCAACG GGCGCTACTT CCACGACAGC CTGAATATGC TGTTTAAACT GGTGCGCGAA GGCTACAACG GCGGCGTGAA AATGCAGAGT GACCTGGAGA GCGGCGACCG GATCACCATC CATAGTCATC AGTTCAGCGT CCCGCGTCTG GAAAGTCATC TGTTTGATGC CAACAACACG CGCATTCTTA ACCGCGTGGT ATTCCGTAAC GAAACCCTGC AACAGATTAT CCAGGCGATG TCGTTAAGCC GCCCGGGCAA AGGGCGCTTT AACCGCCGCG GACGTATTTC TTATCGCCAG TTGGGTATCA ACCAGTTGGG TGCGGTGTAT GAGGCGCTGC TCTCCTATCG CGGATTCTTC GCCAGCGAAG ATCTCTACGA GGTGAAGAAA GCCGGGGAAG AGTTTAACGA GCTGGAGACG GGTTACTTCG TCAGTAAGGA TGAGATTGGC AAATACCACG AAGACGAGAA GGTCTACGAG AAAGACGGCA GTCTGCGCAT TCACCGCAAA GGCAGTTTTA TCTACCGTAT GGCCGGGCGC GACCGTGAGA AATCTGCCTC TTATTACACC CCGGAAGTGC TGACCCGCTC ACTGGTTAAA TATGCCCTGA AAGAACTGTT TAAAGAGCAA ATTGATCCGA TTACCGATCT GCACGCTAAA GCTGATGCCA TCTTAAACCT CACCGTGTGC 
GAACCGGCGA TGGGCAGCGC GGCGTTCCTT AACGAAGCCA TCAACCAGCT GGCGGAAGCG TATCTGTTCC ACAAGCAGCA GGCGGAAGGT CGCCGTATTC CGCAGGATCG TTACACCCAG GAGTTACAGC GGGTGAAAAT GTACATTGCC GACAACAACG TTTTCGGCGT GGACTTAAAC CCGGTGGCGG TGGAACTGGC GGAAGTGTCG CTGTGGCTGA ACGCCATCAG TGGCGATGCC TTTGTACCGT GGTTTGGTTA CCAGCTGCAC TGCGGTAACT CGCTGGTGGG CGCGCGCCGT CAGGTGTTCA ACAAGAGTGA ACTGACCTAC AAAAAAGCCA AAGATCCGAG CTGGCTTAAC AGCGAGCCGG TCGAACTGGC GATGAACACG CCGCGTGAAG AGACGCAGAT TTTTCACTTC CTGCTGCCCG ACGGCGGTAT GGCTAACTAC AGCGATAAAA CCGTTAAGCA GCGTTATCCG GATGACTTCA AAGCCCTGGA CAGCTGGCGC AAAGAGTTTA TTAAAAGCTT TGCCGGGCAT GAGATTGCTG ATGTGCAGCG TATCAGCGAA AAGGTGGAAG CTCTGTGGAA CACCTATCGC CAGCAACTTA AAGCAGAACG TCTGAAAACC GCCGACAGCT ACCCGGTGTG GCCGGCAGAA AACAGCGAGC AGACGCGTTC TTCGCTGAGC AGTAAAGATG AAACCTTCAG TGGTCGTCTT GAAGATAACA GTGCCTACCA GAAGCTGCGT TGGGTGATGG ACTACTGGTG CGCGCTGTGG TTCTGGCCGA TCGACAAAGC CGATGAGCTA CCGGATCGCG GCACCTGGTT GTTTGAGATT GAAACCCTGC TCGACGGGAT TGTAATCACG GAAAAAGTCA CTGAAGTTGC GGAGCACACC ACCGGCGATC TGTTTGCCGA AGAAGGTCTG CTGCGAGAAG AGTCTTCACT GTTTTCCGGC GCAGGTCGTC TGAAAACCGA GGTGTTGTTC CGTCATTTGC CGCGTCTGGC GATTGTCGAT GCCCTGAGAA AGCAGTACCG TTTCTTCCAC TGGGATCTGG AGTTCTGCGA CCTGTTTGCC GAGCGCGGCG GTTTTGACCT GATGCTCGGA AACCCGCCGT GGCTAAAAGT TGTATGGGAA GAACGCGGAA TATTGGGAGA TTATAAACCT GAGCTTGTAC TGCACAATCT TAGCGCTGCC AATATATCTT CTCTCCGAGA AGAATCTTTT GCACGAGTCT ATGGATTAGA AGATGCATGG CGAAGTGAGT TTGAATCTGC CGTTGGATTA CAGAACTATC TAAATAGTCA GCAAAATTTT TCATGCTTGT TAGGGCAAAA AACAAATCTA TATAAATGTT TTTTACCAGT CTCATGGCGA CTTATTTCTG AACATGGTGT TTCAGGGTTA TTGCACCCTG ATGGAGTTTA TGATGATAGC AAGGCGGGGG AACTTCGTGC CAAAATTTAC TGCCGTTTGC GTGACCATTA TCAATTTCAA AATGAATTAG GGCTTTTTCC CGATGTTCAT CATGCAACAA CATTTAGTAT TAATATTTAT GGTCCATTGC AAGAATCGCC TTCATTTACA CACATGTCAA ATTTGTTCTC TGTGTATACA ATAGATTCAA GTATTTTGCA TGATAATCAA GGGGAAATTC CTGGTATCAA GGAAGAATCT TACAATGGGG ATAAACTGAA AATAAATTGG AATATATCAC CCCATAAATC AAGAGTATTA ACAATTACTA TTGATGAGTT ACGTTTGTTT GCGGGGATAT TTGATGATAA TTCAACACCT GCACTGCACG 
CAAGGCTTCC AGCTTTACAT ACAGTACAAT TATTGGAAGT TTTACGTAAA TTTGATGTTA AATCAAAAAA ACTATCTTCA ATAAAAAATA ATTATTATTC TACAGTGATG TTTGATGAAA CATTTGCTCA AAGAGATGGT ACAACTAAAA GGAAAACGGA ATTCCCCTCT GGAATTTCAC AATGGATATT ATCTGGACCA CATTTCTTTG TGGGGAATGC TTATTACAAA ACGCCACGGA CTTTATGTAA ATTAAATAGT GACTATGACT CCATTGATTT GATGACTATT CCTGATAATT ATATGCCTAG GGTTAATTAT ATTCCTAATT GCGAGTATTC CGATTTTCTA AATCGAATTC CGACAGTTTC TTGGGATGGT AATGTCTTTC CTGTGACTAA TGGCTATAGA TTTGTTAATC GGGAAATGAT AGGTTCAACT TCTGAACGAA CTTTCATCGC AACTATAATA CCTCCCGGTA TTTCTCATAT TAATACGTGT CTAAGCACTA TTTTTAAACA TGACTATGAT TTACTTGATT TCTTTTCGAT GTCTTTGTCC ATTGTCGTAG ATTATAGAGT CAAGTGTACT GGCATGGGGC ATGCGAATCA GTCATTAGTT AATCAACTTC CATTATTAAG TAATGAGAAG TTTAGAGCCT CTTTACATGC AAGGTCGATG GCTTTGGTTA GTATAACAGA GCATTATAAA AAATTGTGGT GTTCCATTTA TTCCTCAGAA TTTAAAATAC AATATTGGAG CCGCGATCTC CCACAGCTCC CCCAGGATTT CTTCACCAAT CTGACCCCAG AATGGCAGCG TAACTGCGCC TTACGCTCCG ACTACAGCCG CCGTCAGGCG CTGGTGGAAA TCGACGTGCT GGTGGCGCAG GCGCTGGGGC TAACCCTCGA AGAACTACTC GCCATCTACC GTATTCAGTT CCCGGTGATG CGCCAGTACG AAGCGGACAC CTGGTACGAT CAAAACGGTC GCATTATCTT TACCCCAAGC AAAGGGCTGG TGGGCGTTGG CTTGCCTCGT ACCGCGCGTA AAGCTGACCT GAAAAACGGC TTTGTCTTTA ACGTCGACAG CCCGGACTGG ACCGGCGGTG ACTGCACCGA TCAAGCCATC GGCTGGGATG ATGTCAAACA TCTGCAAACC GGTACCGTCA GCGTTACCTT TGATGACTAT ACCCGCAGCG ACGAAGGCGA GCGCCGTACC GTCACCTGGC AGGCACCGTT TATCAAGCCG GATCGCGAAG ACGACTACAA AGTGGCCTGG GCGTTCTTTG CACAAGATAA GGAGAGCGCC TGA
|
Protein sequence | MALVGINNEN EFYSNHYLGE VFTSDIRDVL EPWIAQENAA REAERAAREQ GKDVEPGYRA PWNQFNSLAT EFFRKLAEHE KQRQIPQRLA DQRNRWQPLL KALGYEITPQ IQMLDDDTPL PVLARYNSTD GSPWLWIVEA HDQEEGTLDP LALSLLTAQF PADTDKHKRD SLRKKANGEY RSWQDLLSTA VFTQNEPPRF VLLLGNRQLL LLDRTKWAQN RLLRFDFEEI LSRRETDTLK ATAVLLHKDS LLPGSGAPYL DSLDDNSHKH AFGVSEDLKY ALRESIELLG NEAMHYLIDR GLANYTGNRA VDPDELSREC LRYMYRLLFL FYIEARPELG YAPMTAKTYL QGYSLETLRD LEMIPLTSEE DRNGRYFHDS LNMLFKLVRE GYNGGVKMQS DLESGDRITI HSHQFSVPRL ESHLFDANNT RILNRVVFRN ETLQQIIQAM SLSRPGKGRF NRRGRISYRQ LGINQLGAVY EALLSYRGFF ASEDLYEVKK AGEEFNELET GYFVSKDEIG KYHEDEKVYE KDGSLRIHRK GSFIYRMAGR DREKSASYYT PEVLTRSLVK YALKELFKEQ IDPITDLHAK ADAILNLTVC EPAMGSAAFL NEAINQLAEA YLFHKQQAEG RRIPQDRYTQ ELQRVKMYIA DNNVFGVDLN PVAVELAEVS LWLNAISGDA FVPWFGYQLH CGNSLVGARR QVFNKSELTY KKAKDPSWLN SEPVELAMNT PREETQIFHF LLPDGGMANY SDKTVKQRYP DDFKALDSWR KEFIKSFAGH EIADVQRISE KVEALWNTYR QQLKAERLKT ADSYPVWPAE NSEQTRSSLS SKDETFSGRL EDNSAYQKLR WVMDYWCALW FWPIDKADEL PDRGTWLFEI ETLLDGIVIT EKVTEVAEHT TGDLFAEEGL LREESSLFSG AGRLKTEVLF RHLPRLAIVD ALRKQYRFFH WDLEFCDLFA ERGGFDLMLG NPPWLKVVWE ERGILGDYKP ELVLHNLSAA NISSLREESF ARVYGLEDAW RSEFESAVGL QNYLNSQQNF SCLLGQKTNL YKCFLPVSWR LISEHGVSGL LHPDGVYDDS KAGELRAKIY CRLRDHYQFQ NELGLFPDVH HATTFSINIY GPLQESPSFT HMSNLFSVYT IDSSILHDNQ GEIPGIKEES YNGDKLKINW NISPHKSRVL TITIDELRLF AGIFDDNSTP ALHARLPALH TVQLLEVLRK FDVKSKKLSS IKNNYYSTVM FDETFAQRDG TTKRKTEFPS GISQWILSGP HFFVGNAYYK TPRTLCKLNS DYDSIDLMTI PDNYMPRVNY IPNCEYSDFL NRIPTVSWDG NVFPVTNGYR FVNREMIGST SERTFIATII PPGISHINTC LSTIFKHDYD LLDFFSMSLS IVVDYRVKCT GMGHANQSLV NQLPLLSNEK FRASLHARSM ALVSITEHYK KLWCSIYSSE FKIQYWSRDL PQLPQDFFTN LTPEWQRNCA LRSDYSRRQA LVEIDVLVAQ ALGLTLEELL AIYRIQFPVM RQYEADTWYD QNGRIIFTPS KGLVGVGLPR TARKADLKNG FVFNVDSPDW TGGDCTDQAI GWDDVKHLQT GTVSVTFDDY TRSDEGERRT VTWQAPFIKP DREDDYKVAW AFFAQDKESA
|
| |