Gene EcSMS35_4829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4829 
Symbol 
ID6143084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4925727 
End bp4930649 
Gene Length4923 bp 
Protein Length1640 aa 
Translation table11 
GC content49% 
IMG OID641619633 
Producthypothetical protein 
Protein accessionYP_001746740 
Protein GI170679644 
COG category[V] Defense mechanisms 
COG ID[COG1002] Type II restriction enzyme, methylase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTGG TCGGTATTAA TAACGAAAAC GAATTTTACT CTAACCACTA TTTGGGTGAG 
GTATTCACCA GTGATATCCG CGATGTGCTG GAACCCTGGA TAGCCCAGGA AAATGCAGCG
CGTGAAGCGG AGCGTGCCGC TCGTGAACAG GGCAAAGACG TAGAGCCGGG ATACCGCGCT
CCGTGGAACC AGTTTAACAG TCTGGCGACT GAGTTTTTCC GCAAACTTGC CGAGCACGAA
AAACAGCGTC AGATCCCGCA GCGTCTGGCC GATCAACGTA ATCGCTGGCA GCCATTGTTA
AAGGCGCTGG GCTACGAAAT TACGCCGCAG ATCCAAATGC TGGATGACGA TACACCACTG
CCAGTACTGG CGCGTTACAA CAGCACTGAC GGTAGCCCGT GGCTGTGGAT TGTTGAAGCA
CACGATCAGG AAGAAGGAAC GCTGGATCCG CTGGCGCTCT CCTTACTGAC CGCGCAATTC
CCGGCGGATA CCGACAAACA TAAGCGCGAC AGCCTGCGCA AAAAAGCCAA CGGTGAATAT
CGCAGCTGGC AGGATCTGCT CTCTACGGCG GTCTTCACCC AAAATGAACC GCCGCGTTTT
GTGCTGCTGC TCGGTAACCG TCAGCTATTG TTGTTGGACC GTACTAAGTG GGCGCAAAAC
CGTCTGCTAC GTTTTGATTT TGAAGAGATT TTAAGTCGTC GTGAAACGGA TACGCTGAAA
GCGACGGCAG TGTTGCTACA TAAAGATTCG CTGCTGCCGG GCAGTGGGGC ACCATATCTT
GACTCGCTGG ATGACAATTC GCACAAACAT GCGTTTGGTG TTTCGGAAGA TCTGAAATAT
GCCCTGCGCG AAAGCATAGA GTTGTTGGGC AACGAAGCGA TGCATTATCT GATCGACCGT
GGCCTGGCAA ACTATACCGG TAACCGTGCG GTGGACCCGG ATGAACTGAG CCGCGAATGT
CTGCGTTACA TGTACCGCCT GCTGTTCCTG TTCTACATTG AAGCGCGCCC GGAGCTGGGT
TATGCGCCAA TGACAGCCAA AACCTATCTG CAAGGTTACA GCCTGGAAAC GTTGCGCGAT
CTGGAGATGA TCCCGCTGAC CAGCGAAGAA GATCGCAACG GGCGCTACTT CCACGACAGC
CTGAATATGC TGTTTAAACT GGTGCGCGAA GGCTACAACG GCGGCGTGAA AATGCAGAGT
GACCTGGAGA GCGGCGACCG GATCACCATC CATAGTCATC AGTTCAGCGT CCCGCGTCTG
GAAAGTCATC TGTTTGATGC CAACAACACG CGCATTCTTA ACCGCGTGGT ATTCCGTAAC
GAAACCCTGC AACAGATTAT CCAGGCGATG TCGTTAAGCC GCCCGGGCAA AGGGCGCTTT
AACCGCCGCG GACGTATTTC TTATCGCCAG TTGGGTATCA ACCAGTTGGG TGCGGTGTAT
GAGGCGCTGC TCTCCTATCG CGGATTCTTC GCCAGCGAAG ATCTCTACGA GGTGAAGAAA
GCCGGGGAAG AGTTTAACGA GCTGGAGACG GGTTACTTCG TCAGTAAGGA TGAGATTGGC
AAATACCACG AAGACGAGAA GGTCTACGAG AAAGACGGCA GTCTGCGCAT TCACCGCAAA
GGCAGTTTTA TCTACCGTAT GGCCGGGCGC GACCGTGAGA AATCTGCCTC TTATTACACC
CCGGAAGTGC TGACCCGCTC ACTGGTTAAA TATGCCCTGA AAGAACTGTT TAAAGAGCAA
ATTGATCCGA TTACCGATCT GCACGCTAAA GCTGATGCCA TCTTAAACCT CACCGTGTGC
GAACCGGCGA TGGGCAGCGC GGCGTTCCTT AACGAAGCCA TCAACCAGCT GGCGGAAGCG
TATCTGTTCC ACAAGCAGCA GGCGGAAGGT CGCCGTATTC CGCAGGATCG TTACACCCAG
GAGTTACAGC GGGTGAAAAT GTACATTGCC GACAACAACG TTTTCGGCGT GGACTTAAAC
CCGGTGGCGG TGGAACTGGC GGAAGTGTCG CTGTGGCTGA ACGCCATCAG TGGCGATGCC
TTTGTACCGT GGTTTGGTTA CCAGCTGCAC TGCGGTAACT CGCTGGTGGG CGCGCGCCGT
CAGGTGTTCA ACAAGAGTGA ACTGACCTAC AAAAAAGCCA AAGATCCGAG CTGGCTTAAC
AGCGAGCCGG TCGAACTGGC GATGAACACG CCGCGTGAAG AGACGCAGAT TTTTCACTTC
CTGCTGCCCG ACGGCGGTAT GGCTAACTAC AGCGATAAAA CCGTTAAGCA GCGTTATCCG
GATGACTTCA AAGCCCTGGA CAGCTGGCGC AAAGAGTTTA TTAAAAGCTT TGCCGGGCAT
GAGATTGCTG ATGTGCAGCG TATCAGCGAA AAGGTGGAAG CTCTGTGGAA CACCTATCGC
CAGCAACTTA AAGCAGAACG TCTGAAAACC GCCGACAGCT ACCCGGTGTG GCCGGCAGAA
AACAGCGAGC AGACGCGTTC TTCGCTGAGC AGTAAAGATG AAACCTTCAG TGGTCGTCTT
GAAGATAACA GTGCCTACCA GAAGCTGCGT TGGGTGATGG ACTACTGGTG CGCGCTGTGG
TTCTGGCCGA TCGACAAAGC CGATGAGCTA CCGGATCGCG GCACCTGGTT GTTTGAGATT
GAAACCCTGC TCGACGGGAT TGTAATCACG GAAAAAGTCA CTGAAGTTGC GGAGCACACC
ACCGGCGATC TGTTTGCCGA AGAAGGTCTG CTGCGAGAAG AGTCTTCACT GTTTTCCGGC
GCAGGTCGTC TGAAAACCGA GGTGTTGTTC CGTCATTTGC CGCGTCTGGC GATTGTCGAT
GCCCTGAGAA AGCAGTACCG TTTCTTCCAC TGGGATCTGG AGTTCTGCGA CCTGTTTGCC
GAGCGCGGCG GTTTTGACCT GATGCTCGGA AACCCGCCGT GGCTAAAAGT TGTATGGGAA
GAACGCGGAA TATTGGGAGA TTATAAACCT GAGCTTGTAC TGCACAATCT TAGCGCTGCC
AATATATCTT CTCTCCGAGA AGAATCTTTT GCACGAGTCT ATGGATTAGA AGATGCATGG
CGAAGTGAGT TTGAATCTGC CGTTGGATTA CAGAACTATC TAAATAGTCA GCAAAATTTT
TCATGCTTGT TAGGGCAAAA AACAAATCTA TATAAATGTT TTTTACCAGT CTCATGGCGA
CTTATTTCTG AACATGGTGT TTCAGGGTTA TTGCACCCTG ATGGAGTTTA TGATGATAGC
AAGGCGGGGG AACTTCGTGC CAAAATTTAC TGCCGTTTGC GTGACCATTA TCAATTTCAA
AATGAATTAG GGCTTTTTCC CGATGTTCAT CATGCAACAA CATTTAGTAT TAATATTTAT
GGTCCATTGC AAGAATCGCC TTCATTTACA CACATGTCAA ATTTGTTCTC TGTGTATACA
ATAGATTCAA GTATTTTGCA TGATAATCAA GGGGAAATTC CTGGTATCAA GGAAGAATCT
TACAATGGGG ATAAACTGAA AATAAATTGG AATATATCAC CCCATAAATC AAGAGTATTA
ACAATTACTA TTGATGAGTT ACGTTTGTTT GCGGGGATAT TTGATGATAA TTCAACACCT
GCACTGCACG CAAGGCTTCC AGCTTTACAT ACAGTACAAT TATTGGAAGT TTTACGTAAA
TTTGATGTTA AATCAAAAAA ACTATCTTCA ATAAAAAATA ATTATTATTC TACAGTGATG
TTTGATGAAA CATTTGCTCA AAGAGATGGT ACAACTAAAA GGAAAACGGA ATTCCCCTCT
GGAATTTCAC AATGGATATT ATCTGGACCA CATTTCTTTG TGGGGAATGC TTATTACAAA
ACGCCACGGA CTTTATGTAA ATTAAATAGT GACTATGACT CCATTGATTT GATGACTATT
CCTGATAATT ATATGCCTAG GGTTAATTAT ATTCCTAATT GCGAGTATTC CGATTTTCTA
AATCGAATTC CGACAGTTTC TTGGGATGGT AATGTCTTTC CTGTGACTAA TGGCTATAGA
TTTGTTAATC GGGAAATGAT AGGTTCAACT TCTGAACGAA CTTTCATCGC AACTATAATA
CCTCCCGGTA TTTCTCATAT TAATACGTGT CTAAGCACTA TTTTTAAACA TGACTATGAT
TTACTTGATT TCTTTTCGAT GTCTTTGTCC ATTGTCGTAG ATTATAGAGT CAAGTGTACT
GGCATGGGGC ATGCGAATCA GTCATTAGTT AATCAACTTC CATTATTAAG TAATGAGAAG
TTTAGAGCCT CTTTACATGC AAGGTCGATG GCTTTGGTTA GTATAACAGA GCATTATAAA
AAATTGTGGT GTTCCATTTA TTCCTCAGAA TTTAAAATAC AATATTGGAG CCGCGATCTC
CCACAGCTCC CCCAGGATTT CTTCACCAAT CTGACCCCAG AATGGCAGCG TAACTGCGCC
TTACGCTCCG ACTACAGCCG CCGTCAGGCG CTGGTGGAAA TCGACGTGCT GGTGGCGCAG
GCGCTGGGGC TAACCCTCGA AGAACTACTC GCCATCTACC GTATTCAGTT CCCGGTGATG
CGCCAGTACG AAGCGGACAC CTGGTACGAT CAAAACGGTC GCATTATCTT TACCCCAAGC
AAAGGGCTGG TGGGCGTTGG CTTGCCTCGT ACCGCGCGTA AAGCTGACCT GAAAAACGGC
TTTGTCTTTA ACGTCGACAG CCCGGACTGG ACCGGCGGTG ACTGCACCGA TCAAGCCATC
GGCTGGGATG ATGTCAAACA TCTGCAAACC GGTACCGTCA GCGTTACCTT TGATGACTAT
ACCCGCAGCG ACGAAGGCGA GCGCCGTACC GTCACCTGGC AGGCACCGTT TATCAAGCCG
GATCGCGAAG ACGACTACAA AGTGGCCTGG GCGTTCTTTG CACAAGATAA GGAGAGCGCC
TGA
 
Protein sequence
MALVGINNEN EFYSNHYLGE VFTSDIRDVL EPWIAQENAA REAERAAREQ GKDVEPGYRA 
PWNQFNSLAT EFFRKLAEHE KQRQIPQRLA DQRNRWQPLL KALGYEITPQ IQMLDDDTPL
PVLARYNSTD GSPWLWIVEA HDQEEGTLDP LALSLLTAQF PADTDKHKRD SLRKKANGEY
RSWQDLLSTA VFTQNEPPRF VLLLGNRQLL LLDRTKWAQN RLLRFDFEEI LSRRETDTLK
ATAVLLHKDS LLPGSGAPYL DSLDDNSHKH AFGVSEDLKY ALRESIELLG NEAMHYLIDR
GLANYTGNRA VDPDELSREC LRYMYRLLFL FYIEARPELG YAPMTAKTYL QGYSLETLRD
LEMIPLTSEE DRNGRYFHDS LNMLFKLVRE GYNGGVKMQS DLESGDRITI HSHQFSVPRL
ESHLFDANNT RILNRVVFRN ETLQQIIQAM SLSRPGKGRF NRRGRISYRQ LGINQLGAVY
EALLSYRGFF ASEDLYEVKK AGEEFNELET GYFVSKDEIG KYHEDEKVYE KDGSLRIHRK
GSFIYRMAGR DREKSASYYT PEVLTRSLVK YALKELFKEQ IDPITDLHAK ADAILNLTVC
EPAMGSAAFL NEAINQLAEA YLFHKQQAEG RRIPQDRYTQ ELQRVKMYIA DNNVFGVDLN
PVAVELAEVS LWLNAISGDA FVPWFGYQLH CGNSLVGARR QVFNKSELTY KKAKDPSWLN
SEPVELAMNT PREETQIFHF LLPDGGMANY SDKTVKQRYP DDFKALDSWR KEFIKSFAGH
EIADVQRISE KVEALWNTYR QQLKAERLKT ADSYPVWPAE NSEQTRSSLS SKDETFSGRL
EDNSAYQKLR WVMDYWCALW FWPIDKADEL PDRGTWLFEI ETLLDGIVIT EKVTEVAEHT
TGDLFAEEGL LREESSLFSG AGRLKTEVLF RHLPRLAIVD ALRKQYRFFH WDLEFCDLFA
ERGGFDLMLG NPPWLKVVWE ERGILGDYKP ELVLHNLSAA NISSLREESF ARVYGLEDAW
RSEFESAVGL QNYLNSQQNF SCLLGQKTNL YKCFLPVSWR LISEHGVSGL LHPDGVYDDS
KAGELRAKIY CRLRDHYQFQ NELGLFPDVH HATTFSINIY GPLQESPSFT HMSNLFSVYT
IDSSILHDNQ GEIPGIKEES YNGDKLKINW NISPHKSRVL TITIDELRLF AGIFDDNSTP
ALHARLPALH TVQLLEVLRK FDVKSKKLSS IKNNYYSTVM FDETFAQRDG TTKRKTEFPS
GISQWILSGP HFFVGNAYYK TPRTLCKLNS DYDSIDLMTI PDNYMPRVNY IPNCEYSDFL
NRIPTVSWDG NVFPVTNGYR FVNREMIGST SERTFIATII PPGISHINTC LSTIFKHDYD
LLDFFSMSLS IVVDYRVKCT GMGHANQSLV NQLPLLSNEK FRASLHARSM ALVSITEHYK
KLWCSIYSSE FKIQYWSRDL PQLPQDFFTN LTPEWQRNCA LRSDYSRRQA LVEIDVLVAQ
ALGLTLEELL AIYRIQFPVM RQYEADTWYD QNGRIIFTPS KGLVGVGLPR TARKADLKNG
FVFNVDSPDW TGGDCTDQAI GWDDVKHLQT GTVSVTFDDY TRSDEGERRT VTWQAPFIKP
DREDDYKVAW AFFAQDKESA