Gene EcHS_A0339 details


Gene Information

Locus tag: EcHS_A0339
Symbol:
ID: 5595008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 345798
End bp: 349415
Gene Length: 3618 bp
Protein Length: 1205 aa
Translation table: 11
GC content: 47%
IMG OID: 640919524
Product: putative restriction enzyme
Protein accession: YP_001457110
Protein GI: 157159792
COG category: [V] Defense mechanisms
COG ID: [COG1002] Type II restriction enzyme, methylase subunits
TIGRFAM ID:
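
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function name cds_lengths is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(345798, 349415)
print(gene_len, protein_len)  # prints: 3618 1205
```

Under these assumptions the result agrees with the Gene Length (3618 bp) and Protein Length (1205 aa) fields.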


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.00710778
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACCA ATAACATTAA AAAATATGCC CCACAGGCCC GTAACGACTT CCGCGATGCG 
GTGATCCAGA AGCTAACGAC GCTTGGGATC GCTGCAGATA AAAAAGGCAA TTTGCAGATT
GCCGAGGCCG AAACCATTGG CGAGACCGTG CGTTACGGTC AGTTTGATTA CCCGTTATCG
ACCCTTCCCC GCCGCGAACG GCTGGTAAAA CGCGCCCGTG AGCAGGGTTT TGAGGTGCTG
GTTGAGCACT GCGCCTACAC CTGGTTTAAC CGCTTATGTG CAATTCGCTA TATGGAGCTG
CACGGTTATC TTGATCACGG CTTCCGTATG TTGTCTCACC CGGAGACTCC GACCGCGTTT
GAGGTGCTGG ATCATGTGCC GGAAGTGGCA GAAGCTCTGC TACCAGAAAG TAAGGCGCAG
TTGGTTGAAA TGAAGCTTTC CGGTAATCAG GACGAAGCTC TGTACCGTGA ACTGCTGCTG
GGGCAGTGCC ACGCCCTGCA CCACGCGATG CCGTTCCTGT TTGAAGCGGT AGATGACGAA
GCGGAACTGC TGCTGCCGGA TAACCTGACC CGTACTGACT CCATTCTACG TGGGCTGGTT
GATGATATTC CGGAAGAAGA CTGGGAGCAG GTAGAGGTTA TCGGCTGGCT GTATCAGTTC
TATATTTCGG AAAAGAAAGA TGCCGTGATT GGCAAAGTGG TGAAGAGCGA AGATATTCCT
GCCGCCACCC AGCTGTTTAC GCCGAACTGG ATTGTGCAGT ATCTGGTACA GAACTCCGTT
GGCCGCCAGT GGTTGCAGAC CTACCCGGAC TCACCGCTGA AAGACAAAAT GGAGTACTAC
ATTGAGCCTG CGGAGCAAAC GCCGGAAGTG CAGGCGCAGC TGGCGGCGAT TACCCCGGCC
AGCATTGAGC CTGAAAGCAT TAAAGTGCTG GATCCGGCCT GCGGTTCCGG GCATATCCTG
ACAGAAGCCT ATAACGTGCT GAAGGCCATT TACGAAGAGC GCGGTTACCG TACCCGTGAC
ATTCCGCAGC TGATTCTGGA AAACAATATT TTCGGCCTGG ACATCGATGA TCGTGCGGCG
CAGCTTTCCG GATTTGCGAT GCTGATGCTG GCACGCCAGG ATGACCGTCG TATTCTGGGC
CGTGGTGTGA GGCTGAATAT TGTCTCCCTA CAGGAAAGTA AGCTGGATAT TGCTGAGGTT
TGGACCAAGC TGAACTTCCA TCAGCATATG CAACGCGGCA GCATGGGAGA TATGTTTACC
CAAGGGACGG CACTGGCCAA TACCGACAGC GCGGAATATA AGCTACTGAT GCGTACTCTG
GCATTGTTCA CTAGTGCAAA AACGCTGGGA TCACTGATTC AGGTACCGCA GGAAGACGAA
GCGGCGTTGA AAGCGTTTCT GGAGAGATTG TATCGTCTGG CTGTTGAAGG TGATATTCAG
CAGAAAGAGG CTGCGGCAGA GCTGATCCCA TATATTCAGC AGGCGTGGAT ACTGGCGCAA
CGTTATGATG CTGTGGTCGC GAACCCGCCG TATATGGGGG GGAAAGGGAT GAATGGTGAT
CTGAAAGAGT TTGCTAAAAA ACAATTCCCG GACAGCAAGT CGGATTTGTT TGCGATGTTT
ATGCAACATG CATTTTCTTT ACTCAAAGAA AACGGTTTTA ATGCGCAAGT GAATATGCAG
TCATGGATGT TCCTGTCTAG CTATGAAGCG CTTCGTGGAT GGTTGCTGGA TAATAAAACC
TTTATTACGA TGGCACATTT GGGGGCTCGA GCATTTGGTC AAATCTCTGG AGAAGTTGTT
CAAACTACCG CATGGGTGAT TAAAAATAAC CATTCAGGAT TTTATAAACC TGTATTTTTC
CGTTTAGTTG ACGATAATGA AGAACATAAA AAAAACAATC TCTTGAATCG GATGAATTGC
TTTAAAAACA CTCTGCAGAA TGACTTTAAA AAAATACCTG GTTCACCCAT TGCTTACTGG
GCAACATTGG CGTTTATTAA TTCCTTTCTT AAATTGCCTG CTCTTGGAAC TCGTGCTGTG
AAAGGACTAG ATACAAACGG GTCCATTGAT GTATTTTTGC GCAGATGGCC GGAAGTCAGC
ATAAATTCCT TTGATGCATT AGGGAAAGGT AACTCAAAAT GGTTTCCGAT TGCAAAAGGG
GGTGAGTTGA GGAAGTGGTT CGGAAATCAT GAGTATATCA TAAACTATGA AAATGATGGC
ATTGAATTAA GAAAAAACAA AGCAAACTTG AGAAATAAGG ATATGTACTT TCAGGAGGGG
GGAACCTGGA CTGTTGTATC AACCACCGGT TTCTCAATGA GATATATGCC AAAAGGATTC
CTTTTTGACC AAGGCGGTTC TGCTGTTTTT TGTGAAAATA ATGATGAGCT ATCGATATAT
AACATCCTGG CCTGCATGAA CTCTAAATAT ATCAACTACT CTGCAAGTTT AATTTGTCCT
ACGCTTAACT TTACAACCGG TGATGTTAGG AAATTCCCTG TTATAAAAAA CAATCACCTT
GAAGATTTAG CAAAAAAAGC AATTGAGATA TCAAAAGCAG ACTGGAACCA ATTCGAGACA
TCTTGGGAGT TCAGTAAAAA CAAGTTAATT GAACACAAAG GAAACGTTGC TTATTCGTAT
GCTAGCTATT GTAACTTTCA AGATAAACTG TATGAACAGC TAGTTAATAT TGAAAAAAAT
ATTAATAACA TAATCGAAGA AATACTGGGT TTTAAAATAG AAACAACAGA GAATAGTGAG
TTAATTACAT TAAACTCGAA CAAAATATAT CGTTATGGGC AAAGTGAAAC CAATGATACA
TTCCTAAACA GGCATCGGAG TGACACCATT TCGGAACTCA TCTCATATTC AGTCGGCTGC
CAAATGGGAC GCTATTCCCT CGATCGCGAA GGCCTCGTCT ACGCCCATGA AGGCAACAAA
GGCTTTGCCG AACTTGCCGC TGAAGGTGCG TACAAAACCT TCCCGGCAGA CAATGACGGC
ATCCTGCCGC TGATGGATGA CGAGTGGTTT GAGGACGACG TCACCTCTCG CGTCAAAGAG
TTTGTCCGCA CCGTCTGGGG CGAAGAGCAC CTGCAGGAAA ACCTCGAATT TATCGCCGAA
AGTCTCTGTT TATACGCGAT CAAGCCGAAA AAAGGCGAAT CTGCGCTGGA GACCATTCGT
CGCTATCTTT CCACACAGTT CTGGAAAGAT CATATGAAGA TGTATAAAAA GCGCCCAATC
TACTGGCTAT TCAGCTCCGG TAAAGAGAAA GCCTTCGAGT GCCTGGTGTA TCTGCATCGC
TATAACGACG CCACGCTGTC GAGAATGCGT ACCGAATATG TGGTGCCGCT GCTGGCACGT
TATCAGGCCA ATATCGATCG CCTGAACGAT CAACTTGATG AAGCTTCTGG CGGTGAAGCC
ACACGCCTGA AACGCGAACG CGACAGCCTG ATCAAAAAAT TTAGCGAACT GCGTAGCTAT
GACGACCGCC TGCGTCACTA TGCTGATATG AGAATCAGTA TTGATCTCGA CGATGGCGTT
AAGGTTAACT ACGGCAAGTT TGGCGATCTG CTGGCAGATG TCAAAGCCAT CACCGGCAAT
GCCCCAGAGG CGATCTAA
 
Protein sequence
MNTNNIKKYA PQARNDFRDA VIQKLTTLGI AADKKGNLQI AEAETIGETV RYGQFDYPLS 
TLPRRERLVK RAREQGFEVL VEHCAYTWFN RLCAIRYMEL HGYLDHGFRM LSHPETPTAF
EVLDHVPEVA EALLPESKAQ LVEMKLSGNQ DEALYRELLL GQCHALHHAM PFLFEAVDDE
AELLLPDNLT RTDSILRGLV DDIPEEDWEQ VEVIGWLYQF YISEKKDAVI GKVVKSEDIP
AATQLFTPNW IVQYLVQNSV GRQWLQTYPD SPLKDKMEYY IEPAEQTPEV QAQLAAITPA
SIEPESIKVL DPACGSGHIL TEAYNVLKAI YEERGYRTRD IPQLILENNI FGLDIDDRAA
QLSGFAMLML ARQDDRRILG RGVRLNIVSL QESKLDIAEV WTKLNFHQHM QRGSMGDMFT
QGTALANTDS AEYKLLMRTL ALFTSAKTLG SLIQVPQEDE AALKAFLERL YRLAVEGDIQ
QKEAAAELIP YIQQAWILAQ RYDAVVANPP YMGGKGMNGD LKEFAKKQFP DSKSDLFAMF
MQHAFSLLKE NGFNAQVNMQ SWMFLSSYEA LRGWLLDNKT FITMAHLGAR AFGQISGEVV
QTTAWVIKNN HSGFYKPVFF RLVDDNEEHK KNNLLNRMNC FKNTLQNDFK KIPGSPIAYW
ATLAFINSFL KLPALGTRAV KGLDTNGSID VFLRRWPEVS INSFDALGKG NSKWFPIAKG
GELRKWFGNH EYIINYENDG IELRKNKANL RNKDMYFQEG GTWTVVSTTG FSMRYMPKGF
LFDQGGSAVF CENNDELSIY NILACMNSKY INYSASLICP TLNFTTGDVR KFPVIKNNHL
EDLAKKAIEI SKADWNQFET SWEFSKNKLI EHKGNVAYSY ASYCNFQDKL YEQLVNIEKN
INNIIEEILG FKIETTENSE LITLNSNKIY RYGQSETNDT FLNRHRSDTI SELISYSVGC
QMGRYSLDRE GLVYAHEGNK GFAELAAEGA YKTFPADNDG ILPLMDDEWF EDDVTSRVKE
FVRTVWGEEH LQENLEFIAE SLCLYAIKPK KGESALETIR RYLSTQFWKD HMKMYKKRPI
YWLFSSGKEK AFECLVYLHR YNDATLSRMR TEYVVPLLAR YQANIDRLND QLDEASGGEA
TRLKRERDSL IKKFSELRSY DDRLRHYADM RISIDLDDGV KVNYGKFGDL LADVKAITGN
APEAI
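
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, translate the DNA, and compute a GC fraction. It is illustrative only: the helper names (clean, translate, gc_fraction) are hypothetical, and the codon table is the standard one, which is assumed here to be sufficient because translation table 11 uses the same codon-to-amino-acid assignments and differs in which start codons are permitted (this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as printed above.
first_line = "ATGAATACCA ATAACATTAA AAAATATGCC CCACAGGCCC GTAACGACTT CCGCGATGCG"
print(translate(first_line))  # prints: MNTNNIKKYAPQARNDFRDA
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to the same functions would allow the Protein Length and GC content fields to be checked in the same way.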