Gene ECH74115_0581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0581 
Symbol 
ID6969304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp584939 
End bp589324 
Gene Length4386 bp 
Protein Length1461 aa 
Translation table11 
GC content59% 
IMG OID643384625 
ProductPKD domain protein 
Protein accessionYP_002269139 
Protein GI209395885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGA TTATTGATGT TATTTCGCGT AAAACATCCG TCAAACAAAC GCTGATTAAT 
CCTGGCGACG TCACGGTTGT TATTTATGAG CCTTCCGTGG TGCAGGTTCA TGCTCAGGCC
TCTGCCGTTG CGCGTTACGT CCGTGAAGGA AATGACCTGC TGATCTATAT GCGGGACGGC
ACGGTGATCC GCTGCAACGG TTATTTCCTG CAAGCGGCGA ATACAGCTGA ACAATCGGAA
CTGGTGTTTG CCGATGGTCA ACAGCTAACC CATATCACCT TTGCCGATAC TGCTGCGGGT
GGATTAGCCC CCGTAGAACT GACTGCCCAG ACCACTGCGA TTGAAAGCAT TGCGCCATTT
CTTGATACCG TTGCTCAGAC CAGCGCCTTC CCGTGGGGTT GGCTGGCGGG GGCGGCGGTA
GGTGGTGGCG CGCTTGGTGC ACTGCTGGCA AGCGGTGGCG ATGGCGACTC GAAAACAGAA
GTGATTAATA ACCCTACGCC ACCTGCTGAG CCTGGCAACG CCACACCATC ATTTTTAGTT
ACCGATAATC AGGGCGATCA GCGCGGCATT CTAGCCACCA ATGACATCAC CGATGACACC
ACGCCAACCT TTAGCGGCAG CGGGCAGGCG GGGGCGACTA TTCAGATTAA AGACAGTAAC
GGCAATACTA TTGCCAGTAC TCAGGTAGAC AACAACGGTC ACTGGAGTGT CTCGTTACCC
ACGCAAAGTG CAGGTGAACA TACCTGGTCA GTGGTGCAAA TTGTCGGCAG TACCATCACT
GACGCCGGTT CGATAACGTT AACCATCGAC AATAGTCAGG CCAGCGTGCA GGTTGCCACC
ACCGCAGGCG ATAACATTAT TAACGCCAGC GAACAGGCCG CCGGGTTTAC GCTTTCTGGC
ACCAGTAGCC ATCTGGCGCA GGGAACAGAA CTCACCGTTA CGCTAAACGG CAAAACCTAC
ACGACCAGCG TAGGCGCTAA CGGTGCCTGG AGCGTGCAGG TGCCGACCGC CGATGCACAG
GCGTTAGGCG AAGGGAATCA GGCGGTGCTG GTCAGTGGGA AAGACGCCAC AGGCAATACG
GTCACCGGCG CGCAGCTACT AACGGTCGAT ACCCAACCGC CAACGCTTGC CATCAACACC
ATCGCTCAGG ACAACATTAT CAGTGCTGCG GAACATAACG TCGCGCTGGT ACTGAGCGGC
ACGTCGAATG CAGAAGCGGG GCAAACCGTA ACACTGACCG TCAACGGGAA AAGCCATACA
GCAACCGTCG GTAGCGACGG AACCTGGCAA GTGACGCTGC CTGCCACGGA AGTCCAGGCA
CTGGCGGAGG GTAATTACGC TGTCAATGCC AGTGTCAGCG ATCGGGCAGG GAACACCACC
AGCCACAGCG CGAATTTCAC GGTAGACACC TCAGCACCCG TGGTCAGTGT TAATACCGTG
GCGGGCGACG ATATTCTTAA TAATGCCGAG CAGGCCGTCG CGCAGATCAT CTCCGGACAA
GTCAGCGGTG CTTCTCCAGG CGATACGGTA ACGGTGAAAT TGGGCACTCA TGTCCTGACG
GGCATCGTGC TGGCAGATGG CAGCTGGAAT GTGGCGCTGG ACCCAGCGGT AACCCGCACG
CTGGATCGCG GAGCCAATAC GATTTTCGTC ACCGTGACAG ATGCTGCAGG AAATACTGGC
GCGGCGTCTC GAGCAATCAC GCTGGTCGGT GTTTCTCCGT TGATCACCAT TAACACCGTC
TCCGGCGATG ACATTATCAG TGGCGCAGAA AAAGGTGCGC CACTGACCCT TACCGGTAGC
ACTCAACAGG CTGAGACAGG ACAAACCGTC ACAGTAACCC TGGCTGGACA GAGTTTTACC
ACTACCGTGC AGGCCGATGG CTCCTGGAGT CTGACGGTAC CTGCCGCCGC GATGGGAAAT
CTGCCTGACG GCGCGGTGGC GATTACCGCT TCTGTGACGG ATCTCAGCGG CAATACCGGC
AACACTTCCC GCACCATTAC CGTCGATAGC CAGGCCCCGG CCTTAAGCAT TGATCCACTG
ACCGCTGATA ACATCATTAA CGCCGCCGAA AGCGGGCAGG ATCTGCCCAT CACCGGCACC
ACCGACGCTC AGCCGGGGCA GACGGTGACC GTTACGTTAA ATGGGCAGAC GTATCAGGGC
GTCGTGCAGC CAGACGGCAC CTGGAGCGTG ACTGTGCCCG CCGCCAACGT GGGCGCACTG
GCTGACGGCA ACGCTACGGT CACCGCCAGC GTGAACGATG TCGCCGGTAA TCCGAGCAGC
GTTTCACGCG TGGCGCTGGT GGATGCCACG CCGCCGGTGG TAACCATTAA TCCGGTGGCG
ACCGATAACG TCATCAACAC GCCGGAACAT GCTCAGGCGC AAATCATCAG CGGCACGGTT
ACTGGCGCTC AGGCGGGCGA TATCGTCACC GTGACGCTGA ATAATGTGGA TTACACCACG
GTGGTGGATG GTTCCGGCAA CTGGAGTCTG GGCGTTCCGG CCTCGGTGGT CAGTGGGCTG
GCGGACGGCA GTTATCCTGT CAGCGTCTCG GTAACCGACA AAGCCGGAAA CACGGGCAGC
CAGTCATTGA CCGTCACGGT CAATACCGCC GCGCCCCTTA TCGGCATTAA CAGCATTGCG
GGCGATGATG TGATTAACGC CAGCGAAAAA GGGGCCGATC TCCAGATTAC CGGCACCAGC
GATCAGCCTG TTAACACCGC CATCACCGTG ACGCTGAACG GGCAAAATTA CACCACCACG
ACCGACGCCT CCGGCAACTG GAGCGTCACC GTTCCGGCAT CGGCGGTTAC AGCATTAGGC
CAGGCCAACT ATACGGTAAC GGCGGCGGTG ACCAGCGATA TCGGCAACAG CGCCACTGCC
AGCCATAACG TGCTGGTCGA CAGCGCGCTG CCCGGTGTGA CCATTAATCC GGTGGCAACC
GACGATATTA TTAACGCCGC CGAAGCGGGC GTGGCGCAAA CCATCAGCGG GCAGGTGACT
GGCGCGGAAG ATGGCGACAC GGTAACTATT ACGTTGGGTG GTAATACTTA TACGGCGACG
GTGGGCAGCA ATCTCACCTG GAGCGTGGAC GTTCCAGCGG CAGATATTCA GGCGCTGGGA
AATGGCGATT TAACGGTTAA TGCCTCAGTC ACCAATCAAA ACGGCAACAC CGGCAGCGGC
ACGCGGGATA TCACCATCGA CGCCAATCTG CCCGGCCTGC GGGTCGATAC GGTGGCGGGC
GATGATGTGG TCAATATCAT CGAGCACGGG CAGGCGCTGG TGGTCACCGG CAGCAGCTCG
GGGCTGGCTG AAAGCACGCC GCTTACCGTT ACGATTAATA ATGTGGAATA CACCACTGCG
GTGCAGGCCG ATGGTAGCTG GAGCGTGGGC GTCACGGCGG CGCAGGTTAG CGCCTGGCCT
GCGGGGACGG TTAATATTGC CGTTTCAGGG GAAAGTAGCG CCGGAAACTC GGTGAGCATT
ACGCATCCGG TGACGGTGGA TCTCACTCCG GCAGCGATCA CCATCAACAC CATCGCCACG
GACGATGTGA TTAACGCCGC AGAAAAAGGC GCTGATTTAA CCCTTTCCGG CACCACCACT
AACGTAGAAC CCGGTCAAAC CGTCACCGTC ACCTTTGGCG GGAAAAATTA CACTGCCAGC
GTAGCGAGCG ATGGTAGCTG GACTGCCACC GTACCCGCCG CCGATCTGGC GTCATTACCC
GAGGGCAGCG CCTCCGCACT GGCCAGCGTC AGCAATATCA ACGGCAATAG CGCCTCGGCG
GTGCACAACT ACAGCGTCGA CAGCAGCGCG CCAACCATCA TTATCAATAC CGTCGCCAGC
GACAATATCG TCAACGCCAG CGAAGCCGAT GCGGGCGTGA CGGTGAGCGG CAGTACCACC
GCCGAAGCGG GGCAGATTGT TACGATAACG CTTAACAGCC CGACCGTGCA GACCTATCAG
GCAACGGTGC AGGCGGACGG CAGCTGGAGC ATCAATATTC CGGCGGCAGA TCTTGAGGCA
TTGACCGATG GCAGCCACAC CCTGACCGCC ACGGTCAATG ACAAAGCGGG CAATCCGGCG
AGCACCACGC ATAATCTGGC GGTGGATCTC ACCGTTCCGG TGCTGACCAT CAACACCATT
GCGGGCGATG ACATTATTAA CGCCACCGAA CACGGGCAGG CGCTGGTGAT TTCCGGTTCC
AGCACCGGCG GAGAAGCGGG GGATGTCGTC ACCGTCACGC TAAACAGTAA AACCTACACC
ACCACCCTGG ACGCCTCCGG CAACTGGAGC GTCGGCGTTC CGGCGGCGGA TGTCACGGCG
CTTGGCAGCG GCCCGCAAAC TGTCACCGCC ACGGTTACCG ATGCGGCAGG CAACAGCGAC
AATTAG
 
Protein sequence
MSLIIDVISR KTSVKQTLIN PGDVTVVIYE PSVVQVHAQA SAVARYVREG NDLLIYMRDG 
TVIRCNGYFL QAANTAEQSE LVFADGQQLT HITFADTAAG GLAPVELTAQ TTAIESIAPF
LDTVAQTSAF PWGWLAGAAV GGGALGALLA SGGDGDSKTE VINNPTPPAE PGNATPSFLV
TDNQGDQRGI LATNDITDDT TPTFSGSGQA GATIQIKDSN GNTIASTQVD NNGHWSVSLP
TQSAGEHTWS VVQIVGSTIT DAGSITLTID NSQASVQVAT TAGDNIINAS EQAAGFTLSG
TSSHLAQGTE LTVTLNGKTY TTSVGANGAW SVQVPTADAQ ALGEGNQAVL VSGKDATGNT
VTGAQLLTVD TQPPTLAINT IAQDNIISAA EHNVALVLSG TSNAEAGQTV TLTVNGKSHT
ATVGSDGTWQ VTLPATEVQA LAEGNYAVNA SVSDRAGNTT SHSANFTVDT SAPVVSVNTV
AGDDILNNAE QAVAQIISGQ VSGASPGDTV TVKLGTHVLT GIVLADGSWN VALDPAVTRT
LDRGANTIFV TVTDAAGNTG AASRAITLVG VSPLITINTV SGDDIISGAE KGAPLTLTGS
TQQAETGQTV TVTLAGQSFT TTVQADGSWS LTVPAAAMGN LPDGAVAITA SVTDLSGNTG
NTSRTITVDS QAPALSIDPL TADNIINAAE SGQDLPITGT TDAQPGQTVT VTLNGQTYQG
VVQPDGTWSV TVPAANVGAL ADGNATVTAS VNDVAGNPSS VSRVALVDAT PPVVTINPVA
TDNVINTPEH AQAQIISGTV TGAQAGDIVT VTLNNVDYTT VVDGSGNWSL GVPASVVSGL
ADGSYPVSVS VTDKAGNTGS QSLTVTVNTA APLIGINSIA GDDVINASEK GADLQITGTS
DQPVNTAITV TLNGQNYTTT TDASGNWSVT VPASAVTALG QANYTVTAAV TSDIGNSATA
SHNVLVDSAL PGVTINPVAT DDIINAAEAG VAQTISGQVT GAEDGDTVTI TLGGNTYTAT
VGSNLTWSVD VPAADIQALG NGDLTVNASV TNQNGNTGSG TRDITIDANL PGLRVDTVAG
DDVVNIIEHG QALVVTGSSS GLAESTPLTV TINNVEYTTA VQADGSWSVG VTAAQVSAWP
AGTVNIAVSG ESSAGNSVSI THPVTVDLTP AAITINTIAT DDVINAAEKG ADLTLSGTTT
NVEPGQTVTV TFGGKNYTAS VASDGSWTAT VPAADLASLP EGSASALASV SNINGNSASA
VHNYSVDSSA PTIIINTVAS DNIVNASEAD AGVTVSGSTT AEAGQIVTIT LNSPTVQTYQ
ATVQADGSWS INIPAADLEA LTDGSHTLTA TVNDKAGNPA STTHNLAVDL TVPVLTINTI
AGDDIINATE HGQALVISGS STGGEAGDVV TVTLNSKTYT TTLDASGNWS VGVPAADVTA
LGSGPQTVTA TVTDAAGNSD N