Gene ECH74115_5486 details


Gene Information

Locus tag: ECH74115_5486
Symbol:
ID: 6968476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5131815
End bp: 5134001
Gene Length: 2187 bp
Protein Length: 728 aa
Translation table: 11
GC content: 38%
IMG OID: 643389132
Product: ShET2 enterotoxin, N- region family
Protein accession: YP_002273529
Protein GI: 209399314
COG category:
COG ID:
TIGRFAM ID:
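
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5131815
END_BP = 5134001


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2187 728
```

Under these assumptions the computed values agree with the listed Gene Length (2187 bp) and Protein Length (728 aa).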


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTACTC ATATTCCTCG TAGTTCTTTC TCTGCAAATA TTAATAATAC AGCCCAGACA 
AATGAACACC AAACCCTGAG TGAATTGTTT TATAAAGAAC TCGAGGATAA ATTTTCTGGC
AAGGAGCTGG CGACACCTCT ATTAAAAAGC TTCTCAGAGA ACTGTAGACA CAATGGTCGC
CATATCTTTA GTAACAAGGA TTTTGTCATT AAGTTTTCCA TATCTGTCTT ACAAGCTGAT
AAGAAAGAAA TTACGATAAT TAATAAAAAT GAAAACACGA CACTCACTCA AACCATTGCT
CCAATATTTG AAGAATACCT AATGGAAATT TTACCTCAAC GCTCAGACGC TCTTGATAAA
AAAGAATTAA ATCTAAACTC AGATAGAAAA GAAAAAGAAT TCCCAAGAGT TAAGCTTAAT
GGTCAATGTT ATTTTCCGGG GCGCCCCCAA AACCGTATAG TATGCCGACA CATTGCTGCA
CAATATATTA ATGATATTTA TCAGAATGTT GACTACAAAC CCCATCAAGA TGATTACTCT
TCAGCTGAAA AATTTCTCAC TCACTTCAAC AAAAAATGCA AAAACCAGAC TTTGGCGTTG
ATTTCCAGCC GTCCTGAGGG GCGTTGCGTT GCTGCCTGCG GTGATTTCGG GCTAGTTATG
AAGGCATATT TTGACAAGAT GGAATCAAAT GACCTCAGTG TTATGGCTGC CATATTATTG
GTAGATAACC ATGCTTTGAC GGTCCGGCTA AGAATAAAGA ACACAACTGA AGGATGTATC
CATTACGTGG TTTCGGTTTA TGATCCTAAT GTAACTAACG ATAAAATAAG AATCATGAGC
GAAAGTAAAG AGGATATTAA ACACTATTCA CTGATGGATT TTATGAATGT AGATTATAGC
CTCCTGAAAT GGTCAAATGA TCATGTTATT AACCAATCTG TTGCAATAAT TCCAGCACTT
CCAAAAGAAC AGCTATTGAT GCTAAAAGGA ACTGTGGATG AAATAACCCC TCCATTATCA
CCTGCAACGA TGAATTTGCT AATGGCAATT GGTCAGAATC ACCAACTTAA GCAACTGATG
ATTCAACTCC AGAAAATGCC AGAACTACAT AGAACAGAAA TGTTGACTGC CTATAATAGT
ATTAACCTCC CCGGTTTATA TTTGGCTATA AATTATGGTA ATGCGGATAT CGTTGAGACT
ATTTTTAATT CATTGTCAGA GCCAGGATAT GAAGGATTAC TCTCGAAAAA AAATCTCATG
CATATTCTGG AGGCAAAAGA TAAAAATGGT TTTTCTGGAT TATTTTTAGC GATATCACGT
AAGGATAAAA ATGTTGTAAC CTCGATTCTG AACGCCTTAC CTAAACTGGC CGCAACACAT
CATTTAGATA ACGAACAAGT GTATAAATTC CTGAGTGCCA AAAATAGTAC GTCCAGCCAT
GTTTTATACC ATGTTATGGC GAATGGTGAT GCCGACATGC TGAAAATTGT TTTGGACGCG
TTATCTTTGT TAATTCGCAC ATGTCATTTG ACTAAAGAAC AGGTACTCGA TCTCCTGAAG
GCAAAGGATT TTTATGGTTG CCCAGGACTA TACCTGGCGA TGCAAAATGG ACATAGCGAT
ATCGTGAAAG TTATTCTTGA AGCATTGCCC AGCCTGGCCC AGGAAATTAA CATTTCAGCT
TCCGATATTG TAGATCTTCT GACCGCTAAA AGTCTTGCGC GCGACACGGG TTTGTTTATG
GCCATGCAGC GCGGACATAT GAACGTTATT AATACTATTT TTAACGCATT ACCCACTCTG
TTTAATACGT TTAAATTCGA TAAAAAAAAT ATGAAGCCCC TCCTCCTGGC AAATAATTCT
AATGAATACC CAGGTTTGTT TTCAGCGATA CAGCATAAAC AGCAAAACGT TGTAGAGATG
GTTTATCTTG CTTTATCTGA CCATGCACGC CTGTTTGGAT TTACCGCTGA AGATATTATG
GATTTTTGGC AACACAAAGC GCCACAAAAA TACTCTGCCT TTGAGTTGGC TTGTGAATTG
GGTCACCGGG TTATTGCTGA ATTAATCTTT AATACATTAA ATAAGATGGC TGAAAGCTTT
GGCTTTACGG ATAACCCTCG ATACATTGCG GAGAAAAATT ATATGGAAGC TTTACTCAAA
AAAGCATCTC CCCATACCGT ACGCTAA
 
Protein sequence
MITHIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS FSENCRHNGR 
HIFSNKDFVI KFSISVLQAD KKEITIINKN ENTTLTQTIA PIFEEYLMEI LPQRSDALDK
KELNLNSDRK EKEFPRVKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS
SAEKFLTHFN KKCKNQTLAL ISSRPEGRCV AACGDFGLVM KAYFDKMESN DLSVMAAILL
VDNHALTVRL RIKNTTEGCI HYVVSVYDPN VTNDKIRIMS ESKEDIKHYS LMDFMNVDYS
LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG TVDEITPPLS PATMNLLMAI GQNHQLKQLM
IQLQKMPELH RTEMLTAYNS INLPGLYLAI NYGNADIVET IFNSLSEPGY EGLLSKKNLM
HILEAKDKNG FSGLFLAISR KDKNVVTSIL NALPKLAATH HLDNEQVYKF LSAKNSTSSH
VLYHVMANGD ADMLKIVLDA LSLLIRTCHL TKEQVLDLLK AKDFYGCPGL YLAMQNGHSD
IVKVILEALP SLAQEINISA SDIVDLLTAK SLARDTGLFM AMQRGHMNVI NTIFNALPTL
FNTFKFDKKN MKPLLLANNS NEYPGLFSAI QHKQQNVVEM VYLALSDHAR LFGFTAEDIM
DFWQHKAPQK YSAFELACEL GHRVIAELIF NTLNKMAESF GFTDNPRYIA EKNYMEALLK
KASPHTVR
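
As a spot check on the two sequences above, the following minimal sketch translates the first 30 bases and the final line of the gene sequence. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which start codons are permitted), so a standard codon table is used here. The `translate` helper is illustrative only, reads the forward strand from the first base, and stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G
# (third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as listed above.
first_block = "ATGATTACTC ATATTCCTCG TAGTTCTTTC"
# Final line of the gene sequence (positions 2161-2187, so it is in frame).
last_block = "AAAGCATCTC CCCATACCGT ACGCTAA"

print(translate(first_block))  # MITHIPRSSF
print(translate(last_block))   # KASPHTVR
```

Both results match the corresponding ends of the listed protein sequence.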