Gene ECH74115_5846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5846 
Symbol 
ID6966617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5497913 
End bp5500096 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content36% 
IMG OID643389468 
Producthypothetical protein 
Protein accessionYP_002273860 
Protein GI209399931 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.909355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCCC TATTTAACAT ATACAAAGAT ATTTTCCCAA CACTCGGCAT GTATTCGGGA 
CTAAAAGCTT GCCATGAAAA AAACAACCTA CCATTTGATA TTAACACGGA AATTGAAACC
ATACAAAAAC AAATTAATTA TGATATAAAT CATTTGAATG ATGGTTTGAT TAAGCGTGTA
CTGAATCTTT TTATTCACCT TATCTCTAAT CCCGACAATC TTGAATTAAC CTTAAATAGA
TATTCATCAA CAACAGAACA AATCATCGGC AGAACCAAAA GAAATGGTTT ACATGAGTTT
GACGATGGCG ATCTAAAAAT AATATTTAAT CGACAAGATG ATAATGAAAG CGTATTAACT
GTTAAAGATA AAGATAAAGA TAAAGATATA AGTCATCACT GCAATGTTAA AACCGAGCAA
CTGCAGCAGT TTATTAAAAT AATGGAACAA AAAGCGCAAC TACCAATCTA TATTGACAAG
AACAATTTGA AAGAGAGTAT TTTCTCTGTT TTGCACAATG ACCCACAACA AGTAGATAAA
GATCAACACC TTCCCTGTGA AAAGTTTTTA AAACATGCCT GCAAAAGTTC AAATTCATTT
GAAGTGAAAT TAGATGCCAC TCATCAATAT CAACACCTGA ATAACTTCAT GATTTCTTTG
GACCCAGTAG AAAATCAATT AACCATACGG GATAACAATA ACAAGACTGA AACTTTCTCG
TTTACAAACT TACAATGGGA AAATTTGCTG CAATACTACA AAGAAAACCA CCAGCAGCCA
AATATAGCAG GATCACGAAA TCTCACGGAT AATATAGATA AAATTAAAAA TACAATATCC
ACCTCTGAAA TTATTGAATG CGCCTCTCCT GAAATAAGAA GTAGCGTCCT GAACGATCTT
TATAGCATTG CTAATTTCCT CCCGGACAAT AATCTGACCC CAAATGAGAG CTGGAAAAGA
TTTTGTGAGA CATGCGAGCG CTTTTACGTT GCTCAGAAGA GTATCACTGG AGATAAGAGT
GAACGTCTTA CGCGAAAACT CTCTATCTCT GATGCAGGAA TTACAATGAC CTTCAAGATA
GGTGATGTTG TCATCAATAC TATTAGCACT GCTATTCCTG AAGATGCAAC GGGTCAACGG
TGTATCGAAG GGTTGAATTT AGCAGAGATG GATTTAACCG ACATAGACTT GTCGAAAATG
GCGCTAAGGA ATGTCAATTT TAATGGCAGC ATTCTTAGAA ATGCCAAGTT CTCCGGTACG
ATCTGTGAAG GCGTGGATTT TACCGATTGT GATCTGCGTA ATGCAGAATT CGAAAATGCC
TCATTAGAAA ATAATGATTT TCGTAAAGTT CGCCACTTGA CTTATGTAAA TTTCAAAAAC
GCAAATCTAC GAAACAGTAA CTTCAACGGA AAAGTTCTCA CTGGCGTAAC CTTTACTGGA
AGTGACCTTA GTAACGCGTA TCTTGAACAC ATAGATTTCA CAACCGTGAT TCTATATGAA
ACATCTAAAA TACCTGGAAT ACCTGGAACA CCTCAAATAC CGGGAACACC TAAAGTAATT
CTTACTGGCG CAATACTAAA TTATTCCGAT CTATCGGGAA AAGATCTTTC AGAATATAAT
CTTACTGGTA TTCTCTGCAT GTATACCAAC TTTTCAAACG CTAATTTAAC AAATTGTAAA
ATCTCTAATG CAAACTTTTC GAATGCAAAA TTCTACAATA CTAATTGTAC TGGTGCAAAT
TGTTCGAATA TCCTATTTGA CTACGCATGG TTTGACAATA CAATATTTAT AAAAACGCTT
TTTAAAAATA CCTGTTTTTA CAATGTCAGA GCGAAAAATG TCTATCTTGA GGGAGCATAT
CTGAACAATG ATAATATCGT GAATCAAGCC AATAACAGTA CCGAGAAACA ATCCATTGAC
AGTACCGATA AACAGGCCAA TGACAGTACG GTGCAACAAT CCATTGACAG TACGGTGCAA
CAAGCCAATG ACAGTACCGA TAAACAAGCC AATGACAATA TCGATAAACA GGTCAATGAC
AGTACCGATA AACAAGCCAA GAACAGTACC GAGCAACAGG ACAGTAACAG TTTTAATCAA
GCCCGTTTAA AGAAAGAAGT GAATAGGAGA TTTTCCATTC CGGGTTTAAC GTCTTATCAG
CCAACATATA TAGTTGAAGA ATAG
 
Protein sequence
MGSLFNIYKD IFPTLGMYSG LKACHEKNNL PFDINTEIET IQKQINYDIN HLNDGLIKRV 
LNLFIHLISN PDNLELTLNR YSSTTEQIIG RTKRNGLHEF DDGDLKIIFN RQDDNESVLT
VKDKDKDKDI SHHCNVKTEQ LQQFIKIMEQ KAQLPIYIDK NNLKESIFSV LHNDPQQVDK
DQHLPCEKFL KHACKSSNSF EVKLDATHQY QHLNNFMISL DPVENQLTIR DNNNKTETFS
FTNLQWENLL QYYKENHQQP NIAGSRNLTD NIDKIKNTIS TSEIIECASP EIRSSVLNDL
YSIANFLPDN NLTPNESWKR FCETCERFYV AQKSITGDKS ERLTRKLSIS DAGITMTFKI
GDVVINTIST AIPEDATGQR CIEGLNLAEM DLTDIDLSKM ALRNVNFNGS ILRNAKFSGT
ICEGVDFTDC DLRNAEFENA SLENNDFRKV RHLTYVNFKN ANLRNSNFNG KVLTGVTFTG
SDLSNAYLEH IDFTTVILYE TSKIPGIPGT PQIPGTPKVI LTGAILNYSD LSGKDLSEYN
LTGILCMYTN FSNANLTNCK ISNANFSNAK FYNTNCTGAN CSNILFDYAW FDNTIFIKTL
FKNTCFYNVR AKNVYLEGAY LNNDNIVNQA NNSTEKQSID STDKQANDST VQQSIDSTVQ
QANDSTDKQA NDNIDKQVND STDKQAKNST EQQDSNSFNQ ARLKKEVNRR FSIPGLTSYQ
PTYIVEE