Gene Information |
Locus tag | ECH74115_5846 |
Symbol | |
ID | 6966617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5497913 |
End bp | 5500096 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643389468 |
Product | hypothetical protein |
Protein accession | YP_002273860 |
Protein GI | 209399931 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
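The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(5497913, 5500096)
print(length)                     # 2184
print(protein_length_aa(length))  # 727
```

Under these assumptions the results agree with the Gene Length (2184 bp) and Protein Length (727 aa) fields. The strand field does not affect the length calculation; it only determines which strand of the replicon is read.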
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.909355 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTCCC TATTTAACAT ATACAAAGAT ATTTTCCCAA CACTCGGCAT GTATTCGGGA CTAAAAGCTT GCCATGAAAA AAACAACCTA CCATTTGATA TTAACACGGA AATTGAAACC ATACAAAAAC AAATTAATTA TGATATAAAT CATTTGAATG ATGGTTTGAT TAAGCGTGTA CTGAATCTTT TTATTCACCT TATCTCTAAT CCCGACAATC TTGAATTAAC CTTAAATAGA TATTCATCAA CAACAGAACA AATCATCGGC AGAACCAAAA GAAATGGTTT ACATGAGTTT GACGATGGCG ATCTAAAAAT AATATTTAAT CGACAAGATG ATAATGAAAG CGTATTAACT GTTAAAGATA AAGATAAAGA TAAAGATATA AGTCATCACT GCAATGTTAA AACCGAGCAA CTGCAGCAGT TTATTAAAAT AATGGAACAA AAAGCGCAAC TACCAATCTA TATTGACAAG AACAATTTGA AAGAGAGTAT TTTCTCTGTT TTGCACAATG ACCCACAACA AGTAGATAAA GATCAACACC TTCCCTGTGA AAAGTTTTTA AAACATGCCT GCAAAAGTTC AAATTCATTT GAAGTGAAAT TAGATGCCAC TCATCAATAT CAACACCTGA ATAACTTCAT GATTTCTTTG GACCCAGTAG AAAATCAATT AACCATACGG GATAACAATA ACAAGACTGA AACTTTCTCG TTTACAAACT TACAATGGGA AAATTTGCTG CAATACTACA AAGAAAACCA CCAGCAGCCA AATATAGCAG GATCACGAAA TCTCACGGAT AATATAGATA AAATTAAAAA TACAATATCC ACCTCTGAAA TTATTGAATG CGCCTCTCCT GAAATAAGAA GTAGCGTCCT GAACGATCTT TATAGCATTG CTAATTTCCT CCCGGACAAT AATCTGACCC CAAATGAGAG CTGGAAAAGA TTTTGTGAGA CATGCGAGCG CTTTTACGTT GCTCAGAAGA GTATCACTGG AGATAAGAGT GAACGTCTTA CGCGAAAACT CTCTATCTCT GATGCAGGAA TTACAATGAC CTTCAAGATA GGTGATGTTG TCATCAATAC TATTAGCACT GCTATTCCTG AAGATGCAAC GGGTCAACGG TGTATCGAAG GGTTGAATTT AGCAGAGATG GATTTAACCG ACATAGACTT GTCGAAAATG GCGCTAAGGA ATGTCAATTT TAATGGCAGC ATTCTTAGAA ATGCCAAGTT CTCCGGTACG ATCTGTGAAG GCGTGGATTT TACCGATTGT GATCTGCGTA ATGCAGAATT CGAAAATGCC TCATTAGAAA ATAATGATTT TCGTAAAGTT CGCCACTTGA CTTATGTAAA TTTCAAAAAC GCAAATCTAC GAAACAGTAA CTTCAACGGA AAAGTTCTCA CTGGCGTAAC CTTTACTGGA AGTGACCTTA GTAACGCGTA TCTTGAACAC ATAGATTTCA CAACCGTGAT TCTATATGAA ACATCTAAAA TACCTGGAAT ACCTGGAACA CCTCAAATAC CGGGAACACC TAAAGTAATT CTTACTGGCG CAATACTAAA TTATTCCGAT CTATCGGGAA AAGATCTTTC AGAATATAAT CTTACTGGTA TTCTCTGCAT GTATACCAAC TTTTCAAACG CTAATTTAAC AAATTGTAAA ATCTCTAATG CAAACTTTTC GAATGCAAAA TTCTACAATA CTAATTGTAC TGGTGCAAAT TGTTCGAATA TCCTATTTGA CTACGCATGG TTTGACAATA CAATATTTAT AAAAACGCTT 
TTTAAAAATA CCTGTTTTTA CAATGTCAGA GCGAAAAATG TCTATCTTGA GGGAGCATAT CTGAACAATG ATAATATCGT GAATCAAGCC AATAACAGTA CCGAGAAACA ATCCATTGAC AGTACCGATA AACAGGCCAA TGACAGTACG GTGCAACAAT CCATTGACAG TACGGTGCAA CAAGCCAATG ACAGTACCGA TAAACAAGCC AATGACAATA TCGATAAACA GGTCAATGAC AGTACCGATA AACAAGCCAA GAACAGTACC GAGCAACAGG ACAGTAACAG TTTTAATCAA GCCCGTTTAA AGAAAGAAGT GAATAGGAGA TTTTCCATTC CGGGTTTAAC GTCTTATCAG CCAACATATA TAGTTGAAGA ATAG
|
Protein sequence | MGSLFNIYKD IFPTLGMYSG LKACHEKNNL PFDINTEIET IQKQINYDIN HLNDGLIKRV LNLFIHLISN PDNLELTLNR YSSTTEQIIG RTKRNGLHEF DDGDLKIIFN RQDDNESVLT VKDKDKDKDI SHHCNVKTEQ LQQFIKIMEQ KAQLPIYIDK NNLKESIFSV LHNDPQQVDK DQHLPCEKFL KHACKSSNSF EVKLDATHQY QHLNNFMISL DPVENQLTIR DNNNKTETFS FTNLQWENLL QYYKENHQQP NIAGSRNLTD NIDKIKNTIS TSEIIECASP EIRSSVLNDL YSIANFLPDN NLTPNESWKR FCETCERFYV AQKSITGDKS ERLTRKLSIS DAGITMTFKI GDVVINTIST AIPEDATGQR CIEGLNLAEM DLTDIDLSKM ALRNVNFNGS ILRNAKFSGT ICEGVDFTDC DLRNAEFENA SLENNDFRKV RHLTYVNFKN ANLRNSNFNG KVLTGVTFTG SDLSNAYLEH IDFTTVILYE TSKIPGIPGT PQIPGTPKVI LTGAILNYSD LSGKDLSEYN LTGILCMYTN FSNANLTNCK ISNANFSNAK FYNTNCTGAN CSNILFDYAW FDNTIFIKTL FKNTCFYNVR AKNVYLEGAY LNNDNIVNQA NNSTEKQSID STDKQANDST VQQSIDSTVQ QANDSTDKQA NDNIDKQVND STDKQAKNST EQQDSNSFNQ ARLKKEVNRR FSIPGLTSYQ PTYIVEE
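The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The sketch below shows one possible way to strip the block formatting and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its output describes that 30-base fragment and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGGCTCCC TATTTAACAT ATACAAAGAT")
print(len(fragment))                    # 30
print(round(gc_fraction(fragment), 2))  # 0.33
```

Applying the same two functions to the full gene sequence field would give a value comparable to the GC content field, assuming that field is computed over the gene sequence alone.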