Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2324 |
Symbol | |
ID | 6971978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2195948 |
End bp | 2197456 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643386202 |
Product | hypothetical protein |
Protein accession | YP_002270686 |
Protein GI | 209400318 |
COG category | [S] Function unknown |
COG ID | [COG5339] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG AACGCGCAAC TCAAACTGAC CGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT CATCGCGGCG TATTCAGCAG TCAGCTGCAA CTGTTGGTGA AACCCATTGC CGGAAAAGAA AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC TTCCCCCTTG CCCAGCTTAA AAAACTGAAC CTGATCCCGT CGATGGCATC AATTCAAACC ACGCTGGTTA ATAACGAAGT AAGCAAACCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT TTTGAGATTA ACTCGCGCAT TGGTTACAGC GGTGATTCCA GTTCCGATAT TTCGCTCAAG CCACTAAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA AATGCGGACA GAGACGGCAA AGCTATCTCC CTTTCCGGGG AGGCGCAAAG TGGTCGGATA GACGCGGTTA ACGAATACAA CCAGAAAGTA CAGTTGACCT TTAATAATCT GAAAACCGAC GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGTA AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGCGGCA AGCTGACTTT AAAAGTCGGC CAAATTGATG GCGAAGCCTG GCATCAGTTT AGCCAGCAAT ATAACGCGCA AACTCAGGCG CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TATCGCGCCG CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGGCGCAGG AAGTAGATCG TTCGGTTAAA TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTGAT GACTCAGGTA GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC ACCAGCCTGC AATATACTAA CGGTCAGATA ACGTTAAACG GGCAGAAAAT GCCACTGGAA GATTTCGTTG GTATGTTTGC AATGCCGGCA TTAAATGTTC CGGTCGTACC CGCTATTCCG CAGCAGTAA
|
Protein sequence | MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LIPSMASIQT TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLV KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP LSWKNSQGES ALNLSLFLKD PATTKEAPQT LAQEVDRSVK SLDAKLTIPV DMATELMTQV AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYTNGQI TLNGQKMPLE DFVGMFAMPA LNVPVVPAIP QQ
|
| |