Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5122 |
Symbol | |
ID | 6971662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4762150 |
End bp | 4763400 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388794 |
Product | hypothetical protein |
Protein accession | YP_002273220 |
Protein GI | 209398871 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.116872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCGG GAACCGTTTT GTATCAGGAC CGCGCCATGA AACAGATAAC CTTTGCTCCC CGTAATCACC TGCTCACCAA TACCAATACC TGGACGCCCG ACAGCCAGTG GCTGGTATTT GACGTGCGTC CTTCTGGCGC GTCGTTTACC GGCGAGACCA TTGAGCGTGT GAATATCCAT ACCGGCGAGG TCGAGGTTAT CTATCGCGTG TCACAGGGCG CGCACGTCGG CGTGGTGACC GTTCATCCAA AGTCAGAGAA ATATGTCTTT ATCCACGGAC CTGAAAATCC TCATGAAACA TGGCATTACG ATTTCCATCA CCGTCGCGGC GTGATTGTTG AAGGCGGCAA GATGAACAAT CTCGATGCAA TGGATATTAC CGTTCCGTAC ACATCAGGGG CGCTGCGTGG CGGCAGCCAT GTGCATGTCT TTAGCCCGAA CGGTGAAATG GTGAGTTTTA CCTATAACGA CCATGTAATG CATCAGTTCG ATTCGGCGCT GGATTTGCGA AACGTCGGGG TTGCTGCACC GTTTGGCCCG GTCAACGTAC AAAAGCAGCA TCCGCGTGAA TACAGCGGTA GCCACTGGTG CGTGCTGGTG AGTAAAACCA CGCCCACGCC GCAGCCTGGC AGCGATGAAA TCAATCGTGC TTATGAAGAA GGATGGGTAG GAAATCACGC GCTGGCGTTT ATTGGCGACA CACTTTCGCC AAAGGGCGAG AAAGTGCCGG AGCTGTTTAT CGTTGAGTTA CCGCAAGATG AAGCTGGCTG GAAAGCAGCA GGTGATGCGC CGTTAAGTGG AACGGAAACA ACCCTGCCCG CGCCACCGCG TGGCATCGTG CAGCGACGTT TAACCTTTAC CCACCATCGG GCTTATCCGG GGTTAGTCAA CGTCCCGCGC CACTGGGTGC GCTGTAATCC GCAGGGTACG CAAATCGCGT TTTTAATGCG AGATGATAAC GGCATTGTGC AACTGTGGCT TATCTCGCCA CAGGGCGGCG AGCCGCGCCA GTTAACCCAT AACAAAACGG ATATTCAGTC TGCATTTAAC TGGCATCCGT CAGGAGAATG GTTGGGCTTT GTGCTGGGTA ATCGAATTGC TTGTGCCCAT GCGCAAAGCG GCGAGGTTGA GTATTTAACC GAAAACCACG CCAATCCACC TTCTGCGGAC GCCGTGGTCT TCTCGCCGGA TGGTCAGTGG CTGGCGTGGA TGGAAGGCGG CCAGCTGTGG ATCACCGAAA CTGATCGCTA A
|
Protein sequence | MMAGTVLYQD RAMKQITFAP RNHLLTNTNT WTPDSQWLVF DVRPSGASFT GETIERVNIH TGEVEVIYRV SQGAHVGVVT VHPKSEKYVF IHGPENPHET WHYDFHHRRG VIVEGGKMNN LDAMDITVPY TSGALRGGSH VHVFSPNGEM VSFTYNDHVM HQFDSALDLR NVGVAAPFGP VNVQKQHPRE YSGSHWCVLV SKTTPTPQPG SDEINRAYEE GWVGNHALAF IGDTLSPKGE KVPELFIVEL PQDEAGWKAA GDAPLSGTET TLPAPPRGIV QRRLTFTHHR AYPGLVNVPR HWVRCNPQGT QIAFLMRDDN GIVQLWLISP QGGEPRQLTH NKTDIQSAFN WHPSGEWLGF VLGNRIACAH AQSGEVEYLT ENHANPPSAD AVVFSPDGQW LAWMEGGQLW ITETDR
|
| |