Gene Information |
Locus tag | ECH74115_4952 |
Symbol | |
ID | 6971265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4593466 |
End bp | 4595436 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388635 |
Product | hypothetical protein |
Protein accession | YP_002273062 |
Protein GI | 209399472 |
COG category | [S] Function unknown |
COG ID | [COG3533] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
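
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates (the convention under which 4593466..4595436 spans 1971 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4593466, 4595436)
print(length)                  # 1971
print(protein_length(length))  # 656
```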
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACATTT CGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGTCAG TACCAACAAC TGGTCCGCGA CGTGGTGATT CCTTATCAGT GGGATGCCTT GAACGATCGT ATCCCAGAAG CGGAACCCAG CCATGCGATT GAAAACTTTC GCATTGCCGC CGGACTTCAG GAGGGTGAAT TTTACGGGAT GGTGTTTCAG GACAGCGACG TCGCCAAATG GCTGGAAGCG GTAGCCTGGT CGCTGTGCCA GAAGCCGGAC GCCGAACTGG AAAAAACCGC CGACGAGGTA ATCGAACTGA TCGCCTCCGC CCAATGCGAA GACGGCTATC TCAATACTTA CTTTACGGTA AAAGCACCCG AAGAACGCTG GAGCAATCTG GCGGAGTGTC ATGAACTTTA CTGCGCAGGT CATCTGATTG AAGCCGGAGT CGCCTTCTTC CAGGCCACGG GAAAACGGCG CTTGCTGGGG GTCGTTTGCC GCCTGGCCGA TCATATCGAC AGCGTATTTG GTCCAGATGA AAGTAAGTTA CACGGTTATC CTGGTCACCC GGAAATTGAA CTGGCACTAA TGCGCCTGTA TGAAGTGACT GAAGAGCCGC GCTACCTGGC GCTGACGAAC TATTTTGTCG AACAGCGTGG TGCGCAACCG CACTATTACG ACCAGGAATA TGAAAAGCGC GGGCAGACAT CGCACTGGCA CACCTACGGC CCGGCGTGGA TGGTGAAAGA CAAAGCCTAC AGCCAGGCAC ATTTGCCCCT TGCGCAACAG CAAACCGCCA TCGGTCACGC GGTACGTTTT GTCTATCTGA TGACCGGCGT CGCGCATCTC GCGCGTTTAA GTCACGATGA CAGCAAGCGT CAGGACTGCC TGCGGCTGTG GAACAATATG GCCCAGCGTC AGTTATATAT TACCGGCGGC ATCGGCTCAC AAAGCAGCGG CGAAGCGTTC AGCAGCGATT ACGATCTGCC GAATGACACG GTTTACGCCG AAAGTTGTGC TTCCATCGGC CTGATGATGT TCGCCCGACG AATGCTGGAA ATGGAAGGCG ACAGTCAATA TGCCGATGTG ATGGAGCGCG CACTGTACAA CACTGTGCTC GGCGGCATGG CATTGGATGG CAAACATTTC TTCTATGTGA ATCCACTGGA AGTACATCCA AAATCGCTGA AATTCAACCA TATCTACGAT CACGTTAAAC CGATCCGCCA GCGTTGGTTT GGCTGCGCTT GTTGTCCGCC AAATATCGCC CGCGTGCTGA CCTCGATTGG TCATTATCTC TACACGCCGC GTGAAGATGC GTTGTATATC AACATCTACG CAGGAAACAG CATGGAAGTG CCGGTAGAAA ATGGCACGCT GCGCCTGCGG GTTAGCGGGA ACTATCCGTG GCAGGAGCAG GTGACGATTG CGGTTGAATC GCCCCAGCCG GTACGTCATA CGCTGGCTTT ACGTCTGCCG GACTGGTGCA CACAGCCGCA GATCATATTG AATGGGGAAG AGGTCGAGCA GGATATTCGT AAAGGGTATT TGCACATTAC CCGCGAATGG CAGGAGGGCG ATACGCTGAA TCTGACTTTG CCGATGCCGG TACGCCGCGT TTACGGTAAC CCGCTGGTGC GTCACGTCGC CGGAAAAGTG GCGATTCAGC GCGGCCCGCT GGTGTATTGC CTGGAAAAGG CCGACAACGG CGAGTCACTG CATAATCTGT GGCTGCCCAC CGATGCGCCA TTTACGACAT TTGAAGGCAA GGGATTGTTT AGCCATAAGA TCTTAATCCA GGCACCGGGT 
TACCGGTATG AACAGAGCAA TCCAGAGCAG CAACCGCTGT GGCATTACGA CAGCGCGCCA GCCAAACGCC AGACGCAAAC TCTGACCTTT ATCCCGTGGT TTAGCTGGGC CAACCGGGGT GAAGGCGAAA TGCGGATCTG GGTGAATGAG GAAAAGCATT GCCATCCGTA G
Protein sequence | MNISEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGLQ EGEFYGMVFQ DSDVAKWLEA VAWSLCQKPD AELEKTADEV IELIASAQCE DGYLNTYFTV KAPEERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLG VVCRLADHID SVFGPDESKL HGYPGHPEIE LALMRLYEVT EEPRYLALTN YFVEQRGAQP HYYDQEYEKR GQTSHWHTYG PAWMVKDKAY SQAHLPLAQQ QTAIGHAVRF VYLMTGVAHL ARLSHDDSKR QDCLRLWNNM AQRQLYITGG IGSQSSGEAF SSDYDLPNDT VYAESCASIG LMMFARRMLE MEGDSQYADV MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLKFNHIYD HVKPIRQRWF GCACCPPNIA RVLTSIGHYL YTPREDALYI NIYAGNSMEV PVENGTLRLR VSGNYPWQEQ VTIAVESPQP VRHTLALRLP DWCTQPQIIL NGEEVEQDIR KGYLHITREW QEGDTLNLTL PMPVRRVYGN PLVRHVAGKV AIQRGPLVYC LEKADNGESL HNLWLPTDAP FTTFEGKGLF SHKILIQAPG YRYEQSNPEQ QPLWHYDSAP AKRQTQTLTF IPWFSWANRG EGEMRIWVNE EKHCHP
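
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for internal codons (the table differs mainly in which start codons it permits, and this gene begins with ATG). Only the first 30 and last 21 nucleotides of the record are used, and `translate` is a hypothetical helper rather than a library function.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring the spaces used for grouping."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first_30 = "ATGAACATTT CGGAAGTCGA TCTGCATAAA"  # start of the gene sequence
last_21 = "GAAAAGCATT GCCATCCGTA G"            # end of the gene sequence

print(translate(first_30))  # MNISEVDLHK
print(translate(last_21))   # EKHCHP*
```

The two outputs match the first ten and last six residues of the protein sequence, with `*` marking the terminal TAG stop codon.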