Gene Information |
Locus tag | ECH74115_4038 |
Symbol | |
ID | 6967038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3732474 |
End bp | 3733379 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643387800 |
Product | hypothetical protein |
Protein accession | YP_002272243 |
Protein GI | 209399772 |
COG category | [R] General function prediction only |
COG ID | [COG1512] Beta-propeller domains of methanol dehydrogenase type |
TIGRFAM ID | |
|
|
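The coordinate and length fields in the table above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
START_BP = 3732474
END_BP = 3733379


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 906 301
```

Under these assumptions the result matches the listed Gene Length (906 bp) and Protein Length (301 aa).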
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000052611 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.949055 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTAA TCTTTATTCT ATTCACTTTG TGGTGCCTTC CTGGGCTTGC CCAGCAAATT GCTGTGCCGG AACTGCGCCA ACAGGTGACC GATATTACTG GTACGTTAAG CACTTCTGAA CAACAATCAC TGACACAGCA ACTGCAGGAT ATTACACATA AAACACGGGC GCAGGTCGCT GTATTAGTAG TGCCGTCCAC CGGAGACGAT TCCATTGAAC AATATGCTAC AAGGGTTTTT GATAGCTGGA AACTGGGAGA TAAGCAGAGA AACGATGGTA TTTTACTTCT GGTTGCCTGG GAGGATCATG CTGTTCGTAT TGAGGTTGGC TACGGGCTGG AAGGGGTGGT TACCGACCTG CAAGCGGCGA AGATTATCAG GGATATATTG ATCCCTGCCT TCAAAAGTGA TGACCTGATG GGTGGATTAA CACTGGCAAG CGAAAATATC GGCGCGCTTC TGTTGAATGG TGAATTACCG GAAGACAGGG GGGATTATTA CAGTATCAAT CCCCCTATTC CATTATCGCT TGCTGTAATT ATATTGCTGG CGGTACTTTC ATATTTTATT GTTTTTACCG ATCCGTCGAA TTTACCCTGG ATCACTCTCA CTGGCGCCAT TTACGGGATG GTATTCCTCT ATGTTGCCGA GCCAGGACCA TGGACCAATT TAATCGTTGC CTGCGGTATG TTAACGCCTT TTGCGATCGT ACCACTGGTT ATTTTTTGGC TTATCGTCAA TAAAAAGCTA CGCGCCAAAT ACAAAAAACT CAGTAAAGAT AGAGCTTCAA GAAAAGGTTC TTCTTCATCC TCTTCCGGTG GCGGCTCAAG TGGCGGAGGC TCAGGCGGCG GATTTAGCGG CGGGGGCGGT TCTTCTGGCG GTGGCGGCGC GTCCGGTCGC TGGTAA
|
Protein sequence | MRLIFILFTL WCLPGLAQQI AVPELRQQVT DITGTLSTSE QQSLTQQLQD ITHKTRAQVA VLVVPSTGDD SIEQYATRVF DSWKLGDKQR NDGILLLVAW EDHAVRIEVG YGLEGVVTDL QAAKIIRDIL IPAFKSDDLM GGLTLASENI GALLLNGELP EDRGDYYSIN PPIPLSLAVI ILLAVLSYFI VFTDPSNLPW ITLTGAIYGM VFLYVAEPGP WTNLIVACGM LTPFAIVPLV IFWLIVNKKL RAKYKKLSKD RASRKGSSSS SSGGGSSGGG SGGGFSGGGG SSGGGGASGR W
|
| |
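The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. The following is a sketch under stated assumptions rather than the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard amino-acid assignments are used here.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCGCTTAA TCTTTATTCT ATTCACTTTG"
print(translate(first_blocks))  # MRLIFILFTL
```

The first three blocks of the gene sequence translate to the first block of the listed protein sequence. Passing the full gene sequence string to `gc_fraction` and `translate` allows the GC content and Protein sequence fields to be checked the same way.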