Gene Information |
Locus tag | ECH74115_4757 |
Symbol | |
ID | 6971599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4402102 |
End bp | 4403139 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643388454 |
Product | putative dehydrogenase |
Protein accession | YP_002272882 |
Protein GI | 209400110 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
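The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; the record does not state either convention, and the function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    # Inclusive span covered by the gene on the replicon.
    span = end_bp - start_bp + 1
    # A CDS should contain a whole number of codons.
    codons, remainder = divmod(span, 3)
    return {
        "span_bp": span,
        "length_matches": span == gene_length_bp,
        "in_frame": remainder == 0,
        # One codon is the stop codon and encodes no amino acid.
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Gene Information table above.
result = check_gene_record(4402102, 4403139, 1038, 345)
print(result)
```

Under these assumptions the fields agree: 4403139 - 4402102 + 1 = 1038 bp, and 1038 / 3 = 346 codons, one of which is the stop codon, leaving 345 amino acids.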
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.523048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.406172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCATCA ACTGCGCCTT TATTGGCTTC GGCAAAAGCA CCACCCGTTA CCATCTGCCG TATGTACTTA ACCGCAAGGA TAGCTGGCAT GTCGCGCATA TTTTTCGCCG CCATGCAAAG CCGGAAGAAC AAGCTCCCAT TTACTCTCAT ATCCATTTCA CCAGCGATCT CAACGAAGTG CTAAACGATC CCGATGTTAA GCTGGTTGTT GTCTGCACCC ATGCGGACAG CCACTTCGAG TACGCGAAGC GCGCGATGGA AGCCGGGAAA AATGTGCTGG TCGAAAAACC GTTCACTCCG ACACTTGCGC AGGCGAAAGA GCTGTTTGCA CTGGCGAAAA GCAAAGGGCT GACCGTCACG CCATATCAGA ATCGTCGCTT TGATTCCTGC TTCCTGACGG CGAAAAAAGC GATTGAAAGC GGCAAGCTGG GAGAGATTGT CGAAGTGGAA AGCCATTTTG ACTATTACCG CCCGGTGGCA GAAACCAAAC CTGGGCTGCC GCAGGATGGC GCGTTCTATG GCCTTGGTGT GCATACGATG GACCAGATTA TTTCTCTGTT CGGTCGCCCG GATCACGTCG CTTATGACAT CCGCAGCCTG CGCAATAAAG CCAATCCTGA CGATACTTTC GAAGCGCAAC TGTTTTATGG CGATCTAAAA GCCATCGTCA AAACCAGCCA TCTGGTGAAA ATCGATTATC CGAAATTTAT CGTTCACGGT AAGAAAGGTT CGTTTATTAA ATATGGTATC GACCAGCAGG AAACCAGCCT GAAGGCTAAT ATTATGCCGG GCGAACCGGG ATTCGCAGCG GATGATTCGG TCGGTGTGCT GGAGTATGTC AATGACGAGG GCGTGACGGT CAGAGAAGAG ATGAAGCCGG AGATGGGCGA TTACGGGCGC GTTTATGATG CGTTGTATCA AACCATCACC CACGGTGCGC CAAATTACGT CAAGGAATCT GAAGTTCTTA CGAATCTGGA AATACTTGAA CGTGGATTCG AGCAAGCCTC TCCCTCCACA GTGACTCTCG CGAAGTAA
|
Protein sequence | MVINCAFIGF GKSTTRYHLP YVLNRKDSWH VAHIFRRHAK PEEQAPIYSH IHFTSDLNEV LNDPDVKLVV VCTHADSHFE YAKRAMEAGK NVLVEKPFTP TLAQAKELFA LAKSKGLTVT PYQNRRFDSC FLTAKKAIES GKLGEIVEVE SHFDYYRPVA ETKPGLPQDG AFYGLGVHTM DQIISLFGRP DHVAYDIRSL RNKANPDDTF EAQLFYGDLK AIVKTSHLVK IDYPKFIVHG KKGSFIKYGI DQQETSLKAN IMPGEPGFAA DDSVGVLEYV NDEGVTVREE MKPEMGDYGR VYDALYQTIT HGAPNYVKES EVLTNLEILE RGFEQASPST VTLAK
|
| |
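The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and checked against the GC content, start codon, and stop codon expected for a bacterial CDS. The helper names are hypothetical, and the start codon set lists only the most common initiators of translation table 11, not all of them. The example input is a toy fragment made from the first and last blocks of the gene sequence, not the full gene.

```python
# Common start codons and the stop codons for translation table 11.
# The start set is deliberately limited to the most frequent initiators.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked):
    """Join a sequence printed in whitespace-separated blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq):
    """Check frame, start codon, and stop codon of a candidate CDS."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOPS
    )


# Toy fragment: first and last 10-character blocks of the gene sequence.
fragment = clean_sequence("ATGGTCATCA CGAAGTAA")
print(len(fragment), round(gc_percent(fragment), 1), looks_like_cds(fragment))
```

Running the same functions on the full Gene sequence field should allow a comparison with the reported Gene Length and GC content values, keeping in mind that the record rounds GC content to a whole percent.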