Gene Information |
Locus tag | ECH74115_4315 |
Symbol | |
ID | 6970009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3993101 |
End bp | 3993985 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388044 |
Product | oxidoreductase |
Protein accession | YP_002272482 |
Protein GI | 209399644 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTCATT TAAAAGACCC GACCACGCAG TATTACACTG GTGAATATCC CAAACAGAAA CAACCGACGC CAGGCATCCA GGCGAAGATG ACACCGGTAC CGGATTGCGG CGAGAAAACC TATGTTGGTA GCGGTCGCCT GAAAGATCGT AAAGCCCTGG TGACAGGGGG CGATTCCGGA ATCGGTCGCG CTGCCGCCAT CGCTTACGCG CGTGAAGGGG CTGACGTGGC GATCAGTTAT CTTCCCGTGG AAGAAGAAGA CGCTCAGGAT GTGAAAAAGA TCATTGAAGA ATGCGGACGC AAAGCCGTTC TGCTGCCAGG CGATTTAAGC GATGAGAAAT TTGCCCGTTC GCTGGTTCAC GAAGCGCATA AGGCCTTAGG TGGGCTGGAT ATTATGGCGC TGGTCGCCGG GAAACAGGTT GCCATTCCTG ATATTGCAGA CCTCACCAGC GAACAGTTTC AAAAGACCTT TGCCATTAAC GTTTTCGCGC TGTTCTGGCT AACCCAGGAA GCGATCCCCC TGCTACCGAA AGGTGCAAGT ATTATCACCA CTTCGTCAAT CCAGGCATAC CAGCCAAGCC CGCATTTACT GGACTATGCA GCTACGAAGG CGGCGATTCT GAACTACAGC CGAGGCCTGG CAAAACAGGT CGCGGAGAAA GGTATTCGGG TGAATATTGT CGCGCCAGGC CCGATCTGGA CAGCACTGCA AATTTCCGGC GGACAAACGC AGGATAAGAT CCCGCAGTTT GGTCAGCAAA CGCCGATGAA ACGTGCGGGG CAACCGGCGG AACTGGCCCC TGTATATGTT TATCTGGCAA GTCAGGAGTC GAGCTACGTC ACCGCAGAAG TGCACGGCGT GTGCGGCGGC GAGCATTTAG GTTAA
Protein sequence | MSHLKDPTTQ YYTGEYPKQK QPTPGIQAKM TPVPDCGEKT YVGSGRLKDR KALVTGGDSG IGRAAAIAYA REGADVAISY LPVEEEDAQD VKKIIEECGR KAVLLPGDLS DEKFARSLVH EAHKALGGLD IMALVAGKQV AIPDIADLTS EQFQKTFAIN VFALFWLTQE AIPLLPKGAS IITTSSIQAY QPSPHLLDYA ATKAAILNYS RGLAKQVAEK GIRVNIVAPG PIWTALQISG GQTQDKIPQF GQQTPMKRAG QPAELAPVYV YLASQESSYV TAEVHGVCGG EHLG
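The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database API. The `gc_percent` helper ignores the spaces that separate the 10-base blocks in the Gene sequence field, so the sequence can be pasted in as shown.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(3993101, 3993985)
    print(length)                  # 885
    print(protein_length(length))  # 294
```

Both values agree with the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` gives a figure that can be compared with the rounded GC content field.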