Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5123 |
Symbol | |
ID | 6969957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4763465 |
End bp | 4764529 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643388795 |
Product | putative oxidoreductase |
Protein accession | YP_002273221 |
Protein GI | 209398169 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.121383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACATT TCGACGTGGC GATTATTGGC CTCGGCCCGG CAGGGTCGGC GTTGGCACGA AAGTTAGCCG GCAAAATGCA GGTGATCGCG CTGGATAAAA AGCACCAGCA TGGTACTGAA GGTTTCAGCA AGCCTTGTGG CGGTCTGCTG GCACCGGACG CGCAGCGATC TTTTATTCGC GATGGACTGA CGCTTCCTGT CGATGTGATC GCCAATCCAC AGATTTTCAG CGTCAAAACT GTCGACGTCG CCGCATCGCT CACGCGTAAC TACCAGCGAA GCTATATCAA TATTAATCGC CATGCTTTCG ACTTGTGGAT GAAATCGCTG ATCCCCGCCA GCGTTGAGGT TTATCACGAT AGCCTGTGCC GGAAAATCTG GCGTGAGGAT GATAAATGGC ATGTCATTTT TCGTGCAGAC GGTTGGGAGC AGCATATTAC TGCCCGCTAT CTGGTCGGTG CAGATGGTGC CAACTCGATG GTGCGGCGAT ATCTCTACCC GGACCATCAG ATTCGTAAAT ATGTCGCTAT CCAGCAGTGG TTCGCAGAGA AACATCCGGT GCCGTTCTAC TCATGCATCT TTGATAATGC GATAACTGAC TGTTACTCAT GGAGTATCAG CAAAGACGGT TATTTTATCT TTGGCGGTGC CTATCCAATG AAAGACGGTC AGACGCGTTT CACGACGCTG AAAGAGAAAA TGAGCGCCTT TCAATTCCAG TTTGGTAAGG CGGTAAAAAG CGAAAAATGC ACGGTGCTAT TTCCCTCACG CTGGCAGGAT TTTGTCTGCG GTAAGGACAA CGCCTTTCTG ATTGGTGAAG CGGCGGGATT TATCAGCGCC AGCTCGCTGG AGGGGATAAG CTATGCGCTG GATAGCGCAG AGATTCTGCG TTCGGTGTTA CTGAAGCAGC CAGAGAAGAT CAACGCAGCC TACTGGCACG CCACCCGCAA ACTGCGTTTA AAACTCTTCG GCAAGATAGT AAAAAGCCGA TGCCTGACCG CACCGGCTTT AAGAAAGTGG ATTATGCGCA GTGGTATGGC GCATATTCCA CAGTTGAAAG ATTAG
|
Protein sequence | MEHFDVAIIG LGPAGSALAR KLAGKMQVIA LDKKHQHGTE GFSKPCGGLL APDAQRSFIR DGLTLPVDVI ANPQIFSVKT VDVAASLTRN YQRSYININR HAFDLWMKSL IPASVEVYHD SLCRKIWRED DKWHVIFRAD GWEQHITARY LVGADGANSM VRRYLYPDHQ IRKYVAIQQW FAEKHPVPFY SCIFDNAITD CYSWSISKDG YFIFGGAYPM KDGQTRFTTL KEKMSAFQFQ FGKAVKSEKC TVLFPSRWQD FVCGKDNAFL IGEAAGFISA SSLEGISYAL DSAEILRSVL LKQPEKINAA YWHATRKLRL KLFGKIVKSR CLTAPALRKW IMRSGMAHIP QLKD
|
| |