Gene Information |
Locus tag | ECH74115_1960 |
Symbol | |
ID | 6972158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1851661 |
End bp | 1852716 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385886 |
Product | gfo/idh/mocA family protein |
Protein accession | YP_002270375 |
Protein GI | 209397813 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
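The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon; the function names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1851661
END_BP = 1852716
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1852716 - 1851661 + 1 gives 1056 bp, and 1056 / 3 - 1 gives 351 aa, which matches the Gene Length and Protein Length fields.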
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.430175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.910454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAAAAGTG CAATGACAAG CTCTCCACTG CGGGTCGCGA TAATAGGCGC AGGCCAGGTG GCGGATAAGG TTCATGCTTC GTACTACTGC ACCCGCAACG ATCTGGAACT GGTGGCTGTC TGTGACAGCC GCCTTTCCCA GGCGCAGGCG CTGGCAGAAA AATACGGGAA TGCATCCGTG TGGGACGATC CGCAGGCCAT GCTGCTGGCG GTGAAACCTG ATGTGGTTAG CGTCTGCTCA CCTAACCGTT TTCATTACGA ACATACCCTG ATGGCGCTGG AAGCGGGCTG CCATGTGATG TGCGAAAAAC CGCCCGCCAT GACGCCAGAA CAGGCGCGGG AAATGTGCGA TACCGCGCGC AAACAGGGCA AGGTGCTGGC CTACGACTTT CACCATCGTT TTGCGCTCGA TACGCAACAG CTGCGTGAAC AGGTGACCAA CGGCGTTTTG GGAGAGATTT ACGTTACCAC CGCCCGCGCC CTGCGTCGCT GCGGCGTTCC CGGCTGGGGT GTCTTTACCA ATAAAGAACT GCAGGGTGGT GGCCCGCTGA TCGACATCGG CATTCATATG CTGGATGCTG CGATGTATGT GCTGGGCTAT CCTGCGGTGA AAAGCGTGAA TGCGCATAGC TTTCAAAAGA TCGGCACGCA AAAGAGCTGT GGTCAATTTG GCGAGTGGGA TCCGGCAACT TACAGCGTCG AAGATTCGCT GTTTGGCACC ATTGAATTTC ATAACGGCGG CATTCTGTGG CTGGAAACGT CATTTGCGCT CAACATCCGC GAACAGTCGA TTATGAACGT CAGCTTTTGT GGTGATAAAG CCGGTGCGAC GCTGTTTCCA GCACATATCT ACACCGATAA CAACGGTGAA TTAATGACGC TGATGCAACG GGAAATGGCA GACGACAACC GCCATTTGGG CAGCATGGAA GCCTTTATCA ATCACGTACA GGGCAAGCCC GTGATGATAG CCGACGCCGA GCAGGGGTAC ATCATCCAGC AACTGGTGGC GGCGTTATAT CAATCCGCAG AAACAGGGAC GCGTGTGGAA TTATGA
Protein sequence | MKSAMTSSPL RVAIIGAGQV ADKVHASYYC TRNDLELVAV CDSRLSQAQA LAEKYGNASV WDDPQAMLLA VKPDVVSVCS PNRFHYEHTL MALEAGCHVM CEKPPAMTPE QAREMCDTAR KQGKVLAYDF HHRFALDTQQ LREQVTNGVL GEIYVTTARA LRRCGVPGWG VFTNKELQGG GPLIDIGIHM LDAAMYVLGY PAVKSVNAHS FQKIGTQKSC GQFGEWDPAT YSVEDSLFGT IEFHNGGILW LETSFALNIR EQSIMNVSFC GDKAGATLFP AHIYTDNNGE LMTLMQREMA DDNRHLGSME AFINHVQGKP VMIADAEQGY IIQQLVAALY QSAETGTRVE L
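The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where certain non-ATG codons can act as start codons and are then read as methionine. The sketch below illustrates this on the first ten codons of the gene sequence. It is an illustration only: the `translate` helper and the `ALT_STARTS` set are hypothetical names, and `ALT_STARTS` lists only a subset of the start codons permitted by table 11.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid assignments
# for table 11 are the same as for the standard code; only the set of
# permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 start codons, enough for this record.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the record.
    print(translate("GTGAAAAGTG CAATGACAAG CTCTCCACTG"))  # MKSAMTSSPL
```

The same helper can be applied to the full gene sequence to compare the result against the Protein sequence field.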