Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0753 |
Symbol | |
ID | 6968372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 775774 |
End bp | 776814 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384782 |
Product | PhoH family protein |
Protein accession | YP_002269295 |
Protein GI | 209396833 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00493548 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACATAG ACACTCGCGA AATCACCCTG GAGCCAGCAG ACAACGCGCG TCTGTTGAGC CTGTGCGGCC CGTTTGATGA CAACATCAAG CAGCTCGAAC GCCGTCTCGG CATCGAGATC AATCGCCGCG ATAACCACTT TAAACTGACC GGCCGTCCGA TTTGCGTCAC CGCTGCGGCA GACATTCTGC GTAGCCTGTA TGTCGATACT GCCCCGATGC GCGGTCAGAT TCAGGATATC GAACCGGAAC AGATCCACCT TGCGATTAAA GAAGCACGGG TACTGGAGCA AAGCGCGGAG AGCGTGCCGG AGTACGGCAA AGCGGTCAAT ATCAAAACCA AACGCGGCGT AATTAAGCCG CGCACGCCAA ACCAGGCGCA GTACATCGCC AATATTCTCG ACCATGACAT TACCTTCGGC GTTGGCCCGG CGGGTACGGG TAAAACCTAC CTGGCAGTGG CTGCGGCAGT TGATGCCCTG GAGCGTCAGG AAATTCGCCG TATTCTGCTG ACTCGTCCGG CAGTAGAAGC CGGTGAGAAA CTGGGCTTCC TGCCTGGCGA TTTAAGCCAG AAAGTAGACC CGTATCTGCG CCCGCTGTAC GACGCGCTGT TTGAAATGCT GGGTTTTGAG AAAGTCGAGA AACTGATTGA GCGCAACGTT ATTGAAGTCG CACCGCTGGC CTATATGCGT GGTCGTACGC TGAACGACGC ATTTATCATT CTCGATGAGA GCCAGAACAC TACCATCGAA CAGATGAAGA TGTTCCTGAC CCGTATCGGT TTTAACTCAA AAGCGGTTAT CACCGGCGAC GTCACACAGA TCGACCTGCC GCGTAATACT AAATCAGGCT TACGTCATGC CATCGAAGTG CTGGCCGATG TCGAAGAGAT CAGCTTTAAC TTCTTCCACA GCGAAGACGT GGTTCGTCAC CCGGTGGTGG CGCGTATCGT TAACGCCTAT GAAGCCTGGG AAGAAGCCGA ACAAAAACGT AAAGCGGCGC TGGCGGCAGA ACGCAAGCGC GAAGAACAGG AACAAAAATG A
|
Protein sequence | MNIDTREITL EPADNARLLS LCGPFDDNIK QLERRLGIEI NRRDNHFKLT GRPICVTAAA DILRSLYVDT APMRGQIQDI EPEQIHLAIK EARVLEQSAE SVPEYGKAVN IKTKRGVIKP RTPNQAQYIA NILDHDITFG VGPAGTGKTY LAVAAAVDAL ERQEIRRILL TRPAVEAGEK LGFLPGDLSQ KVDPYLRPLY DALFEMLGFE KVEKLIERNV IEVAPLAYMR GRTLNDAFII LDESQNTTIE QMKMFLTRIG FNSKAVITGD VTQIDLPRNT KSGLRHAIEV LADVEEISFN FFHSEDVVRH PVVARIVNAY EAWEEAEQKR KAALAAERKR EEQEQK
|
| |