Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0707 |
Symbol | |
ID | 5591259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 721348 |
End bp | 722388 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640919885 |
Product | PhoH family protein |
Protein accession | YP_001457466 |
Protein GI | 157160148 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.0340161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACATAG ACACTCGCGA AATCACCCTG GAGCCAGCAG ACAACGCGCG TCTGTTGAGC CTGTGCGGCC CGTTTGATGA CAACATCAAG CAGCTCGAAC GCCGTCTCGG CATCGAGATC AATCGCCGCG ACAACCACTT TAAACTGACC GGCCGTCCGA TTTGCGTCAC CGCTGCGGCA GACATTCTGC GTAGCCTGTA TGTCGATACA GCCCCGATGC GCGGTCAGAT TCAGGATATC GAACCGGAAC AGATCCACCT TGCGATTAAA GAAGCAAGGG TACTGGAGCA AAGCGCGGAG AGCGTGCCGG AGTACGGCAA AGCGGTCAAT ATCAAAACCA AACGCGGCGT AATTAAGCCG CGCACGCCAA ACCAGGCGCA GTACATCGCC AATATTCTCG ACCATGACAT TACCTTCGGC GTTGGCCCGG CGGGTACGGG TAAAACCTAC CTGGCAGTCG CTGCAGCAGT TGATGCCCTG GAGCGTCAGG AAATTCGCCG TATTCTGCTG ACTCGTCCGG CGGTCGAAGC CGGTGAGAAA CTGGGCTTCC TGCCTGGCGA TTTAAGCCAG AAAGTAGACC CGTATCTGCG CCCACTGTAC GACGCGCTGT TTGAAATGCT GGGCTTTGAG AAAGTCGAGA AACTGATTGA GCGCAACGTT ATTGAAGTCG CGCCGCTGGC CTATATGCGT GGTCGTACGC TGAATGACGC CTTTATCATT CTCGATGAGA GCCAGAACAC CACCATCGAA CAGATGAAGA TGTTCCTGAC CCGTATCGGT TTTAACTCAA AAGCGGTTAT CACCGGCGAC GTCACACAGA TCGACCTGCC GCGTAATACT AAATCAGGCT TACGTCATGC CATCGAAGTG CTGGCCGATG TCGAAGAGAT CAGCTTTAAC TTCTTCCACA GCGAAGACGT GGTTCGTCAC CCGGTGGTGG CGCGTATCGT TAACGCCTAT GAAGCCTGGG AAGAAGCCGA ACAAAAACGT AAAGCGGCGC TGGCGGCAGA ACGCAAGCGC GAAGAACAGG AACAAAAATG A
|
Protein sequence | MNIDTREITL EPADNARLLS LCGPFDDNIK QLERRLGIEI NRRDNHFKLT GRPICVTAAA DILRSLYVDT APMRGQIQDI EPEQIHLAIK EARVLEQSAE SVPEYGKAVN IKTKRGVIKP RTPNQAQYIA NILDHDITFG VGPAGTGKTY LAVAAAVDAL ERQEIRRILL TRPAVEAGEK LGFLPGDLSQ KVDPYLRPLY DALFEMLGFE KVEKLIERNV IEVAPLAYMR GRTLNDAFII LDESQNTTIE QMKMFLTRIG FNSKAVITGD VTQIDLPRNT KSGLRHAIEV LADVEEISFN FFHSEDVVRH PVVARIVNAY EAWEEAEQKR KAALAAERKR EEQEQK
|
| |