Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0682 |
Symbol | |
ID | 6145280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 691795 |
End bp | 692835 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615572 |
Product | PhoH family protein |
Protein accession | YP_001742778 |
Protein GI | 170681748 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0121436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACATAG ACACTCGCGA AATCACCCTG GAGCCAGCAG ACAATGCGCG TCTGTTGAGC CTGTGCGGCC CGTTTGATGA CAACATCAAG CAGCTCGAAC GCCGTCTCGG CATCGAGATC AATCGCCGCG ATAACCACTT TAAACTGACC GGCCGTCCGA TTTGCGTCAC CGCTGCGGCA GACATTCTGC GTAGCCTGTA TGTCGATACT GCCCCGATGC GTGGTCAGAT TCAGGATATC GAACCGGAAC AGATCCACCT TGCGATTAAA GAAGCGCGGG TACTGGAGCA AAGCGCGGAG AGCGTGCCGG AGTACGGCAA AGCGGTCAAT ATCAAAACCA AACGCGGCGT AATCAAGCCG CGTACGCCAA ACCAGGCGCA GTACATCGCC AATATTCTCG ACCATGACAT CACCTTCGGC GTTGGCCCGG CGGGTACGGG TAAAACCTAC CTGGCAGTGG CTGCGGCAGT TGATGCCCTG GAGCGTCAGG AAATTCGCCG TATTCTGCTG ACTCGTCCGG CGGTAGAAGC CGGTGAGAAA CTGGGCTTCC TGCCTGGCGA TTTAAGCCAG AAAGTAGACC CGTATTTGCG CCCACTGTAC GACGCGCTGT TTGAAATGCT GGGCTTTGAG AAAGTCGAGA AACTGATTGA GCGCAACGTT ATTGAAGTCG CGCCGCTGGC CTATATGCGT GGTCGTACGC TGAACGACGC GTTTATCATT CTCGATGAGA GCCAGAACAC TACCATCGAA CAGATGAAGA TGTTCCTGAC CCGTATCGGT TTTAACTCAA AAGCGGTTAT CACCGGCGAC GTCACGCAGA TCGACTTGCC GCGTAATACT AAATCAGGCT TACGTCACGC TATCGAAGTG TTAGCCGATG TCGAAGAGAT CAGCTTTAAC TTCTTCCACA GCGAAGACGT GGTTCGTCAC CCGGTGGTGG CGCGTATCGT TAACGCTTAT GAAGCCTGGG AAGAAGCAGA ACAAAAACGT AAAGCGGCGC TGGCAGCAGA ACGCAAGCGC GAAGAACAGG AACAAAAATG A
|
Protein sequence | MNIDTREITL EPADNARLLS LCGPFDDNIK QLERRLGIEI NRRDNHFKLT GRPICVTAAA DILRSLYVDT APMRGQIQDI EPEQIHLAIK EARVLEQSAE SVPEYGKAVN IKTKRGVIKP RTPNQAQYIA NILDHDITFG VGPAGTGKTY LAVAAAVDAL ERQEIRRILL TRPAVEAGEK LGFLPGDLSQ KVDPYLRPLY DALFEMLGFE KVEKLIERNV IEVAPLAYMR GRTLNDAFII LDESQNTTIE QMKMFLTRIG FNSKAVITGD VTQIDLPRNT KSGLRHAIEV LADVEEISFN FFHSEDVVRH PVVARIVNAY EAWEEAEQKR KAALAAERKR EEQEQK
|
| |