Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0054 |
Symbol | apaH |
ID | 6970653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 55345 |
End bp | 56193 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384136 |
Product | diadenosine tetraphosphatase |
Protein accession | YP_002268659 |
Protein GI | 209398997 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0639] Diadenosine tetraphosphatase and related serine/threonine protein phosphatases |
TIGRFAM ID | [TIGR00668] bis(5'-nucleosyl)-tetraphosphatase (symmetrical) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.257316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAT ACCTTATTGG CGACGTTCAT GGTTGTTACG ATGAACTGAT CGCATTGCTG CATAAAGTAG AATTTACCCC TGGGAAAGAT ACCCTCTGGC TGACGGGCGA TCTGGTCGCG CGCGGCCCGG GTTCGCTGGA TGTTCTGCGC TATGTGAAAT CCTTAGGCGA CAGCGTACGT CTGGTGCTGG GCAATCACGA TCTGCATCTG CTGGCAGTAT TTGCCGGGAT CAGCCGCAAT AAACCGAAAG ATCGCCTGAC ACCGCTGCTG GAAGCGCCGG ATGCCGACGA GCTGCTTAAC TGGCTGCGTC GCCAGCCTCT GCTGCAAATC GACGAAGAGA AAAAGCTGGT GATGGCCCAC GCAGGGATCA CGCCGCAGTG GGATCTGCAG ACCGCCAAAG AGTGCGCGCG CGATGTAGAA GCGGTGCTGT CGAGTGACTC CTATCCCTTC TTTCTCGACG CTATGTACGG CGATATGCCA AATAACTGGT CACCGGAATT GCGGGGGCTG GGAAGACTGC GGTTTATCAC CAACGCCTTT ACCCGTATGC GTTTTTGCTT CCCGAACGGT CAACTGGATA TGTACAGCAA AGAATCGCCG GAAGAGGCCC CTGCCCCACT GAAACCGTGG TTTGCGATTC CTGGACCTGT CGCTGAAGAA TACAGCATCG CCTTTGGTCA CTGGGCATCG CTGGAAGGCA AAGGTACGCC GGAAGGTATT TACGCGCTGG ATACCGGCTG CTGCTGGGGT GGGTCATTAA CCTGCCTGCG CTGGGAAGAT AAACAGTATT TTGTCCAGCC GTCGAACCGG CATAAGGATT TGGGTGAGGG AGAGGCGGTC GCGTCTTAA
|
Protein sequence | MATYLIGDVH GCYDELIALL HKVEFTPGKD TLWLTGDLVA RGPGSLDVLR YVKSLGDSVR LVLGNHDLHL LAVFAGISRN KPKDRLTPLL EAPDADELLN WLRRQPLLQI DEEKKLVMAH AGITPQWDLQ TAKECARDVE AVLSSDSYPF FLDAMYGDMP NNWSPELRGL GRLRFITNAF TRMRFCFPNG QLDMYSKESP EEAPAPLKPW FAIPGPVAEE YSIAFGHWAS LEGKGTPEGI YALDTGCCWG GSLTCLRWED KQYFVQPSNR HKDLGEGEAV AS
|
| |