Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0053 |
Symbol | apaH |
ID | 6143405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 56910 |
End bp | 57758 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641614954 |
Product | diadenosine tetraphosphatase |
Protein accession | YP_001742170 |
Protein GI | 170681134 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0639] Diadenosine tetraphosphatase and related serine/threonine protein phosphatases |
TIGRFAM ID | [TIGR00668] bis(5'-nucleosyl)-tetraphosphatase (symmetrical) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.627509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.566701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACAT ACCTTATTGG CGACGTTCAT GGTTGTTACG ATGAACTGAT CGCATTGCTG CATAAAGTAG AATTTACCCC TGGGAAAGAT ACCCTCTGGC TGACGGGCGA TCTGGTCGCG CGCGGCCCGG GTTCGCTGGA TGTTCTGCGC TATGTGAAAT CCTTAGGCGA CAGCGTACGT CTGGTGCTGG GCAATCACGA TCTGCATCTG CTGGCGGTAT TTGCCGGGAT CAGCCGCAAT AAACCGAAAG ATCGCCTGAC ACCGCTGCTG GAAGCGCCGG ATGCCGACGA GCTGCTTAAC TGGCTGCGTC GCCAGCCTTT GCTGCAAATC GACGAAGAGA AAAAGCTGGT GATGGCCCAC GCCGGGATCA CGCCGCAGTG GGATCTGCAG ACCGCCAAAG AGTGCGCGCG CGATGTAGAA GCGGTGCTGT CGAGTGACTC CTATCCCTTC TTCCTTGATG CCATGTACGG CGATATGCCA AATAACTGGT CACCGGAATT GCGGGGGCTG GGAAGACTGC GGTTTATCAC CAACGCCTTT ACCCGGATGC GTTTTTGCTT CCCGAACGGT CAACTGGATA TGTACAGCAA AGAATCGCCG GAAGAGGCCC CTGCCCCACT GAAACCGTGG TTTGCGATTC CTGGCCCCGT CGCTGAAGAA TACAGCATCG CCTTTGGTCA CTGGGCATCG CTGGAGGGCA AAGGTACGCC GGAAGGTATT TACGCGCTGG ATACCGGCTG CTGTTGGGGC GGGACATTAA CCTGCCTGCG CTGGGAAGAT AAACAGTATT TTGTCCAGCC GTCGAACCGG CATAAAGATA TGGGTGAGGG AGAGGCGGCC GCGTCTTAA
|
Protein sequence | MATYLIGDVH GCYDELIALL HKVEFTPGKD TLWLTGDLVA RGPGSLDVLR YVKSLGDSVR LVLGNHDLHL LAVFAGISRN KPKDRLTPLL EAPDADELLN WLRRQPLLQI DEEKKLVMAH AGITPQWDLQ TAKECARDVE AVLSSDSYPF FLDAMYGDMP NNWSPELRGL GRLRFITNAF TRMRFCFPNG QLDMYSKESP EEAPAPLKPW FAIPGPVAEE YSIAFGHWAS LEGKGTPEGI YALDTGCCWG GTLTCLRWED KQYFVQPSNR HKDMGEGEAA AS
|
| |