Gene ECH74115_3725 details

Gene Information

Locus tag: ECH74115_3725
Symbol: ppx
ID: 6970225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3445374
End bp: 3446915
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 51%
IMG OID: 643387518
Product: exopolyphosphatase
Protein accession: YP_002271971
Protein GI: 209398375
COG category: [P] Inorganic ion transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
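
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 3445374
END_BP = 3446915
GENE_LENGTH_BP = 1542
PROTEIN_LENGTH_AA = 513


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value against the field stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3446915 - 3445374 + 1 gives 1542 bp, and 1542 / 3 gives 514 codons, one of which is the stop codon, leaving 513 residues.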


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.282301
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAATAC ACGATAAATC CCCTCGTCCG CAGGAGTTTG CTGCGGTCGA TCTTGGTTCA 
AACAGTTTTC ACATGGTCAT AGCCCGTGTG GTAGATGGTG CCATGCAGAT TATTGGCCGC
CTGAAACAGC GGGTGCATCT GGCGGACGGC CTGGGGCCAG ATAATATGTT GAGTGAAGAG
GCAATGACGC GCGGTTTGAA CTGTCTGTCG CTGTTTGCCG AACGGCTACA AGGGTTTTCT
CCTGCCAGCG TCTGTATAGT TGGTACCCAT ACGCTGCGTC AGGCGCTGAA CGCCACTGAC
TTTCTGAAAC GTGCGGAAAA GGTCATTCCC TACCCGATTG AAATTATTTC CGGTAATGAA
GAAGCCCGTC TGATTTTTAT GGGCGTGGAA CATACCCAAC CGGAAAAAGG TCGCAAACTG
GTTATTGATA TTGGCGGCGG TTCTACAGAA CTGGTGATTG GTGAAAATTT CGAACCGATT
CTCGTTGAAA GCCGCCGGAT GGGGTGTGTC AGCTTTGCCC AGCTTTATTT CCCTGGCGGG
GTCATCAATA AAGAGAATTT TCAGCGCGCT CGCATGGCGG CAGCACAAAA ACTGGAAACT
TTAACCTGGC AATTTCGTAT TCAGGGCTGG AACGTGGCAA TGGGCGCTTC CGGTACCATA
AAAGCCGCCC ATGAAGTGTT AATGGAAATG GGCGAGAAAG ACGGGATAAT TACCCCGGAA
CGTCTGGAAA AACTGGTAAA AGAAGTTTTA CGGCACCGTA ATTTCGCATC GCTGAGTTTA
CCAGGTCTTT CCGAAGAGCG GAAAACAGTC TTCGTTCCTG GACTAGCGAT TTTATGCGGT
GTGTTTGATG CTTTAGCCAT CCGTGAACTG CGCCTTTCTG ACGGGGCGTT ACGCGAAGGC
GTACTGTATG AAATGGAAGG ACGTTTCCGT CATCAGGATG TGCGTAGTCG CACCGCCAGC
AGCCTCGCCA ACCAGTATCA CATCGACAGC GAACAGGCCC GACGGGTGCT GGATACCACT
ATGCAAATGT ACGAACAGTG GCGGGAACAG CAACCGAAGC TGGCGCATCC GCAACTGGAG
GCGCTACTGC GATGGGCCGC CATGCTGCAT GAGGTCGGGT TGAATATCAA CCACAGCGGT
TTGCATCGTC ACTCCGCTTA TATTCTACAA AACAGCGACT TGCCGGGTTT TAATCAGGAA
CAGCAGCTGA TGATGGCGAC ACTGGTGCGC TATCACCGTA AAGCGATTAA ACTCGACGAT
CTGCCGCGCT TTACCTTGTT TAAGAAGAAA CAGTTCCTGC CACTGATTCA GCTATTGCGC
CTTGGCGTAT TACTCAACAA TCAGCGTCAG GCAACCACCA CACCGCCAAC ATTGACGCTG
ATTACTGATG ACAGTCACTG GACACTGCGT TTCCCGCATG ACTGGTTTAG TCAGAATGCG
CTGGTACTGC TTGATCTGGA AAAGGAGCAA GAATACTGGG AAGGCGTGGC TGGCTGGCGG
TTGAAAATTG AAGAAGAAAG TACACCTGAA ATCGCAGCTT AA
 
Protein sequence
MPIHDKSPRP QEFAAVDLGS NSFHMVIARV VDGAMQIIGR LKQRVHLADG LGPDNMLSEE 
AMTRGLNCLS LFAERLQGFS PASVCIVGTH TLRQALNATD FLKRAEKVIP YPIEIISGNE
EARLIFMGVE HTQPEKGRKL VIDIGGGSTE LVIGENFEPI LVESRRMGCV SFAQLYFPGG
VINKENFQRA RMAAAQKLET LTWQFRIQGW NVAMGASGTI KAAHEVLMEM GEKDGIITPE
RLEKLVKEVL RHRNFASLSL PGLSEERKTV FVPGLAILCG VFDALAIREL RLSDGALREG
VLYEMEGRFR HQDVRSRTAS SLANQYHIDS EQARRVLDTT MQMYEQWREQ QPKLAHPQLE
ALLRWAAMLH EVGLNINHSG LHRHSAYILQ NSDLPGFNQE QQLMMATLVR YHRKAIKLDD
LPRFTLFKKK QFLPLIQLLR LGVLLNNQRQ ATTTPPTLTL ITDDSHWTLR FPHDWFSQNA
LVLLDLEKEQ EYWEGVAGWR LKIEEESTPE IAA
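
The sequence blocks above are printed in groups of ten characters separated by spaces. The following is a minimal sketch, not part of the original record, of how such blocks could be cleaned, checked for GC content, and translated. It assumes the whitespace is purely cosmetic. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Unknown codons (for example those containing N) are written as X.
    Alternative start codons are NOT converted to M here.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First thirty bases of the gene sequence shown above.
    print(translate("ATGCCAATAC ACGATAAATC CCCTCGTCCG"))
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two blocks agree; `gc_fraction` on the same input can be compared with the GC content field.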