Gene EcSMS35_2650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2650 
Symbolppx 
ID6143834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2709410 
End bp2710951 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content51% 
IMG OID641617521 
Productexopolyphosphatase 
Protein accessionYP_001744686 
Protein GI170682766 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAC ACGATAAATC CCCTCGTCCG CAGGAGTTTG CTGCGGTCGA TCTTGGTTCA 
AACAGTTTTC ACATGGTCAT AGCCCGTGTG GTAGATGGTG CCATGCAGAT TATTGGCCGC
CTGAAACAGC GGGTGCATCT GGCGGACGGC CTGGGGCCAG ATAATATGTT GAGTGAAGAA
GCAATGACGC GCGGTTTAAA CTGTCTGTCG CTGTTTGCCG AACGGCTACA AGGGTTTTCT
CCTGCCAGCG TCTGTATAGT TGGCACCCAT ACACTGCGTC AGGCGCTGAA CGCCACTGAC
TTTCTGAAAC GCGCGGAAAA GGTCATTCCC TACCCGATTG AAATTATTTC CGGTAATGAA
GAAGCCCGTC TGATTTTTAT GGGCGTGGAA CATACCCAAC CGGAGAAAGG TCGCAAACTG
GTTATTGATA TTGGCGGCGG ATCTACGGAA CTGGTGATTG GTGAAAATTT CGAACCTATT
CTCGTTGAAA GCCGCCGGAT GGGTTGTGTC AGCTTTGCCC AGCTTTATTT CCCTGGCGGG
GTCATCAATA AAGAGAATTT CCAGCGCGCT CGTATGGCGG CAGCACAAAA ACTGGAAACG
TTAACCTGGC AATTCCGTAT TCAGGGCTGG AACGTGGCAA TGGGCGCTTC CGGTACCATA
AAAGCCGCCC ATGAAGTGTT AATGGAAATG GGCGAGAAAG ACGGGATAAT TACCCCGGAA
CGTCTGGAAA AACTGGTAAA AGAAGTTTTA CGTCACCGTA ATTTCGCATC GCTGAGTTTA
CCGGGTCTTT CCGAAGAGCG GAAAACAGTC TTCGTTCCTG GACTGGCGAT TTTATGCGGT
GTGTTTGATG CTTTAGCCAT CCGTGAACTG CGCCTTTCTG ACGGGGCGTT ACGCGAAGGC
GTACTGTATG AAATGGAAGG ACGTTTCCGT CATCAGGATG TGCGTAGTCG CACCGCCAGC
AGCCTCGCCA ACCAGTATCA CATCGACAGC GAACAGGCCC GACGGGTGCT GGATACCACT
ATGCAAATGT ACGAACAGTG GCGGGAACAG CAACCGAAGC TGGCGCATCC GCAACTGGAG
GCGCTACTGC GATGGGCCGC CATGCTGCAT GAGGTCGGGT TGAATATCAA CCACAGCGGT
TTGCATCGCC ACTCCGCTTA TATTCTGCAA AACAGCGACT TGCCGGGTTT TAATCAGGAA
CAGCAGTTGA TGATGGCGAC GCTGGTGCGC TATCACCGTA AAGCGATTAA GCTCGACGAT
CTGCCGCGCT TTACCTTGTT TAAGAAGAAA CAGTTTCTGC CACTGATTCA GCTATTGCGC
CTTGGCGTAT TACTCAATAA TCAACGTCAG GCAACCACCA CACCGCCAAC ATTGACGCTG
ATTACCGATG ACAGTCACTG GACACTGCGT TTCCCGCATG ACTGGTTTAG TCAGAATGCG
CTGGTACTGC TTGATCTGGA AAAGGAGCAA GAATACTGGG AAGGCGTGGC TGGCTGGCGG
TTGAAAATTG AAGAAGAAAG TACCCCAGAA ATCGCCGCTT AA
 
Protein sequence
MPIHDKSPRP QEFAAVDLGS NSFHMVIARV VDGAMQIIGR LKQRVHLADG LGPDNMLSEE 
AMTRGLNCLS LFAERLQGFS PASVCIVGTH TLRQALNATD FLKRAEKVIP YPIEIISGNE
EARLIFMGVE HTQPEKGRKL VIDIGGGSTE LVIGENFEPI LVESRRMGCV SFAQLYFPGG
VINKENFQRA RMAAAQKLET LTWQFRIQGW NVAMGASGTI KAAHEVLMEM GEKDGIITPE
RLEKLVKEVL RHRNFASLSL PGLSEERKTV FVPGLAILCG VFDALAIREL RLSDGALREG
VLYEMEGRFR HQDVRSRTAS SLANQYHIDS EQARRVLDTT MQMYEQWREQ QPKLAHPQLE
ALLRWAAMLH EVGLNINHSG LHRHSAYILQ NSDLPGFNQE QQLMMATLVR YHRKAIKLDD
LPRFTLFKKK QFLPLIQLLR LGVLLNNQRQ ATTTPPTLTL ITDDSHWTLR FPHDWFSQNA
LVLLDLEKEQ EYWEGVAGWR LKIEEESTPE IAA