Gene EcolC_1174 details

Gene Information

Locus tag: EcolC_1174
Symbol:
ID: 6065826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1286093
End bp: 1287634
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 51%
IMG OID: 641600590
Product: exopolyphosphatase
Protein accession: YP_001724168
Protein GI: 170019214
COG category: [F] Nucleotide transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
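
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1286093
end_bp = 1287634

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1542 0 513
```

Under these assumptions the result agrees with the listed Gene Length (1542 bp) and Protein Length (513 aa).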


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0153917
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAATAC ACGATAAATC CCCTCGTCCG CAGGAGTTTG CTGCGGTCGA TCTTGGTTCA 
AACAGTTTTC ACATGGTCAT AGCCCGTGTG GTAGATGGTG CCATGCAGAT TATTGGCCGC
CTGAAACAGC GGGTGCATCT GGCGGACGGC CTGGGGCCAG ATAATATGTT GAGTGAAGAG
GCAATGACGC GCGGTTTGAA CTGTCTGTCG CTTTTTGCCG AACGGCTACA AGGGTTTTCT
CCTGCCAGCG TCTGTATAGT TGGTACCCAT ACGCTGCGTC AGGCGCTGAA CGCCACTGAC
TTTCTGAAAC GCGCGGAAAA GGTCATTCCC TACCCGATTG AAATTATTTC CGGTAATGAA
GAAGCCCGTC TGATTTTTAT GGGAGTGGAA CATACCCAAC CGGAAAAAGG TCGCAAACTG
GTTATTGATA TTGGCGGCGG ATCTACGGAA CTGGTGATTG GTGAAAATTT CGAACCTATT
CTCGTTGAAA GCCGCCGGAT GGGTTGTGTC AGCTTTGCCC AGCTTTATTT TCCTGGCGGG
GTCATCAATA AAGAGAATTT TCAGCGCGCT CGCATGGCGG CAGCACAAAA ACTGGAAACT
TTAACCTGGC AATTCCGTAT TCAGGGCTGG AACGTTGCAA TGGGCGCTTC CGGTACCATA
AAAGCCGCCC ATGAAGTGTT AATGGAAATG GGCGAGAAAG ACGGGATAAT TACCCCGGAA
CGTCTGGAAA AACTGGTAAA AGAAGTTTTA CGTCACCGTA ATTTCGCATC GCTGAGTTTA
CCGGGTCTTT CCGAAGAGCG GAAAACAGTC TTCGTTCCGG GACTGGCGAT TTTATGCGGT
GTGTTTGATG CTTTAGCCAT CCGTGAACTG CGCCTTTCTG ACGGGGCGTT ACGCGAAGGC
GTACTGTATG AAATGGAAGG ACGTTTCCGT CATCAGGATG TGCGTAGTCG CACCGCCAGC
AGCCTCGCCA ACCAGTATCA CATCGACAGC GAACAGGCCC GACGGGTGCT GGATACCACT
ATGCAAATGT ACGAACAGTG GCGGGAACAG CAACCGAAGC TGGCGCATCC GCAACTGGAG
GCGCTACTGC GATGGGCCGC CATGCTGCAT GAGGTCGGGT TGAATATCAA CCACAGCGGT
TTGCATCGCC ACTCCGCTTA TATTCTGCAA AACAGTGACT TGCCGGGTTT TAATCAGGAA
CAGCAGCTGA TGATGGCGAC ACTGGTGCGC TATCACCGTA AAGCGATTAA GCTCGACGAT
CTGCCGCGCT TTACCTTGTT TAAGAAGAAA CAGTTCCTGC CACTGATACA GCTATTGCGC
CTTGGCGTAT TACTCAACAA TCAGCGTCAG GCAACCACCA CACCGCCAAC ATTGACGCTG
ATTACTGATG ACAGTCACTG GACACTGCGT TTCCCGCATG ACTGGTTTAG TCAGAATGCG
CTGGTACTGC TTGATCTGGA AAAGGAGCAA GAATACTGGG AAGGCGTGGC TGGCTGGCGG
TTGAAAATTG AAGAAGAAAG TACACCTGAA ATCGCCGCTT AA
 
Protein sequence
MPIHDKSPRP QEFAAVDLGS NSFHMVIARV VDGAMQIIGR LKQRVHLADG LGPDNMLSEE 
AMTRGLNCLS LFAERLQGFS PASVCIVGTH TLRQALNATD FLKRAEKVIP YPIEIISGNE
EARLIFMGVE HTQPEKGRKL VIDIGGGSTE LVIGENFEPI LVESRRMGCV SFAQLYFPGG
VINKENFQRA RMAAAQKLET LTWQFRIQGW NVAMGASGTI KAAHEVLMEM GEKDGIITPE
RLEKLVKEVL RHRNFASLSL PGLSEERKTV FVPGLAILCG VFDALAIREL RLSDGALREG
VLYEMEGRFR HQDVRSRTAS SLANQYHIDS EQARRVLDTT MQMYEQWREQ QPKLAHPQLE
ALLRWAAMLH EVGLNINHSG LHRHSAYILQ NSDLPGFNQE QQLMMATLVR YHRKAIKLDD
LPRFTLFKKK QFLPLIQLLR LGVLLNNQRQ ATTTPPTLTL ITDDSHWTLR FPHDWFSQNA
LVLLDLEKEQ EYWEGVAGWR LKIEEESTPE IAA
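
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. This is a minimal illustration, not the method used to produce the record: the function names are hypothetical, and the codon table is the standard one, which translation table 11 shares for sense and stop codons (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The spaces that group the sequence into blocks of ten are removed before processing.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# amino acid assignments and the same three stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First 30 bases of the gene sequence, copied from the record above.
first_bases = "ATGCCAATAC ACGATAAATC CCCTCGTCCG"
print(translate(first_bases))  # MPIHDKSPRP
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.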