Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0389 |
Symbol | |
ID | 6966850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 396595 |
End bp | 397977 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384441 |
Product | putative deaminase |
Protein accession | YP_002268956 |
Protein GI | 209396661 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.776675 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA ATAATAGCCG CCGTGAATTT CTGAGCCAGA GCGGTAAGAT GGTCACCGCC GCCGCGCTGT TTGGTACCTC TGTGCCGCTC GCCCATGCGG CGGTCTCTGG CACCACAAAC TGCGAAGCGA ACAACACCAT GAAAATCACT GACCCGCATT ACTATCTCGA CAACGTGCTG CTGGAAACCG GTTTTGACTA CGAAAATGGC GTGGCGGTGC AGACCCGCAC GGCGCGCCAG ACCGTGGAGA TTCAGAACGG CAAAATTGTC GCGCTGCGCG AGAATAAGCA GCACCCGGAT GCCACGCTGC CGCACTATGA CGCTGGCGAT AAGCTGATGC TGCCCACCAC CCGCGACATG CATATTCATC TCGACAAAAC GTTTTACGGC GGGCCGTGGC GCTCGCTCAA TCGCCCGGCA GGCACCACCA TCCAGGACAT GATCAAACTC GAGCAGAAAA TGCTGCCGGA GCTGCAACCG TACACTCAGG AGCGGGCAGA AAAACTGATT GATTTATTGC AGTCGAAAGG CACCACCATT GCCCGCAGCC ACTGCAATAT CGAACCGGTT TCCGGCCTGA AAAATCTGCA AAATTTGCAG GCGGTGCTGG CGCGACGTCA GGCGGGCTTT GAGTGTGAAA TCGTCGCCTT CCCGCAGCAC GGTTTGCTGC TGTCGAAATC GGAACCCTTA ATGCGTGAAG CAATGCAGGC GGGTGCGCAT TACGTCGGCG GGCTGGACCC GACCAGTGTC GATGGCGCGA TGGAAAAATC CCTCGACACC ATGTTCCAGA TTGCGCTGGA CTACGACAAA GGCGTCGATA TTCACCTGCA CGAAACCACT CCGTCGGGCG TGGCAGCCAT CAATTATATG GTTGAAACGG TAGAGAAAAC GCCTCAACTG AAAGGTAAGC TGACCATCAG CCACGCCTTT GCGCTGGCTA CGCTCAACGA GCAACAGGTA GATGAACTGG CGCATCGCAT GGCGGCGCAG CAAATTTCTA TCGCCTCGAC GGTGCCGATT GGCACGCTGC ATATGCCGCT CAAACAGTTG CACGACAAAG GCGTAAAAGT GATGACTGGC ACTGACAGCG TTATCGACCA CTGGTCGCCT TATGGTCTGG GCGACATGCT GGAAAAAGCC AATCTCTACG CGCAGCTCTA TATTCGTCCT AACGAACAGA ACCTCTCCCG TTCGCTGTTT CTAGCCACTG GCGATGTATT GCCGCTGAAT GAAAAAGGCG AGCGTGTATG GCCAAAAGCG CAGGATGACG CCAGCTTTGT GCTGGTGGAC GCCTCCTGTT CCGCCGAGGC GGTGGCGCGT ATCTCGCCGA GAACCGCAAC GTTCCATAAA GGGCAACTGG TGTGGGGGAG TGTGGCAGGT TGA
|
Protein sequence | MKENNSRREF LSQSGKMVTA AALFGTSVPL AHAAVSGTTN CEANNTMKIT DPHYYLDNVL LETGFDYENG VAVQTRTARQ TVEIQNGKIV ALRENKQHPD ATLPHYDAGD KLMLPTTRDM HIHLDKTFYG GPWRSLNRPA GTTIQDMIKL EQKMLPELQP YTQERAEKLI DLLQSKGTTI ARSHCNIEPV SGLKNLQNLQ AVLARRQAGF ECEIVAFPQH GLLLSKSEPL MREAMQAGAH YVGGLDPTSV DGAMEKSLDT MFQIALDYDK GVDIHLHETT PSGVAAINYM VETVEKTPQL KGKLTISHAF ALATLNEQQV DELAHRMAAQ QISIASTVPI GTLHMPLKQL HDKGVKVMTG TDSVIDHWSP YGLGDMLEKA NLYAQLYIRP NEQNLSRSLF LATGDVLPLN EKGERVWPKA QDDASFVLVD ASCSAEAVAR ISPRTATFHK GQLVWGSVAG
|
| |