Gene ECH74115_0389 details


Gene Information

Locus tag: ECH74115_0389
Symbol:
ID: 6966850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 396595
End bp: 397977
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 56%
IMG OID: 643384441
Product: putative deaminase
Protein accession: YP_002268956
Protein GI: 209396661
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
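
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so, but the numbers are consistent with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 396595
end_bp = 397977
gene_length_bp = 1383
protein_length_aa = 460

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
residues = codons - 1

print(span, codons, remainder, residues)
```

Under these assumptions the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.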


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.776675
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAA ATAATAGCCG CCGTGAATTT CTGAGCCAGA GCGGTAAGAT GGTCACCGCC 
GCCGCGCTGT TTGGTACCTC TGTGCCGCTC GCCCATGCGG CGGTCTCTGG CACCACAAAC
TGCGAAGCGA ACAACACCAT GAAAATCACT GACCCGCATT ACTATCTCGA CAACGTGCTG
CTGGAAACCG GTTTTGACTA CGAAAATGGC GTGGCGGTGC AGACCCGCAC GGCGCGCCAG
ACCGTGGAGA TTCAGAACGG CAAAATTGTC GCGCTGCGCG AGAATAAGCA GCACCCGGAT
GCCACGCTGC CGCACTATGA CGCTGGCGAT AAGCTGATGC TGCCCACCAC CCGCGACATG
CATATTCATC TCGACAAAAC GTTTTACGGC GGGCCGTGGC GCTCGCTCAA TCGCCCGGCA
GGCACCACCA TCCAGGACAT GATCAAACTC GAGCAGAAAA TGCTGCCGGA GCTGCAACCG
TACACTCAGG AGCGGGCAGA AAAACTGATT GATTTATTGC AGTCGAAAGG CACCACCATT
GCCCGCAGCC ACTGCAATAT CGAACCGGTT TCCGGCCTGA AAAATCTGCA AAATTTGCAG
GCGGTGCTGG CGCGACGTCA GGCGGGCTTT GAGTGTGAAA TCGTCGCCTT CCCGCAGCAC
GGTTTGCTGC TGTCGAAATC GGAACCCTTA ATGCGTGAAG CAATGCAGGC GGGTGCGCAT
TACGTCGGCG GGCTGGACCC GACCAGTGTC GATGGCGCGA TGGAAAAATC CCTCGACACC
ATGTTCCAGA TTGCGCTGGA CTACGACAAA GGCGTCGATA TTCACCTGCA CGAAACCACT
CCGTCGGGCG TGGCAGCCAT CAATTATATG GTTGAAACGG TAGAGAAAAC GCCTCAACTG
AAAGGTAAGC TGACCATCAG CCACGCCTTT GCGCTGGCTA CGCTCAACGA GCAACAGGTA
GATGAACTGG CGCATCGCAT GGCGGCGCAG CAAATTTCTA TCGCCTCGAC GGTGCCGATT
GGCACGCTGC ATATGCCGCT CAAACAGTTG CACGACAAAG GCGTAAAAGT GATGACTGGC
ACTGACAGCG TTATCGACCA CTGGTCGCCT TATGGTCTGG GCGACATGCT GGAAAAAGCC
AATCTCTACG CGCAGCTCTA TATTCGTCCT AACGAACAGA ACCTCTCCCG TTCGCTGTTT
CTAGCCACTG GCGATGTATT GCCGCTGAAT GAAAAAGGCG AGCGTGTATG GCCAAAAGCG
CAGGATGACG CCAGCTTTGT GCTGGTGGAC GCCTCCTGTT CCGCCGAGGC GGTGGCGCGT
ATCTCGCCGA GAACCGCAAC GTTCCATAAA GGGCAACTGG TGTGGGGGAG TGTGGCAGGT
TGA
 
Protein sequence
MKENNSRREF LSQSGKMVTA AALFGTSVPL AHAAVSGTTN CEANNTMKIT DPHYYLDNVL 
LETGFDYENG VAVQTRTARQ TVEIQNGKIV ALRENKQHPD ATLPHYDAGD KLMLPTTRDM
HIHLDKTFYG GPWRSLNRPA GTTIQDMIKL EQKMLPELQP YTQERAEKLI DLLQSKGTTI
ARSHCNIEPV SGLKNLQNLQ AVLARRQAGF ECEIVAFPQH GLLLSKSEPL MREAMQAGAH
YVGGLDPTSV DGAMEKSLDT MFQIALDYDK GVDIHLHETT PSGVAAINYM VETVEKTPQL
KGKLTISHAF ALATLNEQQV DELAHRMAAQ QISIASTVPI GTLHMPLKQL HDKGVKVMTG
TDSVIDHWSP YGLGDMLEKA NLYAQLYIRP NEQNLSRSLF LATGDVLPLN EKGERVWPKA
QDDASFVLVD ASCSAEAVAR ISPRTATFHK GQLVWGSVAG
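
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is illustrative, not the pipeline that produced this record: it builds the codon table from the NCBI base order `TCAG`, using the amino acid assignments that translation table 11 shares with the standard code, and translates only the first and last three 10-base blocks of the gene sequence. The names `translate`, `first_blocks` and `last_blocks` are hypothetical.

```python
# Amino acid assignments for translation table 11 (identical to the standard
# code), listed in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence (bases 1-30).
first_blocks = "ATGAAAGAAA ATAATAGCCG CCGTGAATTT"

# Last three 10-base blocks plus the final codon (bases 1351-1383, in frame).
last_blocks = "GGGCAACTGG TGTGGGGGAG TGTGGCAGGT TGA"

print(translate(first_blocks))
print(translate(last_blocks))
```

The first call should reproduce the opening ten residues of the protein sequence, and the second its closing ten residues followed by `*` for the stop codon. Passing the full gene sequence to `translate` would extend the same check to the whole record.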