Gene EcSMS35_0354 details

Gene Information

Locus tag: EcSMS35_0354
Symbol:
ID: 6147060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 366045
End bp: 367427
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 56%
IMG OID: 641615250
Product: putative deaminase
Protein accession: YP_001742458
Protein GI: 170681260
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
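
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 366045
end_bp = 367427
gene_length_bp = 1383
protein_length_aa = 460


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```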


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAA ACAATAGCCG CCGTGAATTT CTGAGCCAGA GCGGTAAGAT GGTTACCGCC 
GCCGCGCTGT TTGGTACCTC AGTGCCGCTC GCCCATGCGG GGGTAGCTGG CACCCTAAAC
TGCGAAGCGA ACAACACCAT GAAAATCACT GACCCGCATT ACTATCTCGA TAACGTGCTG
CTGGAAACCG GTTTTGACTA CGAAAATGGC GTGGCGGTAC AGACCCGCAC GGCGCGCCAG
ACCGTGGAGA TTCAGGACGG TAAAATTGTT GCCCTGCGCG AGAACAAGCA GCATCCGGAC
GCCACGCTAC CGCACTATGA CGCTGGCGGT AAGCTGATGC TGCCCACCAC CCGCGACATG
CATATTCATC TCGACAAAAC CTTCTACGGC GGGCCGTGGC GCTCGCTCAA TCGTCCGGCA
GGCACCACTA TCCAGGACAT GATCAAACTC GAGCAGAAAA TGCTGCCGGA ACTGCAACCG
TACACGCAGG AACGGGTGGA AAAACTGATC GATTTATTGC AGTCGAAAGG CACCACCATT
GCCCGCAGCC ATTGCAATAT CGAACCGGTT TCCGGCCTGA AAAATCTGCA AAATTTGCAG
GCGGTGCTGG CGCGACGTCA GGCGGGCTTC GAGTGTGAAA TTGTCGCCTT CCCGCAGCAC
GGTTTGCTGC TGTCGAAATC GGAAGCCTTA ATGCGCGAAG CGATGCAGGC GGGGGCGCAT
TACGTCGGCG GGCTGGACCC GACCAGTGTT GATGGCGCGA TGGAAAAATC CCTCGACACC
ATGTTCCAGA TTGCGCTGGA CTACGACAAA GGCGTCGATA TTCACCTGCA CGAAACCACT
CCGTCGGGCG TGGCAGCCAT CAATTATATG GTTGAAACGG TAGAGAAAAC GCCACAGCTG
AAGGGCAAGC TGACCATCAG TCACGCCTTT GCGTTGGCAA CGCTCAACGA ACAACAGGTA
GATGAACTGG CGCACCGGAT GGCGGCGCAG CAAATTTCTA TCGCCTCGAC GGTGCCGATT
GACACGCTGC ATATGCCGCT CAAACAGTTG CACGACAAAG GCGTAAAAGT CATGACCGGC
ACCGACAGCG TTATCGACCA CTGGTCTCCC TACGGCCTGG GCGACATGCT GGAAAAAGCC
AATCTCTACG CGCAGCTCTA TATTCGTCCT AACGAACAGA ATTTGTCCCG TTCGCTGTTT
TTAGCCACTG GCGATGTATT GCCGCTCAAC GAAAAAGGCG AGCGCGTGTG GCCCAAAGCG
CAGGATGACG CCAGCTTTGT GCTGGTGGAC GCCTCCTGTT CCGCCGAGGC GGTGGCGCGT
ATCTCGCCGA GAACCGCAAC GTTCCATAAA GGGCAACTGG TGTGGGGGAG TGTGGCAGGT
TGA
 
Protein sequence
MKENNSRREF LSQSGKMVTA AALFGTSVPL AHAGVAGTLN CEANNTMKIT DPHYYLDNVL 
LETGFDYENG VAVQTRTARQ TVEIQDGKIV ALRENKQHPD ATLPHYDAGG KLMLPTTRDM
HIHLDKTFYG GPWRSLNRPA GTTIQDMIKL EQKMLPELQP YTQERVEKLI DLLQSKGTTI
ARSHCNIEPV SGLKNLQNLQ AVLARRQAGF ECEIVAFPQH GLLLSKSEAL MREAMQAGAH
YVGGLDPTSV DGAMEKSLDT MFQIALDYDK GVDIHLHETT PSGVAAINYM VETVEKTPQL
KGKLTISHAF ALATLNEQQV DELAHRMAAQ QISIASTVPI DTLHMPLKQL HDKGVKVMTG
TDSVIDHWSP YGLGDMLEKA NLYAQLYIRP NEQNLSRSLF LATGDVLPLN EKGERVWPKA
QDDASFVLVD ASCSAEAVAR ISPRTATFHK GQLVWGSVAG
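
The sequences above are listed in blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo string holds only the first and last lines of the gene sequence, not the full listing.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence above, used only as a short demo.
demo = clean_sequence(
    "ATGAAAGAAA ACAATAGCCG CCGTGAATTT CTGAGCCAGA GCGGTAAGAT GGTTACCGCC \nTGA"
)
print(len(demo), round(gc_percent(demo), 1))
```

Applied to the full gene listing, the joined string should be 1383 bases long, and the resulting GC percentage can be compared against the 56% reported in the Gene Information fields.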