Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0354 |
Symbol | |
ID | 6147060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 366045 |
End bp | 367427 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615250 |
Product | putative deaminase |
Protein accession | YP_001742458 |
Protein GI | 170681260 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA ACAATAGCCG CCGTGAATTT CTGAGCCAGA GCGGTAAGAT GGTTACCGCC GCCGCGCTGT TTGGTACCTC AGTGCCGCTC GCCCATGCGG GGGTAGCTGG CACCCTAAAC TGCGAAGCGA ACAACACCAT GAAAATCACT GACCCGCATT ACTATCTCGA TAACGTGCTG CTGGAAACCG GTTTTGACTA CGAAAATGGC GTGGCGGTAC AGACCCGCAC GGCGCGCCAG ACCGTGGAGA TTCAGGACGG TAAAATTGTT GCCCTGCGCG AGAACAAGCA GCATCCGGAC GCCACGCTAC CGCACTATGA CGCTGGCGGT AAGCTGATGC TGCCCACCAC CCGCGACATG CATATTCATC TCGACAAAAC CTTCTACGGC GGGCCGTGGC GCTCGCTCAA TCGTCCGGCA GGCACCACTA TCCAGGACAT GATCAAACTC GAGCAGAAAA TGCTGCCGGA ACTGCAACCG TACACGCAGG AACGGGTGGA AAAACTGATC GATTTATTGC AGTCGAAAGG CACCACCATT GCCCGCAGCC ATTGCAATAT CGAACCGGTT TCCGGCCTGA AAAATCTGCA AAATTTGCAG GCGGTGCTGG CGCGACGTCA GGCGGGCTTC GAGTGTGAAA TTGTCGCCTT CCCGCAGCAC GGTTTGCTGC TGTCGAAATC GGAAGCCTTA ATGCGCGAAG CGATGCAGGC GGGGGCGCAT TACGTCGGCG GGCTGGACCC GACCAGTGTT GATGGCGCGA TGGAAAAATC CCTCGACACC ATGTTCCAGA TTGCGCTGGA CTACGACAAA GGCGTCGATA TTCACCTGCA CGAAACCACT CCGTCGGGCG TGGCAGCCAT CAATTATATG GTTGAAACGG TAGAGAAAAC GCCACAGCTG AAGGGCAAGC TGACCATCAG TCACGCCTTT GCGTTGGCAA CGCTCAACGA ACAACAGGTA GATGAACTGG CGCACCGGAT GGCGGCGCAG CAAATTTCTA TCGCCTCGAC GGTGCCGATT GACACGCTGC ATATGCCGCT CAAACAGTTG CACGACAAAG GCGTAAAAGT CATGACCGGC ACCGACAGCG TTATCGACCA CTGGTCTCCC TACGGCCTGG GCGACATGCT GGAAAAAGCC AATCTCTACG CGCAGCTCTA TATTCGTCCT AACGAACAGA ATTTGTCCCG TTCGCTGTTT TTAGCCACTG GCGATGTATT GCCGCTCAAC GAAAAAGGCG AGCGCGTGTG GCCCAAAGCG CAGGATGACG CCAGCTTTGT GCTGGTGGAC GCCTCCTGTT CCGCCGAGGC GGTGGCGCGT ATCTCGCCGA GAACCGCAAC GTTCCATAAA GGGCAACTGG TGTGGGGGAG TGTGGCAGGT TGA
|
Protein sequence | MKENNSRREF LSQSGKMVTA AALFGTSVPL AHAGVAGTLN CEANNTMKIT DPHYYLDNVL LETGFDYENG VAVQTRTARQ TVEIQDGKIV ALRENKQHPD ATLPHYDAGG KLMLPTTRDM HIHLDKTFYG GPWRSLNRPA GTTIQDMIKL EQKMLPELQP YTQERVEKLI DLLQSKGTTI ARSHCNIEPV SGLKNLQNLQ AVLARRQAGF ECEIVAFPQH GLLLSKSEAL MREAMQAGAH YVGGLDPTSV DGAMEKSLDT MFQIALDYDK GVDIHLHETT PSGVAAINYM VETVEKTPQL KGKLTISHAF ALATLNEQQV DELAHRMAAQ QISIASTVPI DTLHMPLKQL HDKGVKVMTG TDSVIDHWSP YGLGDMLEKA NLYAQLYIRP NEQNLSRSLF LATGDVLPLN EKGERVWPKA QDDASFVLVD ASCSAEAVAR ISPRTATFHK GQLVWGSVAG
|
| |