Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3299 |
Symbol | |
ID | 6066353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3614534 |
End bp | 3615916 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641602715 |
Product | putative deaminase |
Protein accession | YP_001726248 |
Protein GI | 170021294 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA GCAATAGCCG CCGTGAATTT CTGAGCCAGA GCGGTAAGAT GGTCACCGCC GCCGCGCTGT TTGGTACCTC TGTGCCGCTC GCCCATGCGG CGGTAGCTGG CACCCTAAAC TGCGAAGCGA ACAACACCAT GAAAATCACT GACCCGCATT ACTATCTCGA TAACGTGCTG CTGGAAACCG GTTTTGACTA CGAAAATGGC GTGGCGGTGC AGACCCGCAC GGCGCGCCAG ACCGTGGAGA TTCAGGACGG CAAAATTGTC GCCCTGCGCG AGAACAAGCT GCATCCGGAC GCCACGCTGC CGCACTATGA CGCTGGCGGT AAGCTGATGC TGCCCACCAC CCGCGACATG CATATTCATC TCGACAAAAC CTTTTACGGC GGGCCGTGGC GCTCGCTCAA TCGTCCGGCA GGCACCACCA TCCAGGACAT GATCAAACTC GAGCAGAAAA TGCTGCCGGA ACTGCAACCG TACACTCAGG AGCGGGCAGA AAAACTGATT GATTTATTGC AGTCGAAAGG CACCACCATT GCCCGCAGCC ACTGCAATAT CGAACCGGTT TCCGGCCTGA AAAATCTGCA AAATTTGCAG GCGGTGCTGG CGCGACGTCA GGCGGGCTTT GAGTGTGAAA TCGTCGCCTT CCCGCAGCAC GGTTTGCTGC TGTCGAAATC TGAACCTTTA ATGCGTGAAG CGATGCAGGC GGGGGCGCAT TACGTCGGCG GCCTGGACCC GACCAGTGTT GATGGCGCGA TGGAAAAATC CCTCGACACC ATGTTCCAGA TTGCGCTGGA CTACGACAAA GGCGTCGATA TTCACCTGCA CGAAACCACT CCGGCAGGCG TGGCAGCCAT CAATTATATG GTTGAAACGG TAGAGAAAAC GCCACAGCTG AAGGGCAAGC TGACCATCAG TCACGCCTTT GCGCTGGCAA CGCTCAACGA GCAACAGGTA GATGAACTGG CGAACCGGAT GGTGGTGCAA CAAATTTCTA TTGCCTCGAC GGTGCCGATT GGCACGCTGC ATATGCCGCT CAAACAGTTG CACGACAAAG GCGTAAAAGT GATGACTGGC ACTGACAGCG TTATCGACCA CTGGTCGCCT TATGGTCTGG GCGACATGCT GGAAAAAGCC AATCTGTACG CGCAGCTCTA TATTCGTCCT AACGAACAGA ACCTCTCCCG TTCGCTGTTT CTAGCCACTG GCGATGTATT GCCGCTGAAT GAAAAAGGCG AGCGTGTATG GCCAAAAGCG CAGGATGACG CCAGCTTTGT GCTGGTGGAC GCCTCCTGTT CTGCCGAGGC GGTGGCGCGT ATCTCGCCGA GAACAGCAAC GTTCCATAAA GGGCAACTGG TGTGGGGGAG TGTGGCAGGT TGA
|
Protein sequence | MKESNSRREF LSQSGKMVTA AALFGTSVPL AHAAVAGTLN CEANNTMKIT DPHYYLDNVL LETGFDYENG VAVQTRTARQ TVEIQDGKIV ALRENKLHPD ATLPHYDAGG KLMLPTTRDM HIHLDKTFYG GPWRSLNRPA GTTIQDMIKL EQKMLPELQP YTQERAEKLI DLLQSKGTTI ARSHCNIEPV SGLKNLQNLQ AVLARRQAGF ECEIVAFPQH GLLLSKSEPL MREAMQAGAH YVGGLDPTSV DGAMEKSLDT MFQIALDYDK GVDIHLHETT PAGVAAINYM VETVEKTPQL KGKLTISHAF ALATLNEQQV DELANRMVVQ QISIASTVPI GTLHMPLKQL HDKGVKVMTG TDSVIDHWSP YGLGDMLEKA NLYAQLYIRP NEQNLSRSLF LATGDVLPLN EKGERVWPKA QDDASFVLVD ASCSAEAVAR ISPRTATFHK GQLVWGSVAG
|
| |