Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0410 |
Symbol | codA |
ID | 6967287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 416749 |
End bp | 418032 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384462 |
Product | cytosine deaminase |
Protein accession | YP_002268976 |
Protein GI | 209399439 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGAATA ACGCTTTACA AACAATTATT AACGCCCGGT TGCCAGGCAA AGAGGGGCTG TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAATC CGGCGTGATG CCCATAACTG AAAACAGCCT GGATGCCGAA CAAGGTTTAG TTATACCGCC GTTTGTGGAG CCACATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC ACGCTGTTTG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT GTGAAACAAC GCGCATGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG CGTACCCATG TCGATGTTTC GGATGCAACG CTAACTGCGC TGAAAGCAAT GCTGGAAGTG AAGCTGGAAG TCGCGCCGTG GATTGATCTG CAAATCGTCG CCTTCCCTCA GGAAGGGATT TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT GACGAGCAGT CGCGCTTTGT CGAAACCGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCTT ATAACGGGGC GTATACCTCA CGTCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT ATTCATCTGC AAGGACGATT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG CTGATGGGCT ATGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGCGCCAGG ACGTTGAATT TGCAGGATTA CAGCATTGCC GCCGGAAACA GCGCCAACCT GATTATCCTG CCGGCTGAAA ATGGATTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGTTA TTCGGTACGT GGCGGCAAGG TGATTGCCAG CACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCG GAAGCCATCG ATTACAAACG GTGA
|
Protein sequence | MSNNALQTII NARLPGKEGL WQIHLQDGKI SAIDAQSGVM PITENSLDAE QGLVIPPFVE PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV RTHVDVSDAT LTALKAMLEV KLEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR TLNLQDYSIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP EAIDYKR
|
| |