Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0361 |
Symbol | codA |
ID | 5588598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 391762 |
End bp | 393045 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640924086 |
Product | cytosine deaminase |
Protein accession | YP_001461513 |
Protein GI | 157156161 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAAATA ACGCTTTACA AACAATTATT AACGCCCGGT TACCAGGCGA AGAGGGGCTG TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAATC CGGCGTGATG CCCATAACTG AAAACAGCCT GGATGCCGAA CAAGGTTTAG TTATACCGCC GTTTGTGGAG CCACATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC ACGCTGTTTG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT GTGAAACAAC GCGCCTGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG CGTACCCATG TCGATGTTTC GGATGCAACG CTAACTGCGC TGAAAGCAAT GCTGGAAGTG AAGCAGGAAG TCGCGCCGTG GATTGATCTG CAAATTGTCG CCTTCCCTCA GGAAGGGATT TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT GACGAGCAGT CGCGCTTTGT CGAAACCGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCCT ATAACGGGGC GTATACCTCA CGCCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT ATTCATCTGC AAGGACGTTT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG TTGATGGGCT ACGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGTGCCAGA ACATTGAATT TGCAGGATTA CGGCATTGCC GCCGGAAACA GCGCAAACCT GATTATCCTG CCGGCTGAAA ATGGATTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGTTA TTCGGTACGT GGCGGCAAGG TGATTGCCAG TACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCA GAAGCCATCG ATTACAAACG GTAA
|
Protein sequence | MSNNALQTII NARLPGEEGL WQIHLQDGKI SAIDAQSGVM PITENSLDAE QGLVIPPFVE PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV RTHVDVSDAT LTALKAMLEV KQEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR TLNLQDYGIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP EAIDYKR
|
| |