Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0402 |
Symbol | codA |
ID | 5594590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 421674 |
End bp | 422957 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640919587 |
Product | cytosine deaminase |
Protein accession | YP_001457172 |
Protein GI | 157159854 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGAATA ACGCTTTACA AACAATTATT AACGCCCGGT TACCAGGCAA AGAGGGGCTG TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAATC CGGCGTGATG CCCATAACTG AAAACAGCCT GGATGCCGAA CAAGGTTTAG TTATACCGCC GTTTGTGGAG CCGCATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC ACGCTGTTTG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT GTGAAACAAC GCGCCTGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG CGTACCCATG TCGATGTTTC GGATGCAACG CTAACTGCGC TGAAAGCAAT GCTGGAAGTG AAGCAGGAAG TCGCGCCGTG GATTGATCTG CAAATTGTCG CCTTCCCTCA GGAAGGGATT TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT GACGAGCAGT CGCGCTTTGT CGAAACCGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCCT ATAACGGGGC GTATACCTCA CGCCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT ATTCATCTGC AAGGACGTTT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG TTGATGGGCT ACGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGTGCCAGA ACGTTGAATT TGCAGGATTA CGGCATTGCC GCCGGAAACA GCGCCAACCT GATTATCCTG CCGGCTGAAA ATGGGTTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGTTA TTCGGTACGT GGCGGCAAGG TGATTGCCAG CACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCA GAAGCCATCG ATTACAAACG GTAA
|
Protein sequence | MSNNALQTII NARLPGKEGL WQIHLQDGKI SAIDAQSGVM PITENSLDAE QGLVIPPFVE PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV RTHVDVSDAT LTALKAMLEV KQEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR TLNLQDYGIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP EAIDYKR
|
| |