Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3288 |
Symbol | |
ID | 6066989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3599586 |
End bp | 3600869 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641602703 |
Product | cytosine deaminase |
Protein accession | YP_001726237 |
Protein GI | 170021283 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGAATA ACGCTTTACA TACAATTATT AACGCCCGGT TACCCGGCAA AGAGGGGCTG TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAAGC CGGCGTGATG CCCGTAACTG AAAACAGCCT TGATGCCGAA CAAGGTTTAG TTATACCGCC ATTTGTGGAG CCGCATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC ACGCTGTTCG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT GTGAAACAAC GCGCATGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG CGTACCCATG TCGATGTTTC GGATGCAACG CTGACTGCGC TGAAAGCAAT GCTGGAAGTG AAGCAGGAAG TCGCGCCGTG GATTGATCTG CAAATCGTCG CCTTCCCTCA GGAAGGGATT TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT GACGAGCAGT CGCGCTTTGT CGAAACTGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCCT ATAACGGGGC GTATACCTCA CGCCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT ATTCATCTGC AAGGACGTTT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG TTGATGGGCT ACGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGTGCCAGA ACGTTGAATT TGCAGGATTA CGGCATTGCC GCCGGAAACA GCGCAAACCT GATTATCCTG CCGGCTGAAA ATGGATTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGCTA TTCGGTACGT GGCGGCAAGG TGATTGCCAG CACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCA GAAGCCATCG ATTACAAACG TTGA
|
Protein sequence | MSNNALHTII NARLPGKEGL WQIHLQDGKI SAIDAQAGVM PVTENSLDAE QGLVIPPFVE PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV RTHVDVSDAT LTALKAMLEV KQEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR TLNLQDYGIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP EAIDYKR
|
| |