Gene ECH74115_0410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0410 
SymbolcodA 
ID6967287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp416749 
End bp418032 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content53% 
IMG OID643384462 
Productcytosine deaminase 
Protein accessionYP_002268976 
Protein GI209399439 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGAATA ACGCTTTACA AACAATTATT AACGCCCGGT TGCCAGGCAA AGAGGGGCTG 
TGGCAGATTC ATCTGCAGGA CGGAAAAATC AGCGCCATTG ATGCGCAATC CGGCGTGATG
CCCATAACTG AAAACAGCCT GGATGCCGAA CAAGGTTTAG TTATACCGCC GTTTGTGGAG
CCACATATTC ACCTGGACAC CACGCAAACC GCCGGACAAC CGAACTGGAA TCAGTCCGGC
ACGCTGTTTG AAGGCATTGA ACGCTGGGCC GAGCGCAAAG CGTTATTAAC CCATGACGAT
GTGAAACAAC GCGCATGGCA AACGCTGAAA TGGCAGATTG CCAACGGCAT TCAGCATGTG
CGTACCCATG TCGATGTTTC GGATGCAACG CTAACTGCGC TGAAAGCAAT GCTGGAAGTG
AAGCTGGAAG TCGCGCCGTG GATTGATCTG CAAATCGTCG CCTTCCCTCA GGAAGGGATT
TTGTCGTATC CCAACGGTGA AGCGTTGCTG GAAGAGGCGT TACGCTTAGG GGCAGATGTA
GTGGGGGCGA TTCCGCATTT TGAATTTACC CGTGAATACG GCGTGGAGTC GCTGCATAAA
ACCTTCGCCC TGGCGCAAAA ATACGACCGT CTCATCGACG TTCACTGTGA TGAGATCGAT
GACGAGCAGT CGCGCTTTGT CGAAACCGTT GCTGCCCTGG CGCACCGTGA AGGCATGGGC
GCGCGAGTCA CCGCCAGCCA CACCACGGCA ATGCACTCTT ATAACGGGGC GTATACCTCA
CGTCTGTTCC GCTTGCTGAA AATGTCCGGT ATTAACTTTG TCGCCAACCC GCTGGTCAAT
ATTCATCTGC AAGGACGATT CGATACGTAT CCAAAACGTC GCGGCATCAC GCGCGTTAAA
GAGATGCTGG AGTCCGGCAT TAACGTCTGC TTTGGTCACG ATGATGTCTT CGATCCGTGG
TATCCGCTGG GAACGGCGAA TATGCTGCAA GTGCTGCATA TGGGGCTGCA TGTTTGCCAG
CTGATGGGCT ATGGGCAGAT TAACGATGGC CTGAATTTAA TCACCCACCA CAGCGCCAGG
ACGTTGAATT TGCAGGATTA CAGCATTGCC GCCGGAAACA GCGCCAACCT GATTATCCTG
CCGGCTGAAA ATGGATTTGA TGCGCTGCGC CGTCAGGTTC CGGTACGTTA TTCGGTACGT
GGCGGCAAGG TGATTGCCAG CACACAACCG GCACAAACCA CCGTATATCT GGAGCAGCCG
GAAGCCATCG ATTACAAACG GTGA
 
Protein sequence
MSNNALQTII NARLPGKEGL WQIHLQDGKI SAIDAQSGVM PITENSLDAE QGLVIPPFVE 
PHIHLDTTQT AGQPNWNQSG TLFEGIERWA ERKALLTHDD VKQRAWQTLK WQIANGIQHV
RTHVDVSDAT LTALKAMLEV KLEVAPWIDL QIVAFPQEGI LSYPNGEALL EEALRLGADV
VGAIPHFEFT REYGVESLHK TFALAQKYDR LIDVHCDEID DEQSRFVETV AALAHREGMG
ARVTASHTTA MHSYNGAYTS RLFRLLKMSG INFVANPLVN IHLQGRFDTY PKRRGITRVK
EMLESGINVC FGHDDVFDPW YPLGTANMLQ VLHMGLHVCQ LMGYGQINDG LNLITHHSAR
TLNLQDYSIA AGNSANLIIL PAENGFDALR RQVPVRYSVR GGKVIASTQP AQTTVYLEQP
EAIDYKR