Gene ECH74115_2335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2335 
Symboladd 
ID6971220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2208157 
End bp2209158 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content53% 
IMG OID643386209 
Productadenosine deaminase 
Protein accessionYP_002270693 
Protein GI209395849 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1816] Adenosine deaminase 
TIGRFAM ID[TIGR01430] adenosine deaminase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATA CCACCCTGCC ATTAACTGAT ATCCATCGCC ACCTTGATGG CAACATTCGT 
CCCCAGACCA TTCTTGAACT TGGCCGCCAG TATAATATCT CGCTTCCTGC ACAATCCCTG
GAAACACTGA TTCCCCACGT TCAGGTCATT GCCAACGAAC CCGATCTGGT GAGCTTTCTG
ACCAAACTTG ACTGGGGCGT TAAAGTTCTC GCCTCTCTTG ATGCTTGTCG CCGCGTGGCA
TTTGAAAACA TTGAAGATGC AGCCCGTAAC GGCCTGCACT ATGTCGAGCT GCGTTTTTCA
CCAGGCTACA TGGCAATGGC ACATCAGCTG CCTGTTGCAG GTGTTGTTGA AGCGGTGATC
GATGGCGTAC GTGAAGGTTG CCGCACCTTT GGTGTGCAGG CGAAGCTTAT CGGCATTATG
AGCCGAACCT TCGGCGAAGC CGCCTGTCAG CAAGAGCTGG AGGCCTTTTT AGCCCACCGT
GACCAGATTA CCGCACTTGA TTTAGCCGGT GATGAACTTG GTTTCCCGGG AAGTCTGTTT
CTTTCTCACT TCAACCGCGC GCGTGATGCG GGCTGGCATA TTACCGTCCA TGCAGGCGAA
GCTGCCGGGC CGGAAAGCAT CTGGCAGGCG ATTCGTGAAC TGGGGGCAGA GCGCATTGGA
CATGGCGTAA AAGCCATTGA AGATCGGGCG CTGATGGATT TTCTCGCCGA GCAACAAATT
GGTATTGAAT CCTGTCTGAC CTCCAATATT CAGACCAGCA CCGTGGCAGA GCTGGCTGCA
CATCCGCTGA AAACGTTCCT TGAGCATGGC ATTCGTGCCA GCATTAACAC TGACGATCCC
GGCGTACAGG GAGTGGATAT CATTCACGAA TATACCGTTG CCGCGCCAGC TGCTGGGTTA
TCCCGCGAGC AAATCCGCCA GGCACAGATT AATGGTCTGG AAATGGCTTT CCTCAACGCA
GAGGAAAAAC GCGCACTGCG AGAAAAAGTC GCAGCGAAGT AA
 
Protein sequence
MIDTTLPLTD IHRHLDGNIR PQTILELGRQ YNISLPAQSL ETLIPHVQVI ANEPDLVSFL 
TKLDWGVKVL ASLDACRRVA FENIEDAARN GLHYVELRFS PGYMAMAHQL PVAGVVEAVI
DGVREGCRTF GVQAKLIGIM SRTFGEAACQ QELEAFLAHR DQITALDLAG DELGFPGSLF
LSHFNRARDA GWHITVHAGE AAGPESIWQA IRELGAERIG HGVKAIEDRA LMDFLAEQQI
GIESCLTSNI QTSTVAELAA HPLKTFLEHG IRASINTDDP GVQGVDIIHE YTVAAPAAGL
SREQIRQAQI NGLEMAFLNA EEKRALREKV AAK