Gene ECH74115_0572 details


Gene Information

Locus tag: ECH74115_0572
Symbol: ushA
ID: 6969359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 574290
End bp: 575942
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 51%
IMG OID: 643384617
Product: bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor
Protein accession: YP_002269131
Protein GI: 209397585
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
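
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


if __name__ == "__main__":
    length = gene_length(574290, 575942)
    print(length, "bp")                   # gene length from the record's coordinates
    print(protein_length(length), "aa")   # protein length implied by that gene length
```

Under these assumptions the values reproduce the Gene Length and Protein Length fields listed in the record.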


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCACTGTTAA CCACATTTAC ACTGGCGAGT 
GAAGCTGCTC TGGCGTATGA GCAGGATAAA ACCTACAAAA TTACAGTTCT GCATACCAAT
GATCATCATG GGCATTTTTG GCGCAATGAA TATGGCGAAT ATGGTCTGGC GGCGCAAAAA
ACGCTGGTGG ATGGTATCCG CAAAGAGGTT GCGGCTGAAG GCGGTAGCGT GCTGCTACTT
TCCGGTGGCG ACATTAACAC TGGCGTGCCC GAGTCTGACT TACAGGATGC CGAACCTGAT
TTTCGCGGTA TGAATCTGGT GGGCTATGAC GCGATGGCGA TCGGTAATCA TGAATTTGAT
AACCCGCTCA CCGTATTACG CCAGCAGGAA AAGTGGGCAA AGTTCCCGTT ACTTTCTGCC
AATATCTACC AGAAAAGTAC CGGCGAGCGC CTGTTTAAAC CATGGGCGCT GTTTAAGCGT
CAGGATCTGA AAATTGCCGT TATTGGTCTG ACGACGGATG ACACAGCGAA GTTAGGCAAT
CCAGAAAACT TCACCGATAT TGAGTTCCGT AAGCCCGCCG ATGAAGCGAA GCTGGTGATT
CAGGAGCTGC AACAGACAGA AAAGCCAGAC ATTATTATCG CGGCGACACA TATGGGGCAT
TACGATAACG GCGATCACGG TTCTAACGCA CCGGGCGATG TGGAGATGGC ACGCGCGTTG
CCTGCCGGAT CGCTGGCGAT GATTGTCGGC GGTCACTCGC AAGATCCAGT CTGTATGGCA
GCGGAAAATA AAAAGCAGGT AGATTACGTT CCAGGTACGC CATGCAAACC GGATCAACAA
AACGGCATCT GGATTGTACA GGCGCATGAG TGGGGCAAGT ACGTTGGTCG GGCCGACTTT
GAGTTTCGCA ACGGCGAAAT GAAAATGGTT AACTACCAGC TGATTCCGGT GAACCTGAAG
AAGAAAGTGA CCTGGGAAGA CGGGAAAAGC GAGCGCGTGC TGTACACCCC TGAAATCGCT
GAAAACCAGC AAATGATCTC GCTGTTATCG CCGTTCCAGA ACAAAGGCAA AGCGCAGCTG
GAAGTGAAAA TAGGCGAAAC CAATGGTCGT CTGGAAGGCG ATCGTGACAA AGTGCGTTTT
GTACAGACCA ATATGGGGCG GTTGATTCTG GCAGCACAAA TGGATCGCAC TGGTGCCGAC
TTTGCGGTGA TGAGCGGAGG CGGAATTCGT GATTCTATCG AAGCAGGCGA TATCAGCTAT
AAAAACGTGC TGAAAGTGCA GCCATTCGGC AATGTGGTGG TGTATGCCGA CATGACCGGT
AAAGAGGTGA TTGATTACCT GACCGCCGTC GCGCAGATGA AGCCAGATTC AGGTGCCTAC
CCGCAATTTG CCAACGTTAG CTTTGTGGCG AAAGACGGCA AACTGAACGA CCTTAAAATC
AAAGGCGAAC CGGTCGATCC AGCGAAAACT TACCGTATGG CGACATTAAA CTTCAATGCC
ACCGGCGGTG ATGGATATCC GCGCCTTGAT AACAAACCGG GCTATGTGAA TACCGGCTTT
ATTGATGCCG AAGTGCTGAA AGCGTATATC CAGAAAAGCT CGCCGCTGGA TGTGAGTGTT
TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA
 
Protein sequence
MKLLQRGVAL ALLTTFTLAS EAALAYEQDK TYKITVLHTN DHHGHFWRNE YGEYGLAAQK 
TLVDGIRKEV AAEGGSVLLL SGGDINTGVP ESDLQDAEPD FRGMNLVGYD AMAIGNHEFD
NPLTVLRQQE KWAKFPLLSA NIYQKSTGER LFKPWALFKR QDLKIAVIGL TTDDTAKLGN
PENFTDIEFR KPADEAKLVI QELQQTEKPD IIIAATHMGH YDNGDHGSNA PGDVEMARAL
PAGSLAMIVG GHSQDPVCMA AENKKQVDYV PGTPCKPDQQ NGIWIVQAHE WGKYVGRADF
EFRNGEMKMV NYQLIPVNLK KKVTWEDGKS ERVLYTPEIA ENQQMISLLS PFQNKGKAQL
EVKIGETNGR LEGDRDKVRF VQTNMGRLIL AAQMDRTGAD FAVMSGGGIR DSIEAGDISY
KNVLKVQPFG NVVVYADMTG KEVIDYLTAV AQMKPDSGAY PQFANVSFVA KDGKLNDLKI
KGEPVDPAKT YRMATLNFNA TGGDGYPRLD NKPGYVNTGF IDAEVLKAYI QKSSPLDVSV
YEPKGEVSWQ
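
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the pipeline that produced the record: it hard-codes the amino-acid assignments shared by the standard code and translation table 11 (the two differ in their permitted start codons, not in how sense codons are read), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
# These sense-codon assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases and last 33 bases of the gene sequence listed above.
    print(translate("ATGAAATTAT TGCAGCGGGG C"))
    print(translate("TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA"))
```

Applied to the start of the gene sequence, the routine yields the residues that open the protein sequence, and applied to the final 33 bases it yields the residues that close it.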