Gene EcHS_A0557 details

Gene Information

Locus tag: EcHS_A0557
Symbol: ushA
ID: 5591143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 570979
End bp: 572631
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 51%
IMG OID: 640919741
Product: bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor
Protein accession: YP_001457325
Protein GI: 157160007
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
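
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record; names are illustrative, not part of any API.
start_bp = 570979
end_bp = 572631

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1653 bp) and Protein Length (550 aa) fields.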


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCGCTGTTAA CCACATTTAC ACTGGCGAGT 
GAAACTGCTC TGGCGTATGA GCAGGATAAA ACCTACAAAA TTACAGTTCT GCATACCAAT
GATCATCATG GGCATTTTTG GCGCAATGAA TATGGCGAAT ATGGTCTGGC GGCGCAAAAA
ACGCTGGTGG ATGGTATCCG CAAAGAGGTT GCGGCTGAAG GCGGTAGCGT GCTGCTACTT
TCCGGTGGCG ACATTAACAC TGGCGTGCCC GAGTCTGACT TACAGGATGC CGAACCTGAT
TTTCGCGGTA TGAATCTGGT GGGCTATGAC GCGATGGCGA TCGGTAATCA TGAATTTGAT
AATCCGCTCA CCGTATTACG CCAGCAGGAA AAGTGGGCCA AGTTCCCGTT GCTTTCCGCG
AATATCTACC AGAAAAGTAC TGGCGAGCGC CTGTTTAAAC CGTGGGCGCT GTTTAAGCGT
CAGGATCTGA AAATTGCCGT TATTGGGCTG ACAACCGATG ACACAGCAAA AATTGGTAAC
CCGGAATACT TCACTGATAT CGAATTTCGT AAGCCCGCCG ATGAAGCGAA GCTGGTGATT
CAGGAGCTGC AACAGACAGA AAAGCCAGAC ATTATTATCG CGGCGACCCA TATGGGGCAT
TACGATAATG GTGAGCACGG CTCTAACGCA CCGGGCGATG TGGAGATGGC ACGCGCGCTG
CCTGCCGGAT CGCTGGCGAT GATCGTCGGT GGTCACTCGC AAGATCCGGT CTGCATGGCG
GCAGAAAACA AAAAACAGGT CGATTACGTG CCGGGTACGC CATGCAAACC AGATCAACAA
AACGGCATCT GGATTGTGCA GGCGCATGAG TGGGGCAAAT ACGTGGGACG GGCTGATTTT
GAGTTTCGTA ATGGCGAAAT GAAAATGGTT AACTACCAGC TGATTCCGGT GAACCTGAAG
AAGAAAGTGA CCTGGGAAGA CGGGAAAAGC GAGCGCGTGC TTTACACTCC TGAAATCGCT
GAAAACCAGC AAATGATCTC GCTGTTATCA CCGTTCCAGA ACAAAGGCAA AGCGCAGCTG
GAAGTGAAAA TAGGCGAAAC CAATGGTCGT CTGGAAGGCG ATCGTGACAA AGTGCGTTTT
GTACAGACCA ATATGGGGCG GTTGATTCTG GCAGCCCAAA TGGATCGCAC TGGTGCCGAC
TTTGCGGTGA TGAGCGGAGG CGGAATTCGT GATTCTATCG AAGCAGGCGA TATCAGCTAT
AAAAACGTGC TGAAAGTGCA GCCATTCGGC AATGTGGTGG TGTATGCCGA CATGACCGGT
AAAGAGGTGA TTGATTACCT GACCGCCGTC GCGCAGATGA AGCCAGATTC AGGTGCCTAC
CCGCAATTTG CCAACGTTAG CTTTGTGGCG AAAGACGGCA AACTGAACGA CCTTAAAATC
AAAGGCGAAC CGGTCGATCC GGCGAAAACT TACCGTATGG CGACATTAAA CTTCAATGCC
ACCGGCGGTG ATGGATATCC GCGCCTTGAT AACAAACCGG GCTATGTGAA TACCGGCTTT
ATTGATGCCG AAGTGCTGAA AGCGTATATC CAGAAAAGCT CGCCGCTGGA GGTGAGTGTT
TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA
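
A GC content figure like the one in the Gene Information section is the percentage of G and C bases in a sequence. The sketch below shows one way to compute it; the function name `gc_content` is hypothetical, and only the first 60-base row of the gene sequence is used to keep the example short, so its result is not the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record (spaces are ignored).
first_row = (
    "ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCGCTGTTAA CCACATTTAC ACTGGCGAGT"
)

print(f"GC content of first 60 bases: {gc_content(first_row):.1f}%")
```

Passing the full 1653-base sequence to the same function would give the whole-gene value to compare with the reported 51%.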
 
Protein sequence
MKLLQRGVAL ALLTTFTLAS ETALAYEQDK TYKITVLHTN DHHGHFWRNE YGEYGLAAQK 
TLVDGIRKEV AAEGGSVLLL SGGDINTGVP ESDLQDAEPD FRGMNLVGYD AMAIGNHEFD
NPLTVLRQQE KWAKFPLLSA NIYQKSTGER LFKPWALFKR QDLKIAVIGL TTDDTAKIGN
PEYFTDIEFR KPADEAKLVI QELQQTEKPD IIIAATHMGH YDNGEHGSNA PGDVEMARAL
PAGSLAMIVG GHSQDPVCMA AENKKQVDYV PGTPCKPDQQ NGIWIVQAHE WGKYVGRADF
EFRNGEMKMV NYQLIPVNLK KKVTWEDGKS ERVLYTPEIA ENQQMISLLS PFQNKGKAQL
EVKIGETNGR LEGDRDKVRF VQTNMGRLIL AAQMDRTGAD FAVMSGGGIR DSIEAGDISY
KNVLKVQPFG NVVVYADMTG KEVIDYLTAV AQMKPDSGAY PQFANVSFVA KDGKLNDLKI
KGEPVDPAKT YRMATLNFNA TGGDGYPRLD NKPGYVNTGF IDAEVLKAYI QKSSPLEVSV
YEPKGEVSWQ
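
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal illustration, not the pipeline that produced this record: it builds the codon table by hand (table 11 assigns the same amino acids as the standard code and differs only in which codons may start translation), and the function name `translate` is hypothetical. Only the first and last rows of the gene sequence are translated to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
# The last row starts in frame because 27 full rows of 60 bases precede it.
first_row = "ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCGCTGTTAA CCACATTTAC ACTGGCGAGT"
last_row = "TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA"

print(translate(first_row))  # MKLLQRGVALALLTTFTLAS
print(translate(last_row))   # YEPKGEVSWQ
```

Both results match the start and end of the protein sequence listed above; the terminal TAA is the stop codon and yields no residue.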