Gene EcSMS35_0524 details

Gene Information

Locus tag: EcSMS35_0524
Symbol: ushA
ID: 6146962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 533120
End bp: 534772
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 51%
IMG OID: 641615418
Product: bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor
Protein accession: YP_001742625
Protein GI: 170682853
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
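
The start, end, gene length and protein length fields can be cross-checked against each other. The Python sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in one stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Cross-check the length fields of the gene record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the final codon is a stop codon excluded from the protein.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(533120, 534772)
print(length_bp)                  # 1653, matching "Gene Length"
print(protein_length(length_bp))  # 550, matching "Protein Length"
```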


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCACTGTTAA CCACATTTAC ACTGGCGAGT 
GAAGCTGCTC TGGCGTATGA GCAGGATAAA ACCTACAAAA TTACAGTTCT GCATACCAAT
GATCATCATG GGCATTTTTG GCGCAATGAA TATGGCGAAT ATGGTCTGGC GGCGCAAAAA
ACGCTGGTAG ATGGTATCCG CAAAGAGGTT GCGGCTGAAG GCGGTAGCGT GCTGCTACTT
TCCGGTGGTG ACATTAACAC TGGCGTGCCC GAGTCTGACT TACAGGATGC CGAACCTGAT
TTTCGCGGCA TGAATCTGGT GGGCTACGAT GCGATGGCAA TCGGTAATCA TGAATTTGAT
AACCCGCTCA CCGTATTACG CCAGCAGGAA AAGTGGGCAA AGTTCCCGTT ACTTTCTGCC
AATATCTACC AGAAAAGTAC CGGCGAGCGC CTGTTTAAAC CATGGGCGCT GTTTAAGCGT
CAGGATCTGA AAATTGCCGT TATTGGCCTG ACGACGGATG ATACAGCGAA GTTAGGCAAC
CCAGAAAACT TCACCGATAT TGAGTTCCGT AAGCCCGCCG ATGAAGCGAA GCTGGTGATT
CAGGAGCTGC AACAGACAGA AAAGCCAGAC ATTATTATCG CGGCGACCCA TATGGGGCAT
TACGACAATG GTGAGCACGG CTCTAACGCA CCGGGCGATG TGGAGATGGC GCGCGCGCTG
CCTGCCGGAT CGCTGGCGAT GATCGTTGGT GGTCACTCGC AAGATCCGGT CTGCATGGCG
GCAGAAAACA AAAAACAGGT CGATTACGTG CCAGGTACGC CATGCAAACC GGATCAACAA
AACGGCATCT GGATTGTGCA GGCGCATGAG TGGGGCAAAT ACGTGGGACG GGCTGATTTT
GAGTTTCGTA ATGGCGAAAT GAAAATGGTT AACTACCAGC TGATTCCGGT GAACCTGAAG
AAGAAAGTGA CCTGGGAAGA CGGGAAAAGC GAGCGCGTGC TTTACACTCC TGAAATCGCT
GAAAACCAGC AAATGATCTC GCTGTTATCG CCGTTCCAGA ACAAAGGCAA AGCGCAGCTG
GAAGTGAAAA TAGGCGAAAC CAATGGTCGT CTGGAAGGCG ATCGTGACAA AGTGCGTTTT
GTACAGACCA ATATGGGGCG GTTAATTCTG GCAGCCCAAA TGGATCGCAC TGGTGCTGAC
TTTGCGGTGA TGAGTGGAGG CGGAATTCGT GATTCTATCG AAGCAGGCGA TATCAGCTAT
AAAAACGTGC TGAAAGTGCA GCCATTCGGC AATGTGGTGG TGTATGCCGA CATGACCGGG
AAAGAGGTGA TTGATTACCT GACCGCCGTC GCGCAGATGA AGCCAGATTC AGGTGCCTAC
CCGCAATTTG CTAACGTTAG CTTTGTGGCG AAAGGCGGCA AACTGGACGA CCTTAAAATC
AAAGGCGAAC CGGTCGATCC GGCGAAAACT TATCGTATGG CGACATTAAA CTTCAATGCC
ACCGGCGGTG ATGGCTATCC GCGCCTTGAT AACAAACCGG GCTATGTGAA TACCGGCTTT
ATTGATGCCG AAGTGCTTAA AGCGTATATC CAGAAAAGCT CGCCGCTGGA TGTGAGTGTT
TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA
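
The gene sequence is listed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The Python sketch below shows one way to do that and to compute a GC percentage like the one reported in the record; the helper names are hypothetical, and only the first listing line is used as sample input.

```python
# Minimal sketch: clean a block-formatted sequence listing and compute GC%.
# The helper names are illustrative, not part of the database record.

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and return upper-case bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing: six blocks of 10 bases.
first_line = "ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCACTGTTAA CCACATTTAC ACTGGCGAGT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full listing to `gc_percent` gives the value to compare with the record's GC content field.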
 
Protein sequence
MKLLQRGVAL ALLTTFTLAS EAALAYEQDK TYKITVLHTN DHHGHFWRNE YGEYGLAAQK 
TLVDGIRKEV AAEGGSVLLL SGGDINTGVP ESDLQDAEPD FRGMNLVGYD AMAIGNHEFD
NPLTVLRQQE KWAKFPLLSA NIYQKSTGER LFKPWALFKR QDLKIAVIGL TTDDTAKLGN
PENFTDIEFR KPADEAKLVI QELQQTEKPD IIIAATHMGH YDNGEHGSNA PGDVEMARAL
PAGSLAMIVG GHSQDPVCMA AENKKQVDYV PGTPCKPDQQ NGIWIVQAHE WGKYVGRADF
EFRNGEMKMV NYQLIPVNLK KKVTWEDGKS ERVLYTPEIA ENQQMISLLS PFQNKGKAQL
EVKIGETNGR LEGDRDKVRF VQTNMGRLIL AAQMDRTGAD FAVMSGGGIR DSIEAGDISY
KNVLKVQPFG NVVVYADMTG KEVIDYLTAV AQMKPDSGAY PQFANVSFVA KGGKLDDLKI
KGEPVDPAKT YRMATLNFNA TGGDGYPRLD NKPGYVNTGF IDAEVLKAYI QKSSPLDVSV
YEPKGEVSWQ
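
The protein sequence follows from the gene sequence by translation with table 11, as stated in the record. The Python sketch below builds the codon table from the usual compact string in T, C, A, G base order (table 11 assigns the same amino acids to codons as the standard code) and translates until the first stop codon. It is an illustration under those assumptions, not the database's own pipeline, and it does not treat alternative start codons specially; this gene starts with ATG, so that does not matter here.

```python
# Minimal translation sketch for a bacterial CDS (translation table 11).
# Codon-to-amino-acid assignments are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon to its amino acid; "*" marks a stop codon.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a (possibly block-formatted) CDS up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene and the last 15 bases (including the stop codon).
print(translate("ATGAAATTAT TGCAGCGGGG CGTGGCGTTA"))  # MKLLQRGVAL
print(translate("GTGAGCTGGCAGTAA"))                   # VSWQ
```

The two outputs match the first 10 and the last 4 residues of the protein sequence listed above.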