Gene EcolC_3136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3136 
SymbolushA 
ID6066411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3434994 
End bp3436646 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content51% 
IMG OID641602552 
Productbifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor 
Protein accessionYP_001726086 
Protein GI170021132 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.161033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000215982 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAATTAT TGCAGCGGGG CGTGGCGTTA GCGCTGTTAA CCACATTTAC ACTGGCGAGT 
GAAACTGCTC TGGCGTATGA GCAGGATAAA ACCTACAAAA TTACAGTTCT GCATACCAAT
GATCATCATG GGCATTTTTG GCGCAATGAA TATGGCGAAT ATGGTCTGGC GGCGCAAAAA
ACGCTGGTGG ATGGTATCCG CAAAGAGGTT GCGGCTGAAG GCGGTAGCGT GCTGCTACTT
TCCGGTGGCG ACATTAACAC TGGCGTGCCC GAGTCTGACT TACAGGATGC CGAACCTGAT
TTTCGCGGTA TGAATCTGGT GGGCTATGAC GCGATGGCGA TCGGTAATCA TGAATTTGAT
AATCCGCTCA CCGTATTACG CCAGCAGGAA AAGTGGGCCA AGTTCCCGTT GCTTTCCGCG
AATATCTACC AGAAAAGTAC TGGCGAGCGC CTGTTTAAAC CGTGGGCGCT GTTTAAGCGT
CAGGATCTGA AAATTGCCGT TATTGGGCTG ACAACCGATG ACACAGCAAA AATTGGTAAC
CCGGAATACT TCACTGATAT CGAATTTCGT AAGCCCGCCG ATGAAGCGAA GCTGGTGATT
CAGGAGCTGC AACAGACAGA AAAGCCAGAC ATTATTATCG CGGCGACCCA TATGGGGCAT
TACGATAATG GTGAGCACGG CTCTAACGCA CCGGGCGATG TGGAGATGGC ACGCGCGCTG
CCTGCCGGAT CGCTGGCGAT GATCGTCGGT GGTCACTCGC AAGATCCGGT CTGCATGGCG
GCAGAAAACA AAAAACAGGT CGATTACGTG CCGGGTACGC CATGCAAACC AGATCAACAA
AACGGCATCT GGATTGTGCA GGCGCATGAG TGGGGCAAAT ACGTGGGACG GGCTGATTTT
GAGTTTCGTA ATGGCGAAAT GAAAATGGTT AACTACCAGC TGATTCCGGT GAACCTGAAG
AAGAAAGTGA CCTGGGAAGA CGGGAAAAGC GAGCGCGTGC TTTACACTCC TGAAATCGCT
GAAAACCAGC AAATGATCTC GCTGTTATCA CCGTTCCAGA ACAAAGGCAA AGCGCAGCTG
GAAGTGAAAA TAGGCGAAAC CAATGGTCGT CTGGAAGGCG ATCGTGACAA AGTGCGTTTT
GTACAGACCA ATATGGGGCG GTTGATTCTG GCAGCCCAAA TGGATCGCAC TGGTGCCGAC
TTTGCGGTGA TGAGCGGAGG CGGAATTCGT GATTCTATCG AAGCAGGCGA TATCAGCTAT
AAAAACGTGC TGAAAGTGCA GCCATTCGGC AATGTGGTGG TGTATGCCGA CATGACCGGT
AAAGAGGTGA TTGATTACCT GACCGCCGTC GCGCAGATGA AGCCAGATTC AGGTGCCTAC
CCGCAATTTG CCAACGTTAG CTTTGTGGCG AAAGACGGCA AACTGAACGA CCTTAAAATC
AAAGGCGAAC CGGTCGATCC GGCGAAAACT TACCGTATGG CGACATTAAA CTTCAATGCC
ACCGGCGGTG ATGGATATCC GCGCCTTGAT AACAAACCGG GCTATGTGAA TACCGGCTTT
ATTGATGCCG AAGTGCTGAA AGCGTATATC CAGAAAAGCT CGCCGCTGGA TGTGAGTGTT
TATGAACCGA AAGGTGAGGT GAGCTGGCAG TAA
 
Protein sequence
MKLLQRGVAL ALLTTFTLAS ETALAYEQDK TYKITVLHTN DHHGHFWRNE YGEYGLAAQK 
TLVDGIRKEV AAEGGSVLLL SGGDINTGVP ESDLQDAEPD FRGMNLVGYD AMAIGNHEFD
NPLTVLRQQE KWAKFPLLSA NIYQKSTGER LFKPWALFKR QDLKIAVIGL TTDDTAKIGN
PEYFTDIEFR KPADEAKLVI QELQQTEKPD IIIAATHMGH YDNGEHGSNA PGDVEMARAL
PAGSLAMIVG GHSQDPVCMA AENKKQVDYV PGTPCKPDQQ NGIWIVQAHE WGKYVGRADF
EFRNGEMKMV NYQLIPVNLK KKVTWEDGKS ERVLYTPEIA ENQQMISLLS PFQNKGKAQL
EVKIGETNGR LEGDRDKVRF VQTNMGRLIL AAQMDRTGAD FAVMSGGGIR DSIEAGDISY
KNVLKVQPFG NVVVYADMTG KEVIDYLTAV AQMKPDSGAY PQFANVSFVA KDGKLNDLKI
KGEPVDPAKT YRMATLNFNA TGGDGYPRLD NKPGYVNTGF IDAEVLKAYI QKSSPLDVSV
YEPKGEVSWQ