Gene ECH74115_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1030 
Symbol 
ID6967936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1041320 
End bp1042750 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID643385043 
ProductNAD dependent epimerase/dehydratase family protein 
Protein accessionYP_002269543 
Protein GI209395955 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCAAC GCATTTTAGT TCTTGGTGCC AGTGGCTACA TTGGTCAGCA TCTGGTGCGC 
ACACTCAGCC AGCAAGGGCA TCAGATCCTG GCGGCGGCAC GTCATGTCGA CAGGCTTGCA
AAGCTGCAAC TGGCAAACGT CAGTTGCCAT AAAGTCGATC TCAGCTGGCC GGATAACCTT
CCGGCCCTGT TACAGGATAT CGATACGGTC TATTTTCTGG TGCATAGCAT GGGCGAAGGC
GGCGATTTTA TCGCTCAGGA GCGCCAGGTG GCTCTCAACT TCCGCGATGC GCTACGTGAA
GTACCAGTTA AGCAATTAAT CTTTCTCAGT TCGTTGCAGG CCCCGCCACA TGAGCAGTCG
GACCATCTGC GCGCGCGTCA GGCTACGGCG GACATTCTTC GTGAAGCGAA TGTACCAGTG
ACCGAATTGC GTGCCGGAAT AATCGTTGGT GCAGGTTCAG CAGCGTTCGA AGTCATGCGC
GATATGGTCT ACAACCTGCC AGTGTTAACG CCGCCACGCT GGGTACGTTC ACGCACCACG
CCCATCGCGC TGGAAAACTT GCTGCACTAT CTGGTGGCGC TGTTAGACCA TCCAGCCAAC
GAACACCGCA TCTTCGAAGC CGCCGGACCA GAGGTGCTCA GTTATCAGCA ACAGTTTGAA
CATTTTATGG CGGTGAGCGG TAAGCGCCGC TGGTTGATCC CCATCCCCCT CCCCACCCGC
TGGATTTCGG TGTGGTTTCT CAATGTGATT ACTTCCGTAC CGCCCACCAC CGCCAGGGCG
TTGATTCAGG GGCTGAAACA CGATCTGCTG GCGGATGACA CCGCGCTACG TGCGCTCATC
CCACAACGGC TGATCGCTTT CGATGACGCG GTACGTCGCA CCCTGAAAGA AGAAGAAAAG
CTGGTCAACT CCAGCGACTG GGGATACGAC ACTCAGGCCT TTGCCCGCTG GCGACCAGAG
TATGGTTATT TTGCCAAACA GGCGGGATTT ACCGTTAAAA CGTCCGCCAG CCTTGCGGCT
TTATGGCAGG TGGTGAACCA AATCGGCGGT AAAGAGCGTT ATTTCTTTGG CAATATTTTG
TGGCAGACAC GGGCGTTGAT GGACCGTGCG ATCGGTCATA AATTAGCGAA AGGCCGTCCG
GAGCGCGAAT ATTTGCAAAC TGGCGATGCG GTGGATAGCT GGAAAGTGAT TGTCGTTGAA
CCGGAAAAAC AACTTACGTT GTTATTTGGC ATGAAAGCAC CGGGGCTGGG ACGACTGTGT
TTTACCCTGG AAGATAAAGG CGACTATCGT ACTATCGATG TCCGCGCTTT CTGGCATCCG
CACGGTATGC CGGGGCTGTT TTACTGGTTA TTGATGATCC CCGCGCATCT GTTTATTTTT
CGCGGAATGG CAAAACGAAT CGCCAGACTG GCAGAACAAA GCACAGATTA A
 
Protein sequence
MPQRILVLGA SGYIGQHLVR TLSQQGHQIL AAARHVDRLA KLQLANVSCH KVDLSWPDNL 
PALLQDIDTV YFLVHSMGEG GDFIAQERQV ALNFRDALRE VPVKQLIFLS SLQAPPHEQS
DHLRARQATA DILREANVPV TELRAGIIVG AGSAAFEVMR DMVYNLPVLT PPRWVRSRTT
PIALENLLHY LVALLDHPAN EHRIFEAAGP EVLSYQQQFE HFMAVSGKRR WLIPIPLPTR
WISVWFLNVI TSVPPTTARA LIQGLKHDLL ADDTALRALI PQRLIAFDDA VRRTLKEEEK
LVNSSDWGYD TQAFARWRPE YGYFAKQAGF TVKTSASLAA LWQVVNQIGG KERYFFGNIL
WQTRALMDRA IGHKLAKGRP EREYLQTGDA VDSWKVIVVE PEKQLTLLFG MKAPGLGRLC
FTLEDKGDYR TIDVRAFWHP HGMPGLFYWL LMIPAHLFIF RGMAKRIARL AEQSTD