Gene ECH74115_4407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4407 
SymboluxaC 
ID6968269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4086044 
End bp4087456 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content54% 
IMG OID643388128 
Productglucuronate isomerase 
Protein accessionYP_002272565 
Protein GI209397829 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1904] Glucuronate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.50615 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCGT TTATGACTGA AGATTTCCTG TTAGATACCG AATTTGCCCG CCGTCTGTAT 
CACGACTACG CAAAAGACCA GCCGATTTTC GATTACCATT GCCATTTGCC GCCGCAGCAG
ATTGCGGAAG ACTATCGTTT TAAAAACCTG TATGACATCT GGCTGAAAGG CGATCACTAC
AAATGGCGCG CTATGCGTAC CAACGGTGTG GCCGAGCGTC TGTGTACCGG TGATGCGTCT
GACCGTGAAA AATTTGACGC CTGGGCGGCG ACTGTTCCGC ACACTATCGG CAACCCGTTA
TACCACTGGA CGCACCTCGA ACTGCGTCGT CCGTTTGGTA TCACTGGCAA ATTGCTTTCT
CCGTCAACTG CCGATGAAAT CTGGAACGAA TGTAACGAAC TGCTGGCGCA GGATAACTTC
TCCGCGCGCG GCATCATGCA GCAGATGAAC GTGAAAATGG TCGGCACCAC CGATGACCCG
ATCGATTCTC TGGAGCATCA CGCAGAGATC GCCAAAGACG GCTCTTTCAC CATCAAAGTG
CTGCCGAGCT GGCGTCCGGA CAAAGCCTTC AACATCGAAC AGGCGACCTT TAACGACTAC
ATGGCGAAGC TGGGCGAAGT TTCCGATACC GACATTCGCC GCTTTGCTGA CCTGCAAACT
GCCTTGACCA AACGTCTGGA TCACTTCGCC GCTCACGGCT GTAAAGTGTC TGACCATGCG
CTGGACGTGG TGATGTTTGC TGAAGCGAAC GAAGCGGAAT TGGACAGCAT TCTGGCGCGC
CGTCTGGCTG GCGAAACCCT GAGCGAGCAC GAAGTGGCAC AGTTCAAAAC TGCGGTACTG
GTGTTCCTTG GTGCCGAATA TGCACGTCGC GGCTGGGTAC AGCAGTACCA CATTGGCGCG
CTGCGTAATA ACAACCTGCG TCAGTTCAAA CTGCTGGGGC CGGATGTAGG CTTTGACTCC
ATCAACGACC GTCCGATGGC AGAAGAGTTG TCTAAGCTGC TGAGCAAGCA GAACGAAGAA
AACCTGCTGC CGAAAACCAT TCTGTACTGC CTGAACCCGC GCGATAACGA AGTGCTGGGC
ACCATGAGCG GTAACTTCCA GGGCGAAGGT ATGCCGGGCA AAATGCAGTT CGGTTCCGGC
TGGTGGTTTA ACGATCAGAA AGACGGTATG GAACGTCAGA TGACCCAACT GGCGCAGCTC
GGTTTGCTGA GCCGCTTTGT CGGTATGCTG ACTGACAGCC GTAGCTTCCT GTCATACACC
CGTCACGAAT ACTTCCGCCG CATTCTGTGC CAGATGATCG GTCGCTGGGT GGAAGCGGGC
GAAGCACCGG CGGACATCAA CCTGCTGGGC GAGATGGTGA AAAATATTTG CTTTAACAAT
GCGCGTGACT ACTTCGCCAT TGAACTGAAC TAA
 
Protein sequence
MTPFMTEDFL LDTEFARRLY HDYAKDQPIF DYHCHLPPQQ IAEDYRFKNL YDIWLKGDHY 
KWRAMRTNGV AERLCTGDAS DREKFDAWAA TVPHTIGNPL YHWTHLELRR PFGITGKLLS
PSTADEIWNE CNELLAQDNF SARGIMQQMN VKMVGTTDDP IDSLEHHAEI AKDGSFTIKV
LPSWRPDKAF NIEQATFNDY MAKLGEVSDT DIRRFADLQT ALTKRLDHFA AHGCKVSDHA
LDVVMFAEAN EAELDSILAR RLAGETLSEH EVAQFKTAVL VFLGAEYARR GWVQQYHIGA
LRNNNLRQFK LLGPDVGFDS INDRPMAEEL SKLLSKQNEE NLLPKTILYC LNPRDNEVLG
TMSGNFQGEG MPGKMQFGSG WWFNDQKDGM ERQMTQLAQL GLLSRFVGML TDSRSFLSYT
RHEYFRRILC QMIGRWVEAG EAPADINLLG EMVKNICFNN ARDYFAIELN