Gene EcSMS35_3384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3384 
SymboluxaC 
ID6146085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3468043 
End bp3469455 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content54% 
IMG OID641618213 
Productglucuronate isomerase 
Protein accessionYP_001745362 
Protein GI170683343 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1904] Glucuronate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.666142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCGT TTATGACTGA AGATTTCCTG TTAGATACCG AATTTGCCCG CCGTCTGTAT 
CACGACTACG CAAAAGACCA GCCGATTTTC GATTACCATT GCCATTTGCC GCCGCAGCAG
ATTGCGGAAG ACTATCGTTT TAAAAACCTG TATGACATCT GGTTGAAGGG CGATCACTAC
AAATGGCGCG CTATGCGTAC CAACGGTGTG GCCGAGCGTC TGTGCACCGG TGATGCGTCT
GACCGTGAAA AATTTGACGC CTGGGCGGCG ACTGTTCCGC ATACTATCGG CAACCCGTTA
TACCACTGGA CGCACCTCGA ACTGCGTCGT CCGTTTGGTA TCACTGGCAA ATTGCTTTCT
CCGTCAACTG CCGATGAAAT CTGGAACGAA TGTAACGAAT TGCTGGCGCA GGATAATTTC
TCTGCACGCG GCATCATGCA GCAGATGAAC GTGAAAATGG TCGGCACCAC CGATGACCCG
ATCGATTCTC TGGAGCATCA CGCAGAGATC GCCAAAGATG GCTCTTTCAC CATTAAAGTG
CTGCCGAGCT GGCGTCCGGA CAAAGCCTTT AACATTGAAC AGGCGACCTT TAACGACTAC
ATGGCGAAGC TGGGCGAAGT TTCCGATACC GACATTCGCC GCTTTGCTGA CCTGCAAACA
GCCCTGACTA AACGTCTGGA TCACTTCGCC GCTCACGGCT GTAAAGTGTC TGACCACGCG
CTGGATGTAG TGATGTTTGC TGAAGCGAAC GAAGCGGAAC TGGACAGCAT CCTCGCGCGC
CGTCTGGCAG GCGAAACCCT GAGCGAGCAC GAAGTGGCAC AGTTCAAAAC TGCGGTGCTG
GTGTTCCTCG GCGCTGAATA CGCACGTCGC GGCTGGGTAC AGCAGTACCA TATTGGCGCA
CTGCGTAATA ACAACCTGCG TCAGTTTAAA CTGCTGGGGC CGGATGTCGG CTTTGACTCC
ATCAACGACC GTCCGATGGC GGAAGAGCTG TCTAAGCTGC TGAGCAAGCA GAACGAAGAA
AACCTGCTGC CGAAAACCAT TCTCTACTGC CTGAACCCGC GCGATAACGA AGTGCTGGGC
ACCATGATCG GTAACTTCCA GGGCGAAGGT ATGCCGGGCA AAATGCAGTT CGGTTCCGGC
TGGTGGTTTA ACGACCAGAA AGACGGTATG GAACGTCAGA TGACCCAACT GGCGCAGCTC
GGTCTGCTGA GCCGCTTTGT CGGTATGCTG ACTGACAGCC GTAGCTTCCT GTCATACACC
CGTCACGAAT ACTTCCGCCG CATTCTGTGC CAGATGATCG GTCGCTGGGT GGAAGCAGGC
GAAGCACCGG CGGACATCAA CCTGCTGGGC GAGATGGTGA AAAATATTTG CTTTAACAAT
GCGCGTGACT ACTTCGCCAT TGAACTGAAC TAA
 
Protein sequence
MTPFMTEDFL LDTEFARRLY HDYAKDQPIF DYHCHLPPQQ IAEDYRFKNL YDIWLKGDHY 
KWRAMRTNGV AERLCTGDAS DREKFDAWAA TVPHTIGNPL YHWTHLELRR PFGITGKLLS
PSTADEIWNE CNELLAQDNF SARGIMQQMN VKMVGTTDDP IDSLEHHAEI AKDGSFTIKV
LPSWRPDKAF NIEQATFNDY MAKLGEVSDT DIRRFADLQT ALTKRLDHFA AHGCKVSDHA
LDVVMFAEAN EAELDSILAR RLAGETLSEH EVAQFKTAVL VFLGAEYARR GWVQQYHIGA
LRNNNLRQFK LLGPDVGFDS INDRPMAEEL SKLLSKQNEE NLLPKTILYC LNPRDNEVLG
TMIGNFQGEG MPGKMQFGSG WWFNDQKDGM ERQMTQLAQL GLLSRFVGML TDSRSFLSYT
RHEYFRRILC QMIGRWVEAG EAPADINLLG EMVKNICFNN ARDYFAIELN