Gene EcSMS35_3383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3383 
SymboluxaA 
ID6144124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3466541 
End bp3468028 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content54% 
IMG OID641618212 
Productaltronate dehydratase 
Protein accessionYP_001745361 
Protein GI170681891 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2721] Altronate dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATACA TCAAGATCCA TGCGCTGGAT AACGTCGCGG TTGCTTTAGC GGATTTGGCT 
GAAGGCACAG AAGTCAGTGT CGATAACCAG ACTGTTACGC TGCGCCAGGA TGTTGCTCGT
GGACATAAAT TTGCGTTAAC GGATATCGCA AAAGGGGCCA ACGTCATTAA ATATGGCCTG
CCGATTGGTT ATGCATTGGC GGATATTGCT GCGGGGGAAC ACGTTCACGC CCACAATACG
CGCACGAATC TGAGCGATCT GGATCAGTAT CGCTATCAAC CTGATTTTCA GGATCTGCCT
GCGCAAGCGG CAGATCGTGA AGTGCAGATC TATCGTCGCG CGAACGGCGA TGTCGGGGTG
CGTAATGAGC TGTGGATCCT GCCAACCGTG GGTTGTGTTA ACGGCATCGC GCGGCAGATC
CAGAACCGTT TCCTGAAAGA GACCAACAAC GCTGAAGGTA CCGACGGCGT GTTCCTCTTC
AGCCACACCT ACGGCTGCTC ACAACTGGGC GACGATCACA TCAATACTCG CACCATGCTG
CAAAACATGG TGCGCCACCC GAACGCGGGC GCAGTGCTGG TGATTGGTCT GGGCTGTGAA
AACAACCAGG TTGCCGCATT CCGTGAAACG TTGGGCGATA TCGATCCTGA ACGCGTTCAT
TTCATGATCT GCCAACAGCA GGATGATGAA ATCGAGGCCG GGATCGCGCA TTTGCATCAG
CTGTATAACG TGATGCGCAA CGATAAACGC GAGCCAGGCA AACTCAGCGA ACTGAAGTTT
GGTCTGGAGT GCGGTGGTTC TGACGGTCTT TCTGGTATTA CTGCTAACCC GATGCTGGGG
CGTTTCTCTG ACTACGTGAT TGCTAACGGT GGTACTACCG TACTGACCGA AGTGCCGGAG
ATGTTTGGCG CAGAGCAGTT GCTGATGGAC CATTGCCGCG ACGAAGCAAC GTTTGAAAAA
CTGGTCACCA TGGTCAACGA CTTCAAACAG TACTTTATTG CCCATGACCA GCCGATCTAC
GAAAACCCAT CACCGGGGAA CAAAGCGGGC GGTATCACCA CGCTGGAAGA CAAATCACTT
GGCTGTACCC AGAAAGCGGG TTCCAGTGTC GTGGTTGACG TGCTGCGTTA CGGCGAGCGT
CTGAAAACGC CGGGGCTGAA CTTGTTAAGT GCGCCGGGTA ACGATGCCGT AGCGACCAGC
GCCCTGGCGG GTGCGGGCTG CCATATGGTG CTGTTCAGTA CTGGTCGTGG TACGCCGTAT
GGTGGATTTG TGCCGACGGT GAAAATCGCC ACCAACAGTG AACTGGCGGC GAAGAAAAAA
CACTGGATCG ACTTTGACGC GGGTCAGCTG ATCCACGGTA AAGCGATGCC GCAGTTGCTG
GAAGAATTTA TCGATACCAT CGTTGAGTTT GCCAACGGTA AGCAAACCTG CAACGAGCGT
AACGACTTCC GTGAACTGGC GATCTTTAAA AGCGGCGTAA CGCTATAA
 
Protein sequence
MQYIKIHALD NVAVALADLA EGTEVSVDNQ TVTLRQDVAR GHKFALTDIA KGANVIKYGL 
PIGYALADIA AGEHVHAHNT RTNLSDLDQY RYQPDFQDLP AQAADREVQI YRRANGDVGV
RNELWILPTV GCVNGIARQI QNRFLKETNN AEGTDGVFLF SHTYGCSQLG DDHINTRTML
QNMVRHPNAG AVLVIGLGCE NNQVAAFRET LGDIDPERVH FMICQQQDDE IEAGIAHLHQ
LYNVMRNDKR EPGKLSELKF GLECGGSDGL SGITANPMLG RFSDYVIANG GTTVLTEVPE
MFGAEQLLMD HCRDEATFEK LVTMVNDFKQ YFIAHDQPIY ENPSPGNKAG GITTLEDKSL
GCTQKAGSSV VVDVLRYGER LKTPGLNLLS APGNDAVATS ALAGAGCHMV LFSTGRGTPY
GGFVPTVKIA TNSELAAKKK HWIDFDAGQL IHGKAMPQLL EEFIDTIVEF ANGKQTCNER
NDFRELAIFK SGVTL