Gene EcSMS35_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3903 
Symbollyx 
ID6143850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3973155 
End bp3974651 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content57% 
IMG OID641618729 
Productcryptic L-xylulose kinase 
Protein accessionYP_001745868 
Protein GI170679902 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.783257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAT ACTGGCTGGG GTTAGATTGT GGCGGTAGCT GGCTGAAAGC CGGGCTGTAT 
GACCGCGAAG GCCGGGAGGC AGGCGTGCAG CGCCTGCCGC TGTGCGCATT AAGCCCGCAG
CCAGGCTGGG CAGAGCGCGA TATGGCAGAA CTGTGGCAAT GCTGCATGGC TGTCATTCGC
ACCCTGCTTA CTCATTCTGG TGTCAGCGGG GAGCAAATAG TGGGTATCGG TATCTCCGCA
CAGGGAAAGG GCTTGTTTTT GCTGGATAAA AACGACAAGC CGCTCGGGAA TGCTATTTTG
TCCTCGGACC GCCGGGCGAT GGAAATCGTT CGTCGCTGGC AGGAAGATGG CATCCCGGAA
AAACTCTACC CGCTGACCCG ACAAACCTTG TGGACCGGGC ATCCGGTGTC GCTGTTACGC
TGGCTGAAAG AGCACGAACC GGAACGCTAC GCGCAAATTG GCTGCGTAAT GATGACGCAC
GACTATCTGC GCTGGTGTTT AACCGGCGTC AAAGGTTGCG AAGAGAGCAA TATTTCCGAG
TCCAACCTCT ACAACATGAG TCTTGAGGAA TATGACCCGT GCCTCACCGA CTGGCTGGGG
ATCGCTGAAA TCAACCACGC CCTGCCGCCT GTTGTCGGAT CTGCCGAAAT TTGCGGGGAA
ATCACCGCTC AGACAGCCGC ACTGACCGGT CTGAAAGCGG GTACGCCCGT TGTCGGCGGC
CTGTTTGATG TGGTTTCCAC CGCACTCTGT GCCGGGATCG AAGACGAATT TACCCTCAAT
GCGGTGATGG GCACCTGGGC GGTGACCAGC GGCATAACCC GTGGTTTACG TGACGGTGAA
GCGCATCCGT ATGTCTATGG TCGCTACGTT AACGATGGTC AATTTATCGT TCACGAAGCC
AGCCCCACCT CTTCCGGCAA CCTCGAATGG TTTACCGCAC AGTGGGGAGA AATCTCGTTT
GATGAGATTA ACCAGGCCGT TGCCAGCTTG CCGAAGGCCG GGGGCGATCT CTTTTTCCTG
CCGTTCCTGT ACGGCAGCAA CGCCGGATTG GAGATGACCA GCGGTTTCTA CGGGATGCAG
GCCATTCATA CCCGCGCACA CCTGTTGCAG GCCATTTATG AAGGCGTGGT GTTCAGCCAT
ATGACCCATC TCAACCGGAT GCGCGAACGT TTTACTGATG TGCACACCCT GCGCGTCACT
GGCGGCCCGG CGCATTCCGA TGTCTGGATG CAAATGCTGG CGGACGTCAG CGGTCTGCGT
ATCGAGCTGC CGCAGGTGGA AGAAACCGGC TGCTTTGGTG CGGCACTTGC CGCCCGCGTC
GGCACCGGAG TTTATCGCGA TTTCAGCGAA GCCCAACGTG ATTTACAGCA CCCGGTACGC
ACCCTGCTGC CGGATATGAC CGCACATCAG CTTTACCAAC AAAAATACCA ACGCTATCAG
CATCTCATTG CCGCACTTGA GGGCTTTCAC GCCCGCATCA AGGAGCACAC ATTATGA
 
Protein sequence
MTQYWLGLDC GGSWLKAGLY DREGREAGVQ RLPLCALSPQ PGWAERDMAE LWQCCMAVIR 
TLLTHSGVSG EQIVGIGISA QGKGLFLLDK NDKPLGNAIL SSDRRAMEIV RRWQEDGIPE
KLYPLTRQTL WTGHPVSLLR WLKEHEPERY AQIGCVMMTH DYLRWCLTGV KGCEESNISE
SNLYNMSLEE YDPCLTDWLG IAEINHALPP VVGSAEICGE ITAQTAALTG LKAGTPVVGG
LFDVVSTALC AGIEDEFTLN AVMGTWAVTS GITRGLRDGE AHPYVYGRYV NDGQFIVHEA
SPTSSGNLEW FTAQWGEISF DEINQAVASL PKAGGDLFFL PFLYGSNAGL EMTSGFYGMQ
AIHTRAHLLQ AIYEGVVFSH MTHLNRMRER FTDVHTLRVT GGPAHSDVWM QMLADVSGLR
IELPQVEETG CFGAALAARV GTGVYRDFSE AQRDLQHPVR TLLPDMTAHQ LYQQKYQRYQ
HLIAALEGFH ARIKEHTL