Gene EcHS_A3784 details


Gene Information

Locus tag: EcHS_A3784
Symbol: lyx
ID: 5595246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3776387
End bp: 3777883
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 57%
IMG OID: 640922898
Product: cryptic L-xylulose kinase
Protein accession: YP_001460376
Protein GI: 157163058
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
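
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3776387
END_BP = 3777883


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1497 498
```

Under these assumptions the result matches the listed Gene Length (1497 bp) and Protein Length (498 aa).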


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCAAT ACTGGCTGGG GTTAGATTGT GGCGGTAGCT GGCTGAAAGC CGGGCTGTAT 
GACCGCGAAG GCCGGGAGGC AGGCGTGCAG CGCCTGCCGC TGTGCGCATT AAGCCCGCAG
CCAGGCTGGG CAGAGCGCGA TATGGCAGAA CTGTGGCAAT GCTGCATGGC TGTCATTCGC
GCCCTGCTTA CTCATTCTGG TGTTAGCGGG GAACAAATTG TCGGTATCGG CATCTCCGCA
CAGGGAAAGG GCTTGTTTTT GCTGGATAAA AACGACAAAC CGCTCGGGAA TGCTATTTTG
TCCTCGGACC GCCGGGCGAT GGAAATCGTT CGTCGCTGGC AGGAAGATGG CATCCCGGAA
AAACTCTACC CGCTGACCCG ACAAACCTTG TGGACCGGGC ATCCGGTGTC GCTGTTACGC
TGGCTGAAAG AGCACGAACC AGAACGCTAC GCGCAAATTG GCTGCGTGAT GATGACGCAC
GACTACCTGC GCTGGTGTTT AACTGGCGTC AAAGGCTGTG AAGAGAGCAA TATTTCCGAG
TCCAACCTCT ACAACATGAG TCTTGGGGAA TATGACCCGT GCCTCACCGA CTGGCTGGGG
ATCGCTGAAA TCAATCACGC CCTGCCGCCT GTTGTCGGAT CTGCCGAAAT CTGCGGGGAG
ATCACCGCTC AGACAGCCGC CCTGACCGGT CTGAAAGCGG GTACGCCCGT TGTTGGCGGC
CTGTTTGATG TGGTTTCCAC CGCACTCTGC GCCGGGATCG AAGACGAATT TACCCTCAAT
GCGGTGATGG GGACCTGGGC GGTGACCAGC GGCATAACCC GCGGTTTACG TGACGGTGAA
GCGCATCCGT ATGTCTATGG TCGCTACGTT AACGATGGTG AATTTATCGT TCACGAAGCC
AGCCCTACCT CTTCCGGCAA CCTCGAATGG TTTACCGCAC AGTGGGGAGA AATCTCGTTT
GATGAGATCA ATCAGGCCGT TGCCAGCTTG CCGAAGGCTG GGGGCGATCT CTTTTTCCTG
CCGTTCCTGT ACGGCAGCAA CGCCGGACTC GAGATGACCA GTGGTTTCTA CGGGATGCAG
GCCATTCACA CCCGCGCGCA CCTGTTGCAG GCCATCTATG AAGGGGTGGT GTTCAGCCAT
ATGACCCACC TCAACCGAAT GCGCGAACGT TTTACTGATG TTCACACCCT ACGCGTCACT
GGCGGCCCGG CGCACTCCGA TGTCTGGATG CAAATGCTGG CGGACGTCAG CGGTCTGCGT
ATCGAGCTGC CGCAGGTGGA AGAAACCGGC TGCTTTGGTG CGGCCCTTGC CGCCCGCGTC
GGCACCGGGG TTTATCACAA CTTCAGCGAA GCCCAACGTG ACTTGCGACA CCCGGTGCGC
ACCCTGCTGC CAGATATGAC CGCCCATCAG CTTTACCAAA AAAAATATCA ACGTTATCAG
CATCTCATTG CCGCACTTCA GGGCTTTCAC GCCCGCATTA AGGAGCACAC ATTATGA
 
Protein sequence
MTQYWLGLDC GGSWLKAGLY DREGREAGVQ RLPLCALSPQ PGWAERDMAE LWQCCMAVIR 
ALLTHSGVSG EQIVGIGISA QGKGLFLLDK NDKPLGNAIL SSDRRAMEIV RRWQEDGIPE
KLYPLTRQTL WTGHPVSLLR WLKEHEPERY AQIGCVMMTH DYLRWCLTGV KGCEESNISE
SNLYNMSLGE YDPCLTDWLG IAEINHALPP VVGSAEICGE ITAQTAALTG LKAGTPVVGG
LFDVVSTALC AGIEDEFTLN AVMGTWAVTS GITRGLRDGE AHPYVYGRYV NDGEFIVHEA
SPTSSGNLEW FTAQWGEISF DEINQAVASL PKAGGDLFFL PFLYGSNAGL EMTSGFYGMQ
AIHTRAHLLQ AIYEGVVFSH MTHLNRMRER FTDVHTLRVT GGPAHSDVWM QMLADVSGLR
IELPQVEETG CFGAALAARV GTGVYHNFSE AQRDLRHPVR TLLPDMTAHQ LYQKKYQRYQ
HLIAALQGFH ARIKEHTL
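
The sequences above are printed in blocks of 10 characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation codons, and it stops at the first stop codon. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the example input is only the first 30 bases of the gene sequence.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases iterate as T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied as printed in the record.
first_block = clean_sequence("ATGACGCAAT ACTGGCTGGG GTTAGATTGT")
print(translate(first_block))  # prints: MTQYWLGLDC
```

The translated fragment matches the first 10 residues of the protein sequence in the record. Passing the full gene sequence to the same functions should give the complete protein and a GC fraction comparable to the listed GC content.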