Gene SeHA_C3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3997 
Symbol 
ID6492343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3874054 
End bp3875550 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content61% 
IMG OID642744098 
ProductL-xylulose/3-keto-L-gulonate kinase 
Protein accessionYP_002047703 
Protein GI194449029 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.970359 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATT ACTGGCTGGG GTTAGATTGT GGTGGGAGTT GGCTAAAAGC CGGGTTGTAC 
GATGGCGCAG GCCGGGAAGT AGCGGTGCAA CGCCTGCCGC TGCACGCTTT AAGCCCGCAG
CCAGGCTGGG TTGAACGCGA TATGACCGAA CTGTGGCAAC AGTGCGGCTC GGTCATCAGC
AAACTGCTGG CGCACACGGG GGTGAGCGGC TCACAAATCC GCGGTCTGGG TATTTCCGCT
CAGGGTAAGG GCCTGTTCCT GTTAGATAAA AGCGATCGGC CATTAGGTAA AGCGATACTC
TCTTCCGACC GTCGCGCCAT GGAAATTGTC CAGCGCTGGC AAAAAGAAGC GGTTCCGCAA
AAACTCTACC CGCTGACTCG GCAAACCCTG TGGACCGGGC ATCCGGTCTC CCTTTTACGC
TGGGTAAAAG AGAATGAGCC GCAGCGCTAC GCGCAGATAG GCTGCGTCAT GATGACGCAT
GACTATCTGC GCTGGTGCTT AACCGGCGTG AAAGGCTGTG AGGAGAGCAA CATCTCCGAG
TCCAACCTCT ACAACATGGC GACGGGCCAG TACGACCCGC TTCTGACCGA GTGGCTGGGC
ATCAGTGAAA TCGACAGCGC GCTGCCCCCC GTGGTGGGTT CAGCCGAAAT CTGCGGGGAG
ATCACCGCTC AGGCAGCCGC CATCACCGGT CTGGCGGTGG GTACCCCCGT CGTCGGCGGC
CTGTTTGATG TGGTTTCCAC CGCCCTTTGC GCCGGTATTG AGGATGAATC AACGCTCAAT
GCGGTGATGG GTACCTGGGC CGTCACCAGC GGCATCGCTC ACGGTCTGCG CGACCATGAG
GCCCATCCTT ACGTCTATGG CCGCTACGTC AATGACGGGC AGTATATCGT TCACGAAGCC
AGCCCGACCT CCTCCGGCAA CCTCGAATGG TTTACCGCCC AGTGGGGCGA CCTCTCTTTT
GACGAGATCA ACCAGGCGGT CGCCAGCCTG CCGAAAGCCG GTAGCGACCT CTTTTTTCTG
CCGTTTCTCT ACGGCAGCAA TGCCGGGCTG GAGATGACCT GCGGCTTTTA CGGCATGCAG
GCGCTGCACA CCCGCGCCCA CCTGCTGCAG GCGATTTATG AAGGCGTGGT GTTCAGCCAT
ATGACCCACC TCAACCGCAT GCGTGAACGC TTTACCGACG TTTGCGCCCT GCGCGTTACC
GGCGGCCCGG CCCACTCCGA CGTCTGGATG CAGATGCTGG CGGACGTCAG CGGTTTACGC
ATCGAGCTGC CGCAGGTGGA GGAGACCGGC TGCTTCGGCG CGGCGCTGGC TGCCCGCGTC
GGCACCGGCG TATATCGCGA TTTCCGCGAG GCCCAACGCG ACCTGCAGCA CCCGGTGCGC
ACGCTGCTGC CGGACATGAC CGCACACGCC CTCTACCAGC GCAAATACCG CCAATACCAG
GATTTGATTG AAGCACTACA GGGCTATCAC GCCCGTATTA AGGAGCACGC ATTATGA
 
Protein sequence
MSNYWLGLDC GGSWLKAGLY DGAGREVAVQ RLPLHALSPQ PGWVERDMTE LWQQCGSVIS 
KLLAHTGVSG SQIRGLGISA QGKGLFLLDK SDRPLGKAIL SSDRRAMEIV QRWQKEAVPQ
KLYPLTRQTL WTGHPVSLLR WVKENEPQRY AQIGCVMMTH DYLRWCLTGV KGCEESNISE
SNLYNMATGQ YDPLLTEWLG ISEIDSALPP VVGSAEICGE ITAQAAAITG LAVGTPVVGG
LFDVVSTALC AGIEDESTLN AVMGTWAVTS GIAHGLRDHE AHPYVYGRYV NDGQYIVHEA
SPTSSGNLEW FTAQWGDLSF DEINQAVASL PKAGSDLFFL PFLYGSNAGL EMTCGFYGMQ
ALHTRAHLLQ AIYEGVVFSH MTHLNRMRER FTDVCALRVT GGPAHSDVWM QMLADVSGLR
IELPQVEETG CFGAALAARV GTGVYRDFRE AQRDLQHPVR TLLPDMTAHA LYQRKYRQYQ
DLIEALQGYH ARIKEHAL