Gene SeD_A4442 details

Gene Information

Locus tag: SeD_A4442
Symbol: rhaB
ID: 6875041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4284260
End bp: 4285729
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 55%
IMG OID: 642787360
Product: rhamnulokinase
Protein accession: YP_002217971
Protein GI: 198245223
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID: [TIGR02627] rhamnulokinase
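
The coordinates and lengths in this record should agree with one another: the inclusive span from Start bp to End bp gives the gene length, and a coding sequence of that length encodes one fewer amino acid than it has codons, because the final codon is a stop. A minimal Python sketch of that arithmetic, using the values from this record (the function name `check_cds_lengths` is only illustrative and not part of any database tool):

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length, and protein length agree.

    Assumes 1-based inclusive coordinates and an unspliced CDS that
    ends with a single stop codon.
    """
    # Inclusive span on the replicon.
    span = end_bp - start_bp + 1
    if span != gene_length_bp:
        return False
    # A complete CDS is a whole number of codons.
    if gene_length_bp % 3 != 0:
        return False
    # The last codon is a stop, so it adds no amino acid.
    codons = gene_length_bp // 3
    return codons - 1 == protein_length_aa


# Values copied from the Gene Information fields.
print(check_cds_lengths(4284260, 4285729, 1470, 489))  # True
```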


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTC GCCATTGTGT CGCGGTTGAT CTCGGCGCAT CCAGCGGGCG CGTGATGCTG 
GCGCGTTACG ACAGCAAACA CCGTACCCTG ACGCTACGTG AAATTCACCG TTTTGTGAAC
TGCCTACAAA AAACAGACGG GTTTGACACC TGGGACATCG ACAGTCTGGA AAAGGATATC
CGTCTTGGTC TGAAGAAAGT CTGCAATGAA GGCATTCTTA TCGACAGCAT CAGCATCGAC
ACCTGGGGCG TCGATTATGT CCTGCTGGAT AAACAAGGTC AGCGTGTCGG CCTGCCGGTC
TCCTACCGCG ATAACCGTAC CACGGGCATC ATGCCGCAAG CGCTGGTCCA GATCGGCAAA
AGCGAAATCT ATCGCCGCAG CGGGATTCAG TTTCTGCCGT TTAACACCAT CTATCAGCTA
CACGCGCTGA CGAAACAACA GCCTGAGTTA ACGGCGCAGG TCGCTCATGC GCTGCTGATG
CCCGATTATT TCAGTTACCG CCTGACCGGT GAAATGAACT GGGAATACAC CAACGCCACG
ACCACCCAGT TGGTCAATAT CAATACCGAT GACTGGGACG ATACCCTGCT GGCGTGGACT
GGCGCGAAAA AGAGCTGGTT CGGTCGCCCC TCGCACCCTG GCAATGTTAT CGGCGACTGG
ATTTGCCCGC AGGGCAACCG TATTCCGGTG GTAGCCGTCG CCAGCCACGA TACCGCCAGC
GCCGTTATTG CCTCTCCGCT GGCAAATAAA CATAGCGCTT ACCTGTCTTC AGGCACCTGG
TCATTGATGG GTTTTGAAAG CAAAAAACCC TACACCACTG ACGAGGCGCT GGCCGCCAAT
ATCACCAACG AAGGCGGCGC GGAAGGGCGT TATCGGGTAC TGAAAAACAT TATGGGTTTG
TGGCTGCTCC AGCGTGTGCT GAAAGAACGG CGCATTACCG ATCTGCCTGC GCTTATCGCC
CAAACAGAAG CGTTGCCGGC CTGCCGTTTC CTGATTAACC CGAATGACGA TCGCTTTATC
AATCCTGACG ATATGCGCGC TGAAATCCAG GCCGCCTGTC GCGAGACCGA CCAGCCCGTT
CCCGTCAGCG ATGCCGAACT GGCGCGCTGC ATTTTCGACA GTCTGGCGCT GTTGTATGCC
GACATTCTGC ACGAACTGGC AAATCTGCGC GGCGAAAAAT TTACCCAACT GCATATTGTC
GGCGGCGGAT GCCAAAACGC GCTACTCAAC CAGTTGTGCG CCGATGCATG TGGCATTCGC
GTGATGGCCG GGCCAGTTGA AGCCTCCACC CTTGGCAATA TTGGTATTCA GCTTATGACC
CTCGACGAAT TAAACAACGT CGATGACTTC CGTCAGGTCG TTAGCGCTAA CTACGACCTG
ACAACCTATA TCCCTAATCC TGATAGTGAA ATTGCCCGCC ACGTTGCGCA GTTTCAACCC
AAACGACAGA CAAAGGAGCT TTGCGCATGA
 
Protein sequence
MTFRHCVAVD LGASSGRVML ARYDSKHRTL TLREIHRFVN CLQKTDGFDT WDIDSLEKDI 
RLGLKKVCNE GILIDSISID TWGVDYVLLD KQGQRVGLPV SYRDNRTTGI MPQALVQIGK
SEIYRRSGIQ FLPFNTIYQL HALTKQQPEL TAQVAHALLM PDYFSYRLTG EMNWEYTNAT
TTQLVNINTD DWDDTLLAWT GAKKSWFGRP SHPGNVIGDW ICPQGNRIPV VAVASHDTAS
AVIASPLANK HSAYLSSGTW SLMGFESKKP YTTDEALAAN ITNEGGAEGR YRVLKNIMGL
WLLQRVLKER RITDLPALIA QTEALPACRF LINPNDDRFI NPDDMRAEIQ AACRETDQPV
PVSDAELARC IFDSLALLYA DILHELANLR GEKFTQLHIV GGGCQNALLN QLCADACGIR
VMAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSANYDL TTYIPNPDSE IARHVAQFQP
KRQTKELCA
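
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, under these assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ in which start codons they permit); it treats the space-separated blocks of ten bases as layout only; and the helper names `clean`, `translate`, and `gc_fraction` are made up for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence (both start in frame).
first_line = "ATGACTTTTC GCCATTGTGT CGCGGTTGAT CTCGGCGCAT CCAGCGGGCG CGTGATGCTG"
last_line = "AAACGACAGA CAAAGGAGCT TTGCGCATGA"

print(translate(first_line))  # MTFRHCVAVDLGASSGRVML
print(translate(last_line))   # KRQTKELCA
```

The two printed fragments correspond to the first 20 and the last 9 residues of the protein sequence listed above. Passing the whole gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein and the reported GC content.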