Gene SbBS512_E4698 details


Gene Information

Locus tag: SbBS512_E4698
Symbol:
ID: 6272819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 4389615
End bp: 4391162
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 57%
IMG OID: 641728463
Product: hypothetical protein
Protein accession: YP_001882858
Protein GI: 187733105
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
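
The length fields above are consistent with the coordinates, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The following is a minimal sketch of that arithmetic; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4389615, 4391162)
print(length)                     # 1548
print(protein_length_aa(length))  # 515
```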


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000112402
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGACC ATACAATGAA GAAAAACCCC GTAAGTATAC CACACACCGT CTGGTACGCC 
GACGATATCC GCCGCGGAGA ACGCGAGGCG GCAGATGTGC TGGGGCTCAC ACTCTATGAG
CTGATGCTTC GCGCTGGCGA GGCCGCATTC CAGGTGTGTC GTTCGGCGTA TCCTGACGCC
CGCCACTGGC TGGTGCTGTG CGGTCATGGT AATAACGGCG GCGATGGCTA CGTGGTCGCG
CGACTGGCCA AAGCGGTCGG CATTGAGGTC ACGTTGTTTG CCCAGGAGAG CGACAAACCG
TTGCCGGAAG AGGCCGCGCT GGCACGCGAA GCATGGTTAA ACGCGGGTGG CGAGATCCAT
GCTTCGAATA TTGTCTGGCC CGAATCGGTA GATCTGATTG TTGATGCGCT GCTCGGTACC
GGTTTGCGGC AAGCGCCCCG CGAATCCATT AGCCAGTTAA TCGACCACGC TAATTCCCAT
CCTGCGCCGA TTGTGGCGGT TGATATCCCT TCCGGCCTGC TGGCTGAAAC TGGCGCTACG
CCAGGCGCGG TGATCAACGC CGATCACACC ATCACTTTTA TTGCGCTGAA ACCAGGCTTG
CTCACTGGAA AAGCGCGGGA TGTTACCGGA CAACTGCATT TTGACTCACT GGGGCTGGAT
AGTTGGCTGG CAGGTCAGGA GACGAAAATT CAGCGGTTTT CAGCAGAACA ACTTTCTCAC
TGGCTAAAAC CGCGTCGCCC GACTTCGCAT AAAGGCGATC ACGGGCGGCT GGTAATTATC
GGTGGCGATC ACGGCACGGC GGGGGCTATT CGTATGACGG GGGAAGCGGC GCTGCGTGCT
GGTGCTGGTT TAGTCCGAGT ACTGACCCGC AGTGAAAACA TTGCGCCGCT GCTGACTGCA
CGACCGGAAT TGATGGTGCA TGAACTGACG ATGGACTCTC TTACCGAAAG CCTGGAATGG
GCCGATGTGG TGGTGATTGG TCCCGGTCTG GGCCAGCAAG AGTGGGGGAA AAAAGCACTG
CAAAAAGTTG AGAATTTTCG CAAACCGATG TTGTGGGATG CCGATGCATT GAACCTGCTG
GCAATCAATC CCGATAAGCG TCACAATCGC GTGATCACGC CGCATCCTGG CGAGGCCGCA
CGGTTGTTAG GCTGTTCCGT CGCTGAAATT GAAAGTGACC GCTTACATTG CGCCAAACGT
CTGGTACAAC GTTATGGCGG CGTAGCGGTG CTGAAAGGTG CCGGAACCGT GGTCGCCGCC
CATCCTGACG CTTTAGGCAT TATTGATGCC GGAAATGCAG GCATGGCGAG CTGCGGCATG
GGCGATGTGC TCTCTGGTAT TATTGGCGCA TTGCTTGGGC AAAAACTGTC GCCGTATGAT
GCCGCCTGTG CGGGCTGTGT CGCGCACGGT GCGGCAGCTG ACGTACTGGC GGCGCGTTTT
GGAACGCGCG GGATGCTGGC AACCGATCTC TTTTCCACGC TACAGCGTAT TGTTAACCCG
GAAGTGACTG ATAAAAACCA TGATGAATCG AGTAATTCCG CTCCCTGA
 
Protein sequence
MTDHTMKKNP VSIPHTVWYA DDIRRGEREA ADVLGLTLYE LMLRAGEAAF QVCRSAYPDA 
RHWLVLCGHG NNGGDGYVVA RLAKAVGIEV TLFAQESDKP LPEEAALARE AWLNAGGEIH
ASNIVWPESV DLIVDALLGT GLRQAPRESI SQLIDHANSH PAPIVAVDIP SGLLAETGAT
PGAVINADHT ITFIALKPGL LTGKARDVTG QLHFDSLGLD SWLAGQETKI QRFSAEQLSH
WLKPRRPTSH KGDHGRLVII GGDHGTAGAI RMTGEAALRA GAGLVRVLTR SENIAPLLTA
RPELMVHELT MDSLTESLEW ADVVVIGPGL GQQEWGKKAL QKVENFRKPM LWDADALNLL
AINPDKRHNR VITPHPGEAA RLLGCSVAEI ESDRLHCAKR LVQRYGGVAV LKGAGTVVAA
HPDALGIIDA GNAGMASCGM GDVLSGIIGA LLGQKLSPYD AACAGCVAHG AAADVLAARF
GTRGMLATDL FSTLQRIVNP EVTDKNHDES SNSAP
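
The gene and protein sequences can be cross-checked against each other. The sketch below strips the 10-character grouping, translates codon by codon, and computes the GC fraction. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch does not handle; the gene here starts with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First seven codons of the gene sequence above.
print(translate("ATGACGGACC ATACAATGAA G"))  # MTDHTMK
# Last three codons, ending in the TGA stop codon.
print(translate("GCTCCCTGA"))                # AP
```

Pasting the full gene sequence into `translate` and comparing the result with the protein sequence (after `clean_sequence`) is one way to confirm the two listings agree.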