Gene YpsIP31758_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3050 
SymbolddhB 
ID5387088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3434682 
End bp3435857 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content46% 
IMG OID640866056 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_001402010 
Protein GI153949280 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGGTT CTGGCAGCCA ATGGATACGT TGCGTGACAA GATCTATCTG CATGAACTAT 
GGGAAGAAGG CAGGGCACCT TGGAAGGTAT GGGAATAACA AAATGATTAA TAATAGTTTC
TGGCAAGGTA AACGGGTTTT TGTAACAGGC CATACTGGGT TTAAAGGTGG CTGGTTGAGT
TTATGGTTGC AAACCATGGG GGCAACGGTA AAAGGTTACT CTCTGACCGC CCCCACTGTG
CCTAGCCTAT TTGAGACCGC ACGAGTTGCC GACGGGATGC AATCGGAAAT CGGTGATATT
CGTGATCAAA ACAAATTATT AGAATCAATC CGCGAATTCC AACCAGAGAT TGTTTTCCAC
ATGGCTGCTC AGCCACTGGT CCGTCTATCC TATTCCGAGC CTGTTGAAAC CTACTCGACG
AATGTTATGG GTACCGTTTA TTTACTGGAA GCTATTCGCC ATGTTGGTGG CGTCAAAGCG
GTGGTCAATA TCACCAGTGA TAAATGCTAC GATAATAAAG AGTGGATCTG GGGCTATCGC
GAAAATGAAG CGATGGGGGG GTATGATCCT TACTCCAACA GTAAAGGTTG TGCGGAATTA
GTGACGTCAT CCTACCGTAA TTCGTTCTTC AATCCAGCGA ACTATGGCCA GCATGGCACT
GCCGTAGCGA CAGTGCGTGC GGGTAATGTC ATCGGTGGTG GCGATTGGGC ATTGGATCGC
ATCGTTCCAG ATATTCTTCG GGCGTTTGAA CAGTCCCAAC CAGTGATTAT TCGCAACCCA
CATGCCATTC GCCCATGGCA GCATGTGTTG GAGCCTTTGT CGGGTTATTT GCTGTTGGCA
CAGAAGTTAT ATACTGACGG TGCTGAATAT GCCGAAGGTT GGAACTTTGG TCCTAACGAT
GCTGATGCTA CTCCAGTAAA AAACATTGTT GAACAAATGG TGAAATATTG GGGAGAGGGT
GCAAGCTGGC AATTAGATGG CAATGCTCAC CCTCATGAAG CTCATTATCT GAAACTGGAT
TGTTCAAAAG CTAAAATGCA ACTTGGCTGG CATCCTCGCT GGAACTTGAA TACTACGCTC
GAATATATTG TGGGCTGGCA CAAGAACTGG TTATCAGGCA CAGATATGCA TGAATACAGT
ATTACTGAAA TTAATAATTA CATGAACACT AAATGA
 
Protein sequence
MPGSGSQWIR CVTRSICMNY GKKAGHLGRY GNNKMINNSF WQGKRVFVTG HTGFKGGWLS 
LWLQTMGATV KGYSLTAPTV PSLFETARVA DGMQSEIGDI RDQNKLLESI REFQPEIVFH
MAAQPLVRLS YSEPVETYST NVMGTVYLLE AIRHVGGVKA VVNITSDKCY DNKEWIWGYR
ENEAMGGYDP YSNSKGCAEL VTSSYRNSFF NPANYGQHGT AVATVRAGNV IGGGDWALDR
IVPDILRAFE QSQPVIIRNP HAIRPWQHVL EPLSGYLLLA QKLYTDGAEY AEGWNFGPND
ADATPVKNIV EQMVKYWGEG ASWQLDGNAH PHEAHYLKLD CSKAKMQLGW HPRWNLNTTL
EYIVGWHKNW LSGTDMHEYS ITEINNYMNT K