Gene Dshi_3868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3868 
Symbol 
ID5714397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp80804 
End bp81883 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content69% 
IMG OID641276781 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_001542077 
Protein GI159046406 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.955537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGA CCATCCTTCT GACCGGCGGC GCGGGCTATA TCGGCTCGCA CACCTACGTG 
GCGCTGAAGG CGGCCGGGTT CGAGGTGGTG ATCCTGGACG ATTTCTCCAA TGCCGCCCGC
GATGTGCCCG ACCGGCTGGA GCTGATCACC GGCGCGCCGG TGCGGCTTTA TGAAGGCTCG
GTTCTGGACC GGGGCCTGCT GGCGCGGCTC TTCACCGAGA CCCGGATCGA CGCGGTGGTC
CATTTCGCCG CGCGCAAGGC GGTGGGCGAA AGCGTGGCGA TGCCGCTGGC GTATTTCGAG
ACCAACTGCA CGGGCCTCGT GGGCCTGTTG CAGGAAATGG AGGCCGCTTG CGTCCACCGG
CTGGTGTTTT CCTCCTCGGC CACGGTCTAC GGCATCCCCG ATGTCACCCC GACGCCCGAG
ACCGCGCCCC ACCGGCACAT GAACCCCTAC GGGCTGACCA AGATCACCGG GGAGCTGATC
CTCGACGCGC TCGCGACGTC GGACCCGAAA TGGGCCTTCG GCACCTTGCG CTATTTCAAC
CCCGCGGGCG CGCACGGCTC GGCGCTGATC GGGGAGGATC CGTCGGACAT CCCCAACAAC
CTGATGCCCT ACATCGCCCA GGTCGCCATG GGCCAGCGCC CCCATCTGCA GGTCTTCGGC
GATGACTATC CGACGCCCGA CGGCACCGGC GTGCGCGACT ACATCCATGT GGAGGATCTG
GCCGAGGGCC ATGTGCTGTC GCTGAAATCC CTGCTGGAGA CCGGCGAGAG CCACCTGGTC
AACCTCGGCA CCGGGCGGGG ATATTCCGTG CTGGAGATGG TCGCGGCCTA CTCGGCGGCC
TGCGGGCGCG CCCTGCCCTA CCGCATCGTG GACCGCCGCC CGGGCGACGT GCCGATCTAT
TGCGCCACGG TGGAGCGTGC CCGCGCGCTG CTGGGGTTCG AGGCGAAACG GGACCTGGCG
CAGATGTGCG CGAGCAGCTG GGCCTGGATC CAGGCCCGGG CGCAGGCCAA TGCCGCCGGG
CGCCCCACGC CCCCCCGCCA GGACGGCCCG TACCGGCCCC GAGGCACACC TTCCGCCTAA
 
Protein sequence
MTQTILLTGG AGYIGSHTYV ALKAAGFEVV ILDDFSNAAR DVPDRLELIT GAPVRLYEGS 
VLDRGLLARL FTETRIDAVV HFAARKAVGE SVAMPLAYFE TNCTGLVGLL QEMEAACVHR
LVFSSSATVY GIPDVTPTPE TAPHRHMNPY GLTKITGELI LDALATSDPK WAFGTLRYFN
PAGAHGSALI GEDPSDIPNN LMPYIAQVAM GQRPHLQVFG DDYPTPDGTG VRDYIHVEDL
AEGHVLSLKS LLETGESHLV NLGTGRGYSV LEMVAAYSAA CGRALPYRIV DRRPGDVPIY
CATVERARAL LGFEAKRDLA QMCASSWAWI QARAQANAAG RPTPPRQDGP YRPRGTPSA