Gene Dole_1010 details

Gene Information

Locus tag: Dole_1010
Symbol:
ID: 5693845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1188981
End bp: 1190030
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 57%
IMG OID: 641263607
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_001528897
Protein GI: 158521027
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
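
The start and end coordinates, the gene length, and the protein length listed above agree with one another. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1188981, 1190030)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```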


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.626547
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAAAAA GTGATTTGCA GAAGAAAACG ATTCTTGTCA CCGGCGGCGC CGGTTTTATC 
GGTACCAATT TTATCTATCA CGCCCTGGAT GCAAGTTTCC AGTGTCGGAT TGTTAACCTG
GATGCGCTTT TGTGTGGCGG CAATGCCTCC AATCTTGACC GGCTGCCGGA CCCTGCCAGG
TCGCGATACC GGTTTGTGCG CGGTAAGGTT CAGGACGGTG CCCTGCTGGA CCGGCTTTTT
GCCGAAGAAC AGTTTGCCGG TGTGTTTCAT TTCGCGGCCC AGACCCACGT GGACCGCTCC
ATCACCGATC CCGGGGATTT TGTTGAATCC AATGTGGTGG GGACTTTTCG CCTGCTGGAT
ACATGTCTCA AGTACTGGCG CCGGGGTGCT CTGGATCCTG ACTTTCGCAT GGTTCATGTC
TCCACGGACG AGGTGTATGG CAGCCTGGGC AGTGAAGGGC GCTTCTCCGA AACCAGCCCT
TACGATCCCT CCAGCCCCTA CTCTGCGTCA AAGGCGGGTT CGGACCATCT GGTCAAATCC
TATGTGCGGA CCTATGGATT GCCGGCCATG GTGACCAACT GCTCCAACAA TTTTGGCCCC
TACCAGTATC CTGAAAAACT GATTCCCCTG ATGATTGCCA GTATTCTAAA CGAAGAACCG
CTCCCGGTTT ACGGGGATGG CAAAAATGTC CGGGACTGGC TCTACGTGCT GGACCACTGC
GAGGCCCTGA TGCGGGTGTT TGAGGCCGGC CGGCCGGGGG AGAGTTACAA CATCGGCGGA
GGACAGGAGT ATGAGAACAT CGAACTGGTA CACATGCTTT GCGACCTGGT GGACACCCGG
CTGGGACGTC CCGAGGCGCA AAGCCGTCGT CTGGTCCGGT TTGTCACCGA CCGGCCGGGC
CATGACCGGC GGTACGCCAT TGACGCATCA AAAATCAAAC ACGCCCTGGA CTGGAGCCCC
CGGCACGATT TTACCCGGGC CCTGGACCAG ACCGTGACAT GGTACCTGAG CAACCGGCAG
TGGCTGACAG GCGGCAAGCA GGATCAATAA
 
Protein sequence
MVKSDLQKKT ILVTGGAGFI GTNFIYHALD ASFQCRIVNL DALLCGGNAS NLDRLPDPAR 
SRYRFVRGKV QDGALLDRLF AEEQFAGVFH FAAQTHVDRS ITDPGDFVES NVVGTFRLLD
TCLKYWRRGA LDPDFRMVHV STDEVYGSLG SEGRFSETSP YDPSSPYSAS KAGSDHLVKS
YVRTYGLPAM VTNCSNNFGP YQYPEKLIPL MIASILNEEP LPVYGDGKNV RDWLYVLDHC
EALMRVFEAG RPGESYNIGG GQEYENIELV HMLCDLVDTR LGRPEAQSRR LVRFVTDRPG
HDRRYAIDAS KIKHALDWSP RHDFTRALDQ TVTWYLSNRQ WLTGGKQDQ
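
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. The Python sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here only as a small sample.
sample = clean_sequence("ATGGTAAAAA GTGATTTGCA")
print(len(sample))         # 20
print(gc_content(sample))  # 0.3
```

Passing all the gene sequence lines through `clean_sequence` first gives the value for the whole gene.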