Gene Dole_2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2193 
Symbol 
ID5695039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2666832 
End bp2667842 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content61% 
IMG OID641264797 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001530074 
Protein GI158522204 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGTT CAAAACAGAT GGCCCAGCCC GTGCTGGTGA CCGGGGCCAC CGGCTTTATC 
GGCAGCCAGG TGGTTCACAA GCTGCTGGAG CAGGATATGG CGGTCAAGGC ACTGGTGCTG
CCGGACGAAG CCCTGCCCGC TGCCTGGGGC GACCGGGTGG AAGTGGTACG GGGCGGCATC
TCGGAGTCCG GGGCTGTGGC AAAGGCCGTT TCCGGAGCCG GGACCATCAT TCATCTGGCC
GCGGTGGTGT CGGACTGGGG GGATGAAAAA AAATACTGGG AGTTTACCGT GGAGGGCAGC
CGCCTGGTGT TTGAACAGGC CGCAAAAACC GGAGCCCGGG TGGTGCTGGT CTCCAGTGTG
GTGGTGTACG GCGACAATGT CCGCAAGCAA GTGTGTCACG AAGATGTGGG TTACGGAAAA
ACCTTTGGCC CTTACAGCCG CACCAAGCAG GCCCAGGAAA AGCTGGCATG GGAGTACCAC
AGGAAAAAGA ACCTGGCCCT GACCGTGGTG CGGCCCGGCA ATGTCTACGG ACCGCGTTCC
GGCCCCTGGC TTCATGACGT GGTCAATGTT TTACGCAGCG GCGCGCCGGG TCTTATCTCC
GGCGGCAACA TGAACGCCGG CCTTGCCTAC GTGGACAACG TGGCCGACCT GTTCCTCCTG
GCCGGGGCCA GTGACACGGC CCTGGGCCGG GCCTACAACG CCGCCGACGG AACTAAAGTC
ACCTGGCGCC GTTATTTTGA GGACATCGCC GCCATGATCG GCGCGAAAAA ACCGGGATCC
GTACCCCGGC CGGCGGCGGC CCTGAGCGCC TTTGTATTTG AAAAAACATG GAAGCTCTTC
GGCATTCAGA AACGGCCGCC CGTGACCCGG GACGCTCTGA ACCTGGTGGG ATCGGACAAC
CGCTTTCCCA TTGACCGGGC CAGGAAAGAA CTGGGCTATG CGCCAAAGGT CTCTTATGAA
GAGGGGCTGA AGCGGATTCG GGAGTATATT GATAAGGAAA GTATCCGATA A
 
Protein sequence
MNSSKQMAQP VLVTGATGFI GSQVVHKLLE QDMAVKALVL PDEALPAAWG DRVEVVRGGI 
SESGAVAKAV SGAGTIIHLA AVVSDWGDEK KYWEFTVEGS RLVFEQAAKT GARVVLVSSV
VVYGDNVRKQ VCHEDVGYGK TFGPYSRTKQ AQEKLAWEYH RKKNLALTVV RPGNVYGPRS
GPWLHDVVNV LRSGAPGLIS GGNMNAGLAY VDNVADLFLL AGASDTALGR AYNAADGTKV
TWRRYFEDIA AMIGAKKPGS VPRPAAALSA FVFEKTWKLF GIQKRPPVTR DALNLVGSDN
RFPIDRARKE LGYAPKVSYE EGLKRIREYI DKESIR