Gene Dole_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1960 
Symbol 
ID5694800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2371095 
End bp2372183 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content58% 
IMG OID641264558 
ProductCDP-glucose 4,6-dehydratase 
Protein accessionYP_001529841 
Protein GI158521971 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR02622] CDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACC TTTTTCAGAA CGCGTATCAC CGCCGGAGAG TGCTGGTCAC CGGGCATACC 
GGCTTTAAAG GGTCATGGCT CTCCTTCTGG CTTTCGCAAA TGGGCGCCGA TGTGTACGGC
TACTCCCTGG CGCCCGAGAC CCGGCCCAAT CACTTCTCTC TGCTCAACCC GGGTGACGAA
ACCCCTGAAA CCGACATCCG GGATATCCGA CAAGTAATTG ACTGCTTTCA GTCCTTTCAG
CCGGAAATCG TTTTTCACCT GGCCGCTCAG TCCCTGGTGC GCCGGTCCTA CCGTGAACCC
CTCGACACCT TTGCCGCCAA TGTCATGGGC ACGGCCAACA TACTCGAGGC CTGCCGGCTG
ACAAAATCCG TGCGGGCCGT GGTGATCGTG ACCAGTGACA AGTGCTACCA GAACAATGAA
TGGGAATGGG GATACCGGGA GAGCGACCCC ATGGGCGGCC ATGACCCTTA CAGCGCCTCC
AAGGGGTGCG CGGAACTTGT CACCGCCGCT TTCCGGAATT CTTTTTTTTC TACAGGCACC
GGCCATCCGG CCCTGATGGC CACGGCCCGG GCCGGTAATG TGATCGGCGG CGGCGACTGG
GCCGAAGACC GCCTGATTCC GGACGTGGCC CGTGCTTTCA ACAAAAAAGA AACCATGAAA
ATCCGTAACC CCCATGGACT CCGTCCCTGG CAGCATGTGC TGGAGCCGCT TTCCGGATAC
CTGATGCTGG GACAACGCCT GATTGAAGGA GACCGGGGAC TTGCCGATGC CTGGAATTTT
GGGCCGTCGG AAGAAGACAC GCTTCCGGTA ATAACGCTTC TGAAACGGTT AAAAACTCAC
TGGTCCGACC TGGACTTTGA TGTGGACCAA CAGCCGGACC AGCCCCACGA GGCCGGTCTG
CTCCGGCTGG ACTCTTCTAA AGCCAGGCGG AAACTTGGCT GGCAACCGGT CTGGAACTGT
GACCAGGCCC TTGAAAGGAC CGCAGCCTGG TACCAGGCGT TTTACAACCA GGCTACGATT
CTGACCGGCG CGGATCTGGC GGCCTATATC GAGTCGGCCC GTTCAAAAGG ATTGCCATGG
GCGCAATAA
 
Protein sequence
MKNLFQNAYH RRRVLVTGHT GFKGSWLSFW LSQMGADVYG YSLAPETRPN HFSLLNPGDE 
TPETDIRDIR QVIDCFQSFQ PEIVFHLAAQ SLVRRSYREP LDTFAANVMG TANILEACRL
TKSVRAVVIV TSDKCYQNNE WEWGYRESDP MGGHDPYSAS KGCAELVTAA FRNSFFSTGT
GHPALMATAR AGNVIGGGDW AEDRLIPDVA RAFNKKETMK IRNPHGLRPW QHVLEPLSGY
LMLGQRLIEG DRGLADAWNF GPSEEDTLPV ITLLKRLKTH WSDLDFDVDQ QPDQPHEAGL
LRLDSSKARR KLGWQPVWNC DQALERTAAW YQAFYNQATI LTGADLAAYI ESARSKGLPW
AQ