Gene Dole_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1765 
Symbol 
ID5694604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2126385 
End bp2128004 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content61% 
IMG OID641264362 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_001529646 
Protein GI158521776 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase
[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase
[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTCCTG AAGATCTGAG AAAACACCTG CGGTTCTATT TTATCACCGA TGACAGCGGT 
GGCCCGGCGC CACTTGAACA GGCAAAAGCG GCCATTCTCG GCGGCGCCAC CATGGTTCAG
TACCGCAACA AGGCCTTTGA CGGCCGGTTT TTTGAAGAGG CCACGGCCAT TTTGCGTCTG
TGCCGGGTCA ATCAAATTCC TTTTATTGTC AATGACGACC CGGTCCTGGC ACGGGCATTG
GGTGCTGACG GCGTTCATGT GGGGCAGGCC GACGGCAGCC TGAAGACGGC ACGAAGCATC
GTGGGAAAAA ACGCGCTGGT GGGGGTGTCG GTCTCCACTC TTGACGAGCT TGCCCGGACC
CCTGTTGAGT TTTGTGATTA TATCGGCACC GGACCGGTGT TTGCCACGAG CACCAAGCCG
GACGCCAGCC CGGTGATCGG GGTGGCGGGG CTCAAGGCGG TCATCGACCG GTCGAAAAAG
CCGGTGGTGG CCATCGGCGG TATCAATGCC GCAAACGCCG CTGCCTGCTT CTCTGCCGGG
GCTGCGGGCG TGGCCGTGAT CAGTTGCGTG AGCCGTGCTG ACAGTCCCCT TGAAGACGCC
CGGTTTCTGG CAGGGGCCTG CGGTATTGAG GTTTTTTCTG AAAAGCTGAA TGTGCCGTGG
AACGATGAGT TCGGCCTGAT CGACAGGCTT CTGGCCGGGG ATAAGAAGGC CAACGCGGCA
GAAGAGGAAA TTTTGAAGGT GGGACCCGGG GATGACGCGG CCGTGCTGCA TGCCCTGAAA
ACACCGGTGA TCACCACCGA CGCCCAGGTG GAAAATGTCC ATTTCTCTTT TTCCTGGCAG
CGGCCCGGGG AGGTGGGGCA AAGGGCCGTG ACCGTGGTGT TAAGCGATCT GGCCGCCGCC
TATGCCCGTC CGGTGTCCCT GTTTGTCAAC CTGACCCTTC CGCACGACAG GCCCGAGTCT
TTGGCCATAG ACCTTTACGC GGGGCTGAAG AAAGGACTTG CCGTCTATGA TTGTGCGCTG
GGCGGTGGCA ATCTATCCGG CGGCCGGGAA GTTTCCCTGA ACCTGTTTGC CGTGGGAGAG
GCAAGGGCGC CTTTTTATCC GGCCCGCGCC AATGCCCGGC CCGGTGACGA TTTGTATTGT
ACCGGCCCCC TGGGCCGATC CAGGGCCGGG CTGCTGGCAT TGGCGGCCGG CCTGGAGGGA
TATGATTCTC TGGTCGAGGC ATTCAAGTTT CCCCGCGCCC GGTTTGACGC GGCTATCGTG
CTGGCGGATT ACAATGTGCG CTGCGTCATG GATATCAGCG ACGGCCTGGC CGGGGATGCC
CGCCATATTG CCAGGGCTTC GGGCATTACA CTCTGTTTTG ACGTGGATAC CGCCGTCTGT
TCCGATGACC TTCAGCGGTT CTGCGAGAAA ACCGGCAACC GGCCCGAAGA GATGATCTTT
TCCGGGGGTG AAGACTATGA GCTGCTGTTT GCCTGCCCAC CGGAAACGGC CCGGCGCATC
GGGGATGTCA TGCCTGTTTA CCGTCTGGGC CGCTGCCTTT CTTTTGATGG TGAATACCTG
CGCAACCTGC CTGAAGGCGT GGCCCCGTTT CAGCATGGCC ATGCCGGTTC CGGAGACTGA
 
Protein sequence
MLPEDLRKHL RFYFITDDSG GPAPLEQAKA AILGGATMVQ YRNKAFDGRF FEEATAILRL 
CRVNQIPFIV NDDPVLARAL GADGVHVGQA DGSLKTARSI VGKNALVGVS VSTLDELART
PVEFCDYIGT GPVFATSTKP DASPVIGVAG LKAVIDRSKK PVVAIGGINA ANAAACFSAG
AAGVAVISCV SRADSPLEDA RFLAGACGIE VFSEKLNVPW NDEFGLIDRL LAGDKKANAA
EEEILKVGPG DDAAVLHALK TPVITTDAQV ENVHFSFSWQ RPGEVGQRAV TVVLSDLAAA
YARPVSLFVN LTLPHDRPES LAIDLYAGLK KGLAVYDCAL GGGNLSGGRE VSLNLFAVGE
ARAPFYPARA NARPGDDLYC TGPLGRSRAG LLALAAGLEG YDSLVEAFKF PRARFDAAIV
LADYNVRCVM DISDGLAGDA RHIARASGIT LCFDVDTAVC SDDLQRFCEK TGNRPEEMIF
SGGEDYELLF ACPPETARRI GDVMPVYRLG RCLSFDGEYL RNLPEGVAPF QHGHAGSGD