Gene BURPS1710b_3475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3475 
SymbolthiL 
ID3691613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3792706 
End bp3793707 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content74% 
IMG OID637729930 
Productthiamine monophosphate kinase 
Protein accessionYP_334846 
Protein GI76809325 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCCATC CTCCCCTTTC GGAATTCTCG TTGATCGACC GCTTCTTCGC GCGCCGCGCG 
ACGGGGCCGC ACGCGCGCGC CGCGCTCGGC ATCGGCGACG ATTGCGCGCT GCTTGCACCA
GAACCGGGCA AGCTGCTGGC GGTTTCGACG GACATGCTGG TCGAAGGCCG GCACTTCCTC
GCCGATGTCG ATCCGCGCGC GCTCGGCCAC AAGACGCTCG CCGTCAATTT GTCCGATCTC
GCCGCGATGG GCGCCGCGCC GCGCGCGTTC ACGCTCGCGT GCGCGCTGCC GCGCGCCGAC
GCCGACTGGC TCGAGGCGTT TTCCGACGGC CTTTTCGCGC TCGCGGAGCG CCACGGCTGC
GAGCTGATCG GCGGCGACAC GACGAGCGGG CCGCTCAACC TGTGCGTCAC GGTGTTCGGC
GACGTCGCGT GCGGCGCCGC GTTGCGTCGA GACGCCGCAC GCGACGGCGA CGACGTCTGG
GTATCCGGCA CGCTCGGCGA TGCGCGCGCC GGCCTCGGCG TGATCCGCGG CGAATGGCGC
GCGGGCGAGC GCGAGGCGGC GGCGTTCCGG CGCGCGCTCG AATGGCCGCA ACCGCGCGTC
GCGCTCGGCG TCGCGCTCGC GGGCATCGCG CACGCGGCGC TCGACGTGTC CGACGGCCTC
GCGGGCGATC TGCCGCACAT CCTCGAGCGC TCGAACGTGC GCGCCGACGT GGACGTCGAC
GCGGTGCCGC GCTCGGCCGC GCTCGCGACC CTGCCCGCCG ACGTGCAGCG CCGCTGCATG
CTCGAAGGCG GCGACGACTA CGAGCTGTGC TTCACCGCCG CGCCGTCCGC GCGCACCGCG
ATCGACGCGG CCGGCGCACG CGCGGGCGTG GCCGTCACGC GCATCGGTAC AATACGCGGC
TTGTCCGCGC CGACGGACGC GCGCGCCGTG ACGTGGCGCG ACGCGTCGGG CGCGCCGCTT
TCCCTCACGC TGCACGGTTT CGATCATTTC CATGCCAACT GA
 
Protein sequence
MAHPPLSEFS LIDRFFARRA TGPHARAALG IGDDCALLAP EPGKLLAVST DMLVEGRHFL 
ADVDPRALGH KTLAVNLSDL AAMGAAPRAF TLACALPRAD ADWLEAFSDG LFALAERHGC
ELIGGDTTSG PLNLCVTVFG DVACGAALRR DAARDGDDVW VSGTLGDARA GLGVIRGEWR
AGEREAAAFR RALEWPQPRV ALGVALAGIA HAALDVSDGL AGDLPHILER SNVRADVDVD
AVPRSAALAT LPADVQRRCM LEGGDDYELC FTAAPSARTA IDAAGARAGV AVTRIGTIRG
LSAPTDARAV TWRDASGAPL SLTLHGFDHF HAN