Gene BURPS1106A_3477 details

Gene Information

Locus tag: BURPS1106A_3477
Symbol: thiL
ID: 4903078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3383489
End bp: 3384490
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 74%
IMG OID: 640136703
Product: thiamine monophosphate kinase
Protein accession: YP_001067714
Protein GI: 126452891
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
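
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length_bp(3383489, 3384490))  # 1002
print(protein_length_aa(1002))           # 333
```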


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.695085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCCATC CTCCCCTTTC GGAATTTTCG TTGATCGACC GCTTCTTCGC GCGCCGCGCG 
ACGGGGCCGC ACGCGCGCGC CGCGCTCGGC ATCGGCGACG ATTGCGCGCT GCTTGCACCA
GAACCGGGCA AGCTGCTGGC GGTTTCGACG GACATGCTGG TCGAAGGCCG GCACTTCCTC
GCCGATGTCG ATCCGCGCGC GCTCGGCCAC AAGACGCTCG CCGTCAATTT GTCCGATCTC
GCCGCGATGG GCGCCGCGCC GCGCGCGTTC ACGCTCGCGT GCGCGCTGCC GCGCGCCGAC
GCCGACTGGC TCGAGGCGTT TTCCGACGGC CTTTTCGCGC TCGCGGAGCG CCACGGCTGC
GAGCTGATCG GCGGCGACAC GACGAGCGGG CCGCTCAACC TGTGCGTCAC GGTGTTCGGC
GACGTCGCGT GCGGCGCCGC GTTGCGTCGA GACGCCGCAC GCGACGGCGA CGACGTCTGG
GTATCCGGCA CGCTCGGCGA TGCGCGCGCC GGCCTCGGCG TGATCCGCGG CGAATGGCGC
GCGGGCGAGC GCGAGGCGGC GGCGTTCCGG CGCGCGCTCG AATGGCCGCA ACCGCGCGTC
GCGCTCGGCG TCGCGCTCGC GGGCATCGCG CACGCGGCGC TCGACGTGTC CGACGGCCTC
GCGGGCGATC TGCCGCACAT CCTCGAGCGC TCGAACGTGC GCGCCGACGT GGACGTCGAC
GCGGTGCCGC GCTCGGCCGC GCTCGCGACC CTGCCCGCCG ACGTGCAGCG CCGCTGCATG
CTCGAAGGCG GCGACGACTA CGAGCTGTGC TTCACCGCCG CGCCGTCCGC GCGCACCGCG
ATCGACGCGG CCGGCGCGCG CGCGGGCGTG GCCGTCACGC GCATCGGTAC AATACGCGGC
TTGTCCGCGC CGACGGACGC GCGCGCCGTG ACGTGGCGCG ACGCGTCGGG CGCGCCGCTT
TCCCTCACGC TGCACGGTTT CGATCATTTC CATGCCAACT GA
 
Protein sequence
MAHPPLSEFS LIDRFFARRA TGPHARAALG IGDDCALLAP EPGKLLAVST DMLVEGRHFL 
ADVDPRALGH KTLAVNLSDL AAMGAAPRAF TLACALPRAD ADWLEAFSDG LFALAERHGC
ELIGGDTTSG PLNLCVTVFG DVACGAALRR DAARDGDDVW VSGTLGDARA GLGVIRGEWR
AGEREAAAFR RALEWPQPRV ALGVALAGIA HAALDVSDGL AGDLPHILER SNVRADVDVD
AVPRSAALAT LPADVQRRCM LEGGDDYELC FTAAPSARTA IDAAGARAGV AVTRIGTIRG
LSAPTDARAV TWRDASGAPL SLTLHGFDHF HAN
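
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The Python sketch below strips the spacing used in the listing, computes GC content, and translates a CDS. It rests on assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard code, the input begins at a valid start codon (so the first residue is read as M, as with the GTG start here), and the input contains only A, C, G and T. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code and differs in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: input starts at the initiator codon
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGCCCATC CTCCCCTTTC GGAATTTTCG"))  # MAHPPLSEFS
```

Passing the full gene sequence to translate_cds and gc_content in the same way allows a comparison against the protein sequence and the GC content reported in the record.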