Gene BURPS668_A2556 details


Gene Information

Locus tag: BURPS668_A2556
Symbol: thrB
ID: 4887885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2466138
End bp: 2467133
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 69%
IMG OID: 640132493
Product: homoserine kinase
Protein accession: YP_001063549
Protein GI: 126444279
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID: [TIGR00938] homoserine kinase, Neisseria type
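
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions about this record, not something the page states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
length = gene_length_bp(2466138, 2467133)
print(length)                              # 996
print(expected_protein_length_aa(length))  # 331
```
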


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGTTT TCACCGCAGT TTCCGACGCT GACCTCGCAC TCTGGATGCG CCACTACGAT 
CTCGGCGACG TTGTCGCGTT CCGCGGCATT CCGTCCGGCA TCGAGAACAG CAACTTCTTC
CTGACGACGA CGCGCGGCGA ATACGTGCTC ACGATCTTCG AGAACCTGAC GGCCGGGCAA
TTGCCGTTCT ACGTCCATCT GATGAGCCAT CTCGCGAAGC ACGGGGTGCC CGTGCCCGCG
CCCGTCGCGC GCGACGACGG CACGCTGTTC GGCGAGTTGC ACGGCAAGCC GGCCGCGATC
GTCACCAAGC TCGAGGGCGC GGCGCAGCTC GCGCCGGGCG TCGAGCACTG CGTCGAAGTC
GGGCAGATGC TCGCGCGCAT GCACCTCGCG GGCCGCGACT ATCCGCGGCA TCAGCCCAAC
TTGCGCAGCC TGCCGTGGTG GCGCGACACG GTGCCCGCGA TCGCGCCGTT CGTCACGGGC
GAGCAGCGCG CGCTGCTGGA AGGCGAGCTC GCGCACCAGG CCGCGTTCTT CGCATCGGAC
GATTACGCGG CGCTGCCGGA AGGCCCGTGC CATTGCGACC TGTTTCGCGA CAATGCGCTC
TTCGCGCACG CGGAGCCCGA CACCGGCCAT TCGGTGCGGC TCGGCGGCTT CTTCGATTTC
TACTTCGCCG GCTGCGACAA ATGGCTGTTC GACGTCGCGG TGACGGTCAA CGACTGGTGC
GTCGATCTGC CGACGGGCGC GCTCGACGCC GCGCGCGCCG ACGCGCTGCT GCGCGCGTAC
CAGACGGTGC GCCCGTTCAC CGCGGGCGAG CGCCGCCACT GGGGCGACAT GCTGCGCGCG
GGCGCGTACC GCTTCTGGGT ATCGCGCCTG TATGATTTCC ACCTTCCCCG CGCCGCGCAG
ATGCTCAAGC CGCACGACCC GGGCCATTTC GAACGCATCC TGCGCGAACG CATCGCGCAC
GCGGGCGCGC CCCCCGAGAC CCACGCATGC AACTGA
 
Protein sequence
MAVFTAVSDA DLALWMRHYD LGDVVAFRGI PSGIENSNFF LTTTRGEYVL TIFENLTAGQ 
LPFYVHLMSH LAKHGVPVPA PVARDDGTLF GELHGKPAAI VTKLEGAAQL APGVEHCVEV
GQMLARMHLA GRDYPRHQPN LRSLPWWRDT VPAIAPFVTG EQRALLEGEL AHQAAFFASD
DYAALPEGPC HCDLFRDNAL FAHAEPDTGH SVRLGGFFDF YFAGCDKWLF DVAVTVNDWC
VDLPTGALDA ARADALLRAY QTVRPFTAGE RRHWGDMLRA GAYRFWVSRL YDFHLPRAAQ
MLKPHDPGHF ERILRERIAH AGAPPETHAC N
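
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. The following is a rough sketch rather than the database's own method: the function names are hypothetical, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which codons may act as start codons). Whitespace and line breaks in the pasted sequence are stripped first, since the record prints the sequence in blocks of ten.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a pasted sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq_text)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq_text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq_text)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGGCCGTTT TCACCGCAGT TTCCGACGCT"))  # MAVFTAVSDA
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein sequence, Protein Length and GC content fields of this record.
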