Gene BURPS668_2232 details

Gene Information

Locus tag: BURPS668_2232
Symbol: hom
ID: 4884299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2221420
End bp: 2222748
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 68%
IMG OID: 640128160
Product: homoserine dehydrogenase
Protein accession: YP_001059267
Protein GI: 126439395
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
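
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2221420
end_bp = 2222748
protein_length_aa = 442

# Assumption: coordinates are inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop
# codon and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 1329
print(remainder)            # 0
print(expected_protein_aa)  # 442
```

Under these assumptions the computed values match the Gene Length (1329 bp) and Protein Length (442 aa) fields.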


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACCGA TCAAAGTAGG CCTGTTGGGC TTCGGCACGG TGGGTGGCGG CACCTTCAAG 
GTGCTGCGCC GCAACCAGGA GGAAATCAAG CGGCGCGCCG GGCGCGGCAT CGAGATCGCG
CGCGTCGCCG TGCGTAATCC CGCGAAGGCG CTCGCCGCGC TCGACGGCGA CGCGAACGGC
GTGTCGATCG GCGACGATTT CAACGCGGTC GTCGACGATC CGTCGATCGC CATCGTCGCC
GAGATGATCG GCGGCACGGG CCTCGCGCGC GAGCTCGTGC TGCGCGCGAT CGCGAACGGC
AAGCACGTCG TGACCGCCAA CAAGGCGCTG CTCGCCGTGC ACGGCACCGA GATCTTCGAG
GCGGCGCGCG CGAAGGGCGT GATGGTCGCG TTCGAGGCGG CCGTCGCGGG CGGCATCCCG
ATCATCAAGG CGCTGCGCGA GGGGCTCACC GCGAACCGGA TTCAGTATAT CGCGGGCATC
ATCAACGGCA CGACGAACTA CATCCTGTCG GAGATGCGCG AGCGCGGGCT CGATTTCGCG
ACGGCGCTGA AGGCCGCGCA GGAACTCGGC TACGCGGAAG CCGATCCGAC CTTCGACATC
GAGGGCGTCG ACGCCGCGCA CAAGGCGACG ATCATGAGCG CGATCGCGTT CGGCGTGCCG
GTGCAGTTCG ACCGCGCGTA TGTCGAAGGC ATCAGCCGGC TCGCCGCGAC CGACATCAAA
TACGCGGAGG AACTCGGCTA CCGGATCAAG CTGCTCGGCA TCACGCGCCG CACCGAGCGC
GGCATCGAGC TGCGCGTGCA TCCGACGCTG ATTCCGGCCA AGCGCCTGCT CGCGAACGTC
GAGGGCGCGA TGAACGCGGT CGTCGTGCAC GGCGATGCGG TCGGCACGAC GCTGTACTAC
GGCAAGGGCG CGGGCGCCGA GCCGACGGCC TCGGCCGTCG TCGCGGATCT CGTCGACGTC
ACGCGCCTGC ATACGGCGGA CCCCGAGCAC CGCGTGCCGC ACCTCGCGTT CCAGCCGGAC
AGCCTGTCGA ACACGCCGAT CCTGCCGATC GAGGAGGTGA CGAGCGGCTA TTACCTGCGC
CTGCGCGTCG CCGACCAGAC GGGCGTGCTC GCCGACATCA CGCGCATCCT CGCCGAATCG
GGCATCTCGA TCGACGCGCT GTTGCAGAAG GAATCGGAGC AGGTGGACGA TGCGAACGGC
GAGACCGACA TCATCCTCAT CACGCACGAG ACGGTCGAGA AGAACGTCAA CGCGGCGATC
GCGCGCATCG AATCGCTCGC GACCGTCGTG TCGAAGGTCA CGAAGCTGCG CATGGAAGCG
CTCAACTGA
 
Protein sequence
MEPIKVGLLG FGTVGGGTFK VLRRNQEEIK RRAGRGIEIA RVAVRNPAKA LAALDGDANG 
VSIGDDFNAV VDDPSIAIVA EMIGGTGLAR ELVLRAIANG KHVVTANKAL LAVHGTEIFE
AARAKGVMVA FEAAVAGGIP IIKALREGLT ANRIQYIAGI INGTTNYILS EMRERGLDFA
TALKAAQELG YAEADPTFDI EGVDAAHKAT IMSAIAFGVP VQFDRAYVEG ISRLAATDIK
YAEELGYRIK LLGITRRTER GIELRVHPTL IPAKRLLANV EGAMNAVVVH GDAVGTTLYY
GKGAGAEPTA SAVVADLVDV TRLHTADPEH RVPHLAFQPD SLSNTPILPI EEVTSGYYLR
LRVADQTGVL ADITRILAES GISIDALLQK ESEQVDDANG ETDIILITHE TVEKNVNAAI
ARIESLATVV SKVTKLRMEA LN
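
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned, translated, and summarised. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the amino acid string for NCBI translation table 11, with codons ordered by the bases T, C, A, G. Only the first three and the last block of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Amino acids for NCBI translation table 11, codons ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final block of the gene sequence above.
first_blocks = clean_sequence("ATGGAACCGA TCAAAGTAGG CCTGTTGGGC")
last_block = clean_sequence("CTCAACTGA")

print(translate(first_blocks))  # MEPIKVGLLG
print(translate(last_block))    # LN*
```

The two translated fragments agree with the start (MEPIKVGLLG) and end (LN) of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field.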