Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2232 |
Symbol | hom |
ID | 4884299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2221420 |
End bp | 2222748 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640128160 |
Product | homoserine dehydrogenase |
Protein accession | YP_001059267 |
Protein GI | 126439395 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCGA TCAAAGTAGG CCTGTTGGGC TTCGGCACGG TGGGTGGCGG CACCTTCAAG GTGCTGCGCC GCAACCAGGA GGAAATCAAG CGGCGCGCCG GGCGCGGCAT CGAGATCGCG CGCGTCGCCG TGCGTAATCC CGCGAAGGCG CTCGCCGCGC TCGACGGCGA CGCGAACGGC GTGTCGATCG GCGACGATTT CAACGCGGTC GTCGACGATC CGTCGATCGC CATCGTCGCC GAGATGATCG GCGGCACGGG CCTCGCGCGC GAGCTCGTGC TGCGCGCGAT CGCGAACGGC AAGCACGTCG TGACCGCCAA CAAGGCGCTG CTCGCCGTGC ACGGCACCGA GATCTTCGAG GCGGCGCGCG CGAAGGGCGT GATGGTCGCG TTCGAGGCGG CCGTCGCGGG CGGCATCCCG ATCATCAAGG CGCTGCGCGA GGGGCTCACC GCGAACCGGA TTCAGTATAT CGCGGGCATC ATCAACGGCA CGACGAACTA CATCCTGTCG GAGATGCGCG AGCGCGGGCT CGATTTCGCG ACGGCGCTGA AGGCCGCGCA GGAACTCGGC TACGCGGAAG CCGATCCGAC CTTCGACATC GAGGGCGTCG ACGCCGCGCA CAAGGCGACG ATCATGAGCG CGATCGCGTT CGGCGTGCCG GTGCAGTTCG ACCGCGCGTA TGTCGAAGGC ATCAGCCGGC TCGCCGCGAC CGACATCAAA TACGCGGAGG AACTCGGCTA CCGGATCAAG CTGCTCGGCA TCACGCGCCG CACCGAGCGC GGCATCGAGC TGCGCGTGCA TCCGACGCTG ATTCCGGCCA AGCGCCTGCT CGCGAACGTC GAGGGCGCGA TGAACGCGGT CGTCGTGCAC GGCGATGCGG TCGGCACGAC GCTGTACTAC GGCAAGGGCG CGGGCGCCGA GCCGACGGCC TCGGCCGTCG TCGCGGATCT CGTCGACGTC ACGCGCCTGC ATACGGCGGA CCCCGAGCAC CGCGTGCCGC ACCTCGCGTT CCAGCCGGAC AGCCTGTCGA ACACGCCGAT CCTGCCGATC GAGGAGGTGA CGAGCGGCTA TTACCTGCGC CTGCGCGTCG CCGACCAGAC GGGCGTGCTC GCCGACATCA CGCGCATCCT CGCCGAATCG GGCATCTCGA TCGACGCGCT GTTGCAGAAG GAATCGGAGC AGGTGGACGA TGCGAACGGC GAGACCGACA TCATCCTCAT CACGCACGAG ACGGTCGAGA AGAACGTCAA CGCGGCGATC GCGCGCATCG AATCGCTCGC GACCGTCGTG TCGAAGGTCA CGAAGCTGCG CATGGAAGCG CTCAACTGA
|
Protein sequence | MEPIKVGLLG FGTVGGGTFK VLRRNQEEIK RRAGRGIEIA RVAVRNPAKA LAALDGDANG VSIGDDFNAV VDDPSIAIVA EMIGGTGLAR ELVLRAIANG KHVVTANKAL LAVHGTEIFE AARAKGVMVA FEAAVAGGIP IIKALREGLT ANRIQYIAGI INGTTNYILS EMRERGLDFA TALKAAQELG YAEADPTFDI EGVDAAHKAT IMSAIAFGVP VQFDRAYVEG ISRLAATDIK YAEELGYRIK LLGITRRTER GIELRVHPTL IPAKRLLANV EGAMNAVVVH GDAVGTTLYY GKGAGAEPTA SAVVADLVDV TRLHTADPEH RVPHLAFQPD SLSNTPILPI EEVTSGYYLR LRVADQTGVL ADITRILAES GISIDALLQK ESEQVDDANG ETDIILITHE TVEKNVNAAI ARIESLATVV SKVTKLRMEA LN
|
| |