Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1178 |
Symbol | |
ID | 3847725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1323974 |
End bp | 1324783 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637840850 |
Product | short chain dehydrogenase |
Protein accession | YP_441725 |
Protein GI | 83719365 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGATC GCCCGAAAGG GAGCGGCGCG GTGAACCGGC TCGCGGGCAA GGTCGCCCTC GTGACGGGCG CGGGACGCGG CATCGGCGCG GCGATCGCGC GTGCGTTCGC GCGCGAAGGC GCGGCCGTCG CGATTGCGGA GCTCGACGCG GCGCTCGCCG ACGAAACCGT CGACGCGATC GCGCGCGACG TGGCCGATGC GCGCGTGCTC GCGGTGCCAG CGGACGTCGC GCAAGCCGAG TCGGTCGCGG CGGCGCTCGC GTGCACGGAG CGCGCGTTCG GCCCGCTCGA CGTGCTCGTC AACAACGCAG GCGTCAACGT GTTCGGCGAT CCGCTCGCGC TTGCCGAAGA AGACTGGCGG CGCTGCTTCG CGATCGATCT CGACGGCGTC TGGCACGGCT GCCGCGCGGC GCTGCCGGGC ATGGTCGAGC GCGGTCGGGG CAGCATCGTG AACATCGCGT CGACGCACGC GTTCAAGATC ATCCCGGGCT GCTTTCCGTA CCCGGTCGCG AAGCACGGCG TGCTGGGCCT CACGCGCGCG CTCGGCGTCG AATATGCGCC GCGCAACGTG CGCGTGAACG CGATCGCGCC CGGCTACATC GAGACGCAAT CGACACATGA CTGGTGGAAC GCGCAGCCCG ACCCCGAGGC CGCGCGCCGC GAAACGCTCG CACTGCAGCC GATGAAGCGG ATCGGGCGTG CGGACGAAGT CGCGATGACC GCGGTGTTTC TCGCATCGGA CGAGGCGCCG TTCATCAACG CGAGCTGCAT CACGATCGAC GGCGGCCGAT CGGTGCTGTA CCACGACTGA
|
Protein sequence | MADRPKGSGA VNRLAGKVAL VTGAGRGIGA AIARAFAREG AAVAIAELDA ALADETVDAI ARDVADARVL AVPADVAQAE SVAAALACTE RAFGPLDVLV NNAGVNVFGD PLALAEEDWR RCFAIDLDGV WHGCRAALPG MVERGRGSIV NIASTHAFKI IPGCFPYPVA KHGVLGLTRA LGVEYAPRNV RVNAIAPGYI ETQSTHDWWN AQPDPEAARR ETLALQPMKR IGRADEVAMT AVFLASDEAP FINASCITID GGRSVLYHD
|
| |