Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_4315 |
Symbol | |
ID | 4094902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | + |
Start bp | 1536041 |
End bp | 1536841 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638017606 |
Product | haloacid dehalogenase, type II |
Protein accession | YP_624174 |
Protein GI | 107026663 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.103346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATCG AACCGGCGCC GCGCCATCCG GCGGCACGGC GATGTGAACC CTCATCGAAC GACCTTCACT GGATCACCAT GACGAACCCT GCCGAAATCA AGGCGCTCGT TTTCGATGTC TTCGGAACGA TCGTCGACTG GAGAAGCGGT GTCGCGCGCG GCACCGCCGC GTTCCTGGAT CGCCACGCGC CGACACTCGA CCCGTTCGCG TTCGCCGATG CGTGGCGCGC CGAGTATTCG CCGTCGATGG AAGAAATCCG CAGCGGACGC CGCCGTTACG TGCGGCTCGA CGTGCTGCAT CGCGAGAACC TGGTGCGCAC GCTCGACCGC TTCGGGATCG TCGACGTGCC GGAGGCCGAC ATCGACGCGC TCAATCTCGC GTGGCATCGG CTCGATCCGT GGCCGGATGC CGTCGCCGGG CTGCACCGGC TGAAGCAGCG CTACATCATC GCGCCGCTGT CCAACGGCAA TATCCGGCTG ATGGTCGACG TCGCGAAGCA TGGCGGGCTG CCGTGGGACG CGATTCTCGG CGCCGAGGTC GCCCGCGCGT ACAAGCCGTC GCCGGCGGTC TACACCGAAG CGGTCGAGAT TCTCGGCCTC GCGCCGGCCG AGCTGTGCCT GGTGGCCGCG CACAACGGCG ATCTTGGCGC CGCGCGCCGG CTCGGGCTGT CGACCGCGTT CGTGCTGCGG CCGACCGAGC ACGGGCCCGG CCAGACGACC GACCTGCAAG CCGACGATGC GTGGGATTTC GACGTCAGGG ACCTGAACGA ATTGGCGGAT CGGCTGGGCT GCCCGCGTTG A
|
Protein sequence | MSIEPAPRHP AARRCEPSSN DLHWITMTNP AEIKALVFDV FGTIVDWRSG VARGTAAFLD RHAPTLDPFA FADAWRAEYS PSMEEIRSGR RRYVRLDVLH RENLVRTLDR FGIVDVPEAD IDALNLAWHR LDPWPDAVAG LHRLKQRYII APLSNGNIRL MVDVAKHGGL PWDAILGAEV ARAYKPSPAV YTEAVEILGL APAELCLVAA HNGDLGAARR LGLSTAFVLR PTEHGPGQTT DLQADDAWDF DVRDLNELAD RLGCPR
|
| |