Gene Information |
Locus tag | BTH_II0854 |
Symbol | |
ID | 3846131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 996276 |
End bp | 997595 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637838157 |
Product | ubiquitin-specific proteinase 31, putative |
Protein accession | YP_439051 |
Protein GI | 83717089 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.15746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAC GCCCCTCGAA ACACTCCGCC CATTGCGGCG CGCCGTCCTC GGCTCGCCGC GCGGTTCGCT CCGATGAATC CGCCCCGACA CGGCGCGCGG CCGCCGCCGG AACTTCGCCC GCGGCTCGAC TGGCGGCCCG CGCCGATGCG GCGCCGCCAG GTCGGTGCGC CGCCTGCGGT TCGACCCGCG CGCCGTCCGC ACCCGCGCGC GGCCTGCCGT CGACCCGCCG CTTCGGCGCG GCGCTCGCGC TGAGCGTCGC GTCGCTCACC GGATGCGGCG GCGGCGGCGA CGAATCGGGC GCGTCCGCCG CGCACGCGTT CGCGCCGCCG ACCGTGCAGC TCGCGTATCC GGCGCAGCCG GCGGCCGCGC AACCGAACGC CGCAATCGCA CATCCGCCCG CCGGACACGT GCCATCGGGC TCGACGCCGG CCACCCTGCC CGCCGCGCCG CCCGCCGTCG TATCCGCCGG CGTCGCGCCG ACGCACGCGG CGTTGCGACG ACCGACGATC GTGCTCGAGT TCGATCGCGC GATCGCGCCG CACTCGGTCG CGGGCGTGAC GCTGTACGAC CCCAATCGCG CGAGCGTCGC CATCGGCGCA TGGTCGTGGC TGAGCGATCG CCGGCTCGCG TTCGCGCCGC TCACGCCGCT CCAGTCGAAC AGCCGCTACG AGATCGCGGT GCCCGCCGGC ATCGAGAGCG CGGCGGGCGA GCGGTCGGCG TCCCCGCTGA CGGCCCAATT CGACACCGCG CCCACCACGC CGCCGCGAGG GCTGCCCAAC CTCGGCGCAA CCTGCTTCAT CAACACGGCG CTGCAACTGG CCGTGCATTC GGCTGCGCTC GACGACATCG TGTCGAACGC GGCCGTCGAT CCCGCCGTGC GCACGCTGCT CGATCGCTAC GATGCCGCTC CGGCCGCCGA GCTCGACGCG CGGCTGCACG CCGCGGTTGC CGCGCTGCGC GCGACGGCGG CGATTCCGGA CAGCGGACCG GGGCATACGC TGGACGTGCT GACGGCGTTG CGCCTGCCGC TGCATCCGGC AGGCGACGCC GATGCAATCC GTTACGCACC GCCCGACGCA AGGGCGTTCC GGCTGCACGG CTGGCCGTTC GACTACGCGG CGCTGCCGAA CCACGAGCGG CTCGTCGCGT TCGACTACAA CGCGGGCGGT CACTATGTCG CCTATGTGAA ACGGGATGGA ATCTGGTATT GCATCGACGA CGCGCAGGTG ACACCGGTCA CCGAGCAGCA GTTGCTCGCC CTGCCGGCGT TCAACCCCAA TCTGGGCAGC ATGGCGATCG AAATCGCGAT CTATCGTTGA
Protein sequence | MMKRPSKHSA HCGAPSSARR AVRSDESAPT RRAAAAGTSP AARLAARADA APPGRCAACG STRAPSAPAR GLPSTRRFGA ALALSVASLT GCGGGGDESG ASAAHAFAPP TVQLAYPAQP AAAQPNAAIA HPPAGHVPSG STPATLPAAP PAVVSAGVAP THAALRRPTI VLEFDRAIAP HSVAGVTLYD PNRASVAIGA WSWLSDRRLA FAPLTPLQSN SRYEIAVPAG IESAAGERSA SPLTAQFDTA PTTPPRGLPN LGATCFINTA LQLAVHSAAL DDIVSNAAVD PAVRTLLDRY DAAPAAELDA RLHAAVAALR ATAAIPDSGP GHTLDVLTAL RLPLHPAGDA DAIRYAPPDA RAFRLHGWPF DYAALPNHER LVAFDYNAGG HYVAYVKRDG IWYCIDDAQV TPVTEQQLLA LPAFNPNLGS MAIEIAIYR
| |
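The length fields in this record can be cross-checked against each other. The sketch below is not part of the source database. It assumes the start and end coordinates are 1-based and inclusive, which is consistent with the listed gene length, and that the CDS ends in a single stop codon. The function names (`span_length`, `expected_protein_length`, `gc_fraction`, `translate`) are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record above.
START_BP = 996276
END_BP = 997595
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439

# Standard genetic code in TCAG order. Translation table 11 uses the same
# amino acid assignments; it differs only in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces that group the sequence."""
    bases = sequence.replace(" ", "").upper()
    return (bases.count("G") + bases.count("C")) / len(bases)


def translate(sequence: str) -> str:
    """Translate complete codons from the first base until a stop codon."""
    bases = sequence.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # First 20 bases of the gene sequence as listed in the record.
    print(translate("ATGATGAAAC GCCCCTCGAA"))
```

Passing the full gene sequence string to `gc_fraction` and `translate` would allow the same kind of comparison against the GC content and protein sequence fields. Note that `translate` does not handle alternative start codons, which table 11 would read as methionine at the first position; that does not matter here because the sequence begins with ATG.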