Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2048 |
Symbol | |
ID | 4882273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2037733 |
End bp | 2039175 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640127976 |
Product | putative tryptophan halogenase |
Protein accession | YP_001059083 |
Protein GI | 126440212 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.826968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTTC CGAATCGCAC ACAAGTCCTC GTCATCGGCG GCGGGCCGGC CGGCGCGACC GGCGCCGCGT TCCTCGCGCG CGAAGGCGTC GAGGTCACGC TCGTCGACAA GGAGGTATTC CCCCGCTATC ACATCGGCGA ATCGCTGTTG CCGTCCTGCC TCGAAATCCT CACGCTGATG GGCGCGCGCG ACACGTTCGA CCGCCACGGC TTCCAACGCA AGCCCGGCGC GTACTTCAAC TGGAAAGGCG AGACCTGGAA ACTCGATTTC GGCGAGCTCG GCGGCACCTA TCGCTACAGC TACCAGGTGC GCCGCGAGGA ATTCGATCAC CTGCTGCTGC AGCATGCGCG CGCGGTCGGC GCGCAGGTGC ACGAAGGCGT CAGCGTGCGC GAGATTCTGT TCGACGACGG CCGCCCGTGC GCCGCGCTGT GCGTCGCGCA AGGCACCGAG GAGGCCAGCA CCGTCGAGTT CGACTACCTC GTCGACGCAT CGGGGCGCAA CGGCCTGATG TCGACCCGCT ACCTCGACAA CCGCAAATTC CACGAGATCT TCCGCAACGT CGCCGCGTGG GGCTACTGGG AAGGCTTGAG CTGGCCCGAC GATTGCGCGC CGGGCTCGAT TCTCGTCAGC TCGATTCCCG ACGGCTGGTG GTGGGCGATC CCGCTCGCCG ATCGCCCGAC GAGCGTCGGC GTCGTCATGC ACCGCGACGC GTTCGTCGCG GCCAGGCGCA CGGGCACGCT CGAACAGGTC TACGCGCAGG CGCTCGCGCT GAGCCCGGTG ATGGCGAACC TCACCGAGCA TGCGCGCCTC GTCACGCCGC TCAAGACCGA GCAGGATTAT TCGTACACCT GCGATTCGTT CGCGGGCAAC GGCTACTTCC TGTCCGGCGA CGCGGCATGC TTTCTCGATC CGCTGCTGTC CACGGGCGTG CACCTCGCGA TGTACAGCGG CATGCTCGCC GCCGCGTCGC TCGCCAGCAT CCTGCGCGCC GAGGTGACCG AGCGGGAGGC CGCCGCATAC TATCGCGACA GCTACCGCCA GGCGTACCTG CGCTTTCTCG TGTTCGTGCA GACGTTCTAC GAGGCGCACG GCAAGCTCGG CTACTACAGC AAGGCCGACG AGCTGAGCCA CTACATGATC GAGGCGGGCG ACATCCGGCG CGCGTTCCTG AATCTCGTGT CGGGCCTCGA GGACATCGCC GACGCCGAGC AGGCCACCTC GCACCTGATG GGCGAGATGT CGCGCCGCAT CGATCAGAAC CTCGCGCTTC GCAAGGACAA GCGCGCGCTT TCGTCGGCGA TCGGCAGCAC GCAGGTCGAG GACAACGCGC GGTTCTTCGA CGCGATCGAG GGCCTGCCCT GCCTGTCGGC GAACATGGCG CTCGACGGGC TCTACGTATC GACCCGGCCT CGGCTCGGCC TGCAGCGCGT CGCCGCGATG TAA
|
Protein sequence | MHLPNRTQVL VIGGGPAGAT GAAFLAREGV EVTLVDKEVF PRYHIGESLL PSCLEILTLM GARDTFDRHG FQRKPGAYFN WKGETWKLDF GELGGTYRYS YQVRREEFDH LLLQHARAVG AQVHEGVSVR EILFDDGRPC AALCVAQGTE EASTVEFDYL VDASGRNGLM STRYLDNRKF HEIFRNVAAW GYWEGLSWPD DCAPGSILVS SIPDGWWWAI PLADRPTSVG VVMHRDAFVA ARRTGTLEQV YAQALALSPV MANLTEHARL VTPLKTEQDY SYTCDSFAGN GYFLSGDAAC FLDPLLSTGV HLAMYSGMLA AASLASILRA EVTEREAAAY YRDSYRQAYL RFLVFVQTFY EAHGKLGYYS KADELSHYMI EAGDIRRAFL NLVSGLEDIA DAEQATSHLM GEMSRRIDQN LALRKDKRAL SSAIGSTQVE DNARFFDAIE GLPCLSANMA LDGLYVSTRP RLGLQRVAAM
|
| |