Gene BURPS668_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2048 
Symbol 
ID4882273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2037733 
End bp2039175 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content67% 
IMG OID640127976 
Productputative tryptophan halogenase 
Protein accessionYP_001059083 
Protein GI126440212 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.826968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTTC CGAATCGCAC ACAAGTCCTC GTCATCGGCG GCGGGCCGGC CGGCGCGACC 
GGCGCCGCGT TCCTCGCGCG CGAAGGCGTC GAGGTCACGC TCGTCGACAA GGAGGTATTC
CCCCGCTATC ACATCGGCGA ATCGCTGTTG CCGTCCTGCC TCGAAATCCT CACGCTGATG
GGCGCGCGCG ACACGTTCGA CCGCCACGGC TTCCAACGCA AGCCCGGCGC GTACTTCAAC
TGGAAAGGCG AGACCTGGAA ACTCGATTTC GGCGAGCTCG GCGGCACCTA TCGCTACAGC
TACCAGGTGC GCCGCGAGGA ATTCGATCAC CTGCTGCTGC AGCATGCGCG CGCGGTCGGC
GCGCAGGTGC ACGAAGGCGT CAGCGTGCGC GAGATTCTGT TCGACGACGG CCGCCCGTGC
GCCGCGCTGT GCGTCGCGCA AGGCACCGAG GAGGCCAGCA CCGTCGAGTT CGACTACCTC
GTCGACGCAT CGGGGCGCAA CGGCCTGATG TCGACCCGCT ACCTCGACAA CCGCAAATTC
CACGAGATCT TCCGCAACGT CGCCGCGTGG GGCTACTGGG AAGGCTTGAG CTGGCCCGAC
GATTGCGCGC CGGGCTCGAT TCTCGTCAGC TCGATTCCCG ACGGCTGGTG GTGGGCGATC
CCGCTCGCCG ATCGCCCGAC GAGCGTCGGC GTCGTCATGC ACCGCGACGC GTTCGTCGCG
GCCAGGCGCA CGGGCACGCT CGAACAGGTC TACGCGCAGG CGCTCGCGCT GAGCCCGGTG
ATGGCGAACC TCACCGAGCA TGCGCGCCTC GTCACGCCGC TCAAGACCGA GCAGGATTAT
TCGTACACCT GCGATTCGTT CGCGGGCAAC GGCTACTTCC TGTCCGGCGA CGCGGCATGC
TTTCTCGATC CGCTGCTGTC CACGGGCGTG CACCTCGCGA TGTACAGCGG CATGCTCGCC
GCCGCGTCGC TCGCCAGCAT CCTGCGCGCC GAGGTGACCG AGCGGGAGGC CGCCGCATAC
TATCGCGACA GCTACCGCCA GGCGTACCTG CGCTTTCTCG TGTTCGTGCA GACGTTCTAC
GAGGCGCACG GCAAGCTCGG CTACTACAGC AAGGCCGACG AGCTGAGCCA CTACATGATC
GAGGCGGGCG ACATCCGGCG CGCGTTCCTG AATCTCGTGT CGGGCCTCGA GGACATCGCC
GACGCCGAGC AGGCCACCTC GCACCTGATG GGCGAGATGT CGCGCCGCAT CGATCAGAAC
CTCGCGCTTC GCAAGGACAA GCGCGCGCTT TCGTCGGCGA TCGGCAGCAC GCAGGTCGAG
GACAACGCGC GGTTCTTCGA CGCGATCGAG GGCCTGCCCT GCCTGTCGGC GAACATGGCG
CTCGACGGGC TCTACGTATC GACCCGGCCT CGGCTCGGCC TGCAGCGCGT CGCCGCGATG
TAA
 
Protein sequence
MHLPNRTQVL VIGGGPAGAT GAAFLAREGV EVTLVDKEVF PRYHIGESLL PSCLEILTLM 
GARDTFDRHG FQRKPGAYFN WKGETWKLDF GELGGTYRYS YQVRREEFDH LLLQHARAVG
AQVHEGVSVR EILFDDGRPC AALCVAQGTE EASTVEFDYL VDASGRNGLM STRYLDNRKF
HEIFRNVAAW GYWEGLSWPD DCAPGSILVS SIPDGWWWAI PLADRPTSVG VVMHRDAFVA
ARRTGTLEQV YAQALALSPV MANLTEHARL VTPLKTEQDY SYTCDSFAGN GYFLSGDAAC
FLDPLLSTGV HLAMYSGMLA AASLASILRA EVTEREAAAY YRDSYRQAYL RFLVFVQTFY
EAHGKLGYYS KADELSHYMI EAGDIRRAFL NLVSGLEDIA DAEQATSHLM GEMSRRIDQN
LALRKDKRAL SSAIGSTQVE DNARFFDAIE GLPCLSANMA LDGLYVSTRP RLGLQRVAAM