Gene BURPS1106A_2103 details

Gene Information

Locus tag: BURPS1106A_2103
Symbol:
ID: 4900804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2094087
End bp: 2095529
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 67%
IMG OID: 640135333
Product: putative tryptophan halogenase
Protein accession: YP_001066368
Protein GI: 126453837
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
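
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 2094087
END_BP = 2095529
CODON_NT = 3  # nucleotides per codon


def gene_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon."""
    if gene_len % CODON_NT != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // CODON_NT - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(f"gene length: {n} bp, protein length: {protein_length(n)} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1443 bp) and Protein Length (480 aa).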


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0674783
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCTTC CGAATCGCAC ACAAGTCCTC GTCATCGGCG GCGGCCCGGC CGGCGCGACC 
GGCGCCGCGT TCCTCGCGCG CGAAGGCGTC GAGGTCACGC TCGTCGACAA GGAGGTATTC
CCCCGCTATC ACATCGGCGA ATCGCTGTTG CCGTCCTGCC TCGAAATCCT CACGCTGATG
GGCGCGCGCG ACACGTTCGA CCGCCACGGC TTCCAACGCA AGCCCGGCGC GTACTTCAAC
TGGAAAGGCG AGACCTGGAA ACTCGATTTC GGCGAGCTCG GCGGCACCTA TCGCTACAGC
TACCAGGTGC GCCGCGAGGA ATTCGATCAC CTGCTGCTGC AGCATGCGCG CGCGGTCGGC
GCGCAGGTGC ACGAAGGCGT CAGCGTGCGC GAGATTCTGT TCGACGACGG CCGCCCATGC
GCCGCGCTGT GCGTCGCGCA AGGCGCCGAG GAGGCCACCA CCGTCGAGTT CGACTACCTC
GTCGACGCAT CGGGGCGCAA CGGCCTGATG TCGACCCGCT ACCTCGACAA CCGCAAATTC
CACGAGATCT TCCGCAACGT CGCCGCGTGG GGCTACTGGG AAGGCTTGAG CTGGCCCGAC
GATTGCGCGC CGGGCTCGAT TCTCGTCAGC TCGATTCCCG ACGGCTGGTG GTGGGCGATC
CCGCTCGCCG ATCGCCCGAC GAGCGTCGGC GTCGTCATGC ACCGCGACGC GTTCGTCGCG
GCCAGGCGCA CGGGCACGCT CGAACAGGTC TACGCGCAGG CGCTCGCGCT GAGCCCGGTG
ATGGCGAACC TCACCGAGCA TGCGCGCCTC GTCACGCCGC TCAAGACCGA GCAGGATTAT
TCGTACACCT GCGATTCGTT CGCGGGCAAC GGCTACTTCC TGTCCGGCGA CGCGGCATGC
TTTCTCGATC CGCTGCTGTC CACGGGCGTG CACCTCGCGA TGTACAGCGG CATGCTCGCC
GCCGCGTCGC TCGCCAGCAT CCTGCGCGCC GAGGTGACCG AGCGGGAGGC CGCCGCATAC
TATCGCGACA GCTACCGCCA GGCGTACCTG CGCTTTCTCG TGTTCGTGCA GACGTTCTAC
GAGGCGCACG GCAAGCTCGG CTACTACAGC AAGGCCGACG AGCTGAGCCA CTACATGATC
GAGGCGGGCG ACATCCGGCG CGCGTTCCTG AATCTCGTGT CGGGCCTCGA GGACATCGCC
GACGCCGAGC AGGCCACCTC GCACCTGATG GGCGAGATGT CGCGCCGCAT CGATCAGAAC
CTCGCGCTTC GCAAGGACAA GCGCGCGCTT TCGTCGACGA TCGGCAGCAC GCAGGTCGAG
GACAACGCGC GGTTCTTCGA CGCGATCGAG GGCCTGCCCT GCCTGTCGGC GAACATGGCG
CTCGACGGGC TCTACGTATC GACCCGGCCT CGGCTCGGCC TGCAGCGCGT CGCCGCGATG
TAA
 
Protein sequence
MHLPNRTQVL VIGGGPAGAT GAAFLAREGV EVTLVDKEVF PRYHIGESLL PSCLEILTLM 
GARDTFDRHG FQRKPGAYFN WKGETWKLDF GELGGTYRYS YQVRREEFDH LLLQHARAVG
AQVHEGVSVR EILFDDGRPC AALCVAQGAE EATTVEFDYL VDASGRNGLM STRYLDNRKF
HEIFRNVAAW GYWEGLSWPD DCAPGSILVS SIPDGWWWAI PLADRPTSVG VVMHRDAFVA
ARRTGTLEQV YAQALALSPV MANLTEHARL VTPLKTEQDY SYTCDSFAGN GYFLSGDAAC
FLDPLLSTGV HLAMYSGMLA AASLASILRA EVTEREAAAY YRDSYRQAYL RFLVFVQTFY
EAHGKLGYYS KADELSHYMI EAGDIRRAFL NLVSGLEDIA DAEQATSHLM GEMSRRIDQN
LALRKDKRAL SSTIGSTQVE DNARFFDAIE GLPCLSANMA LDGLYVSTRP RLGLQRVAAM
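
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that cleanup together with a GC-percentage calculation of the kind reported in the GC content field. The function names are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as printed in the record.
    raw = "ATGCATCTTC CGAATCGCAC "
    seq = clean_sequence(raw)
    print(seq, round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the listed GC content, subject to how the database rounds.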