Gene BURPS668_A1482 details


Gene Information

Locus tag: BURPS668_A1482
Symbol:
ID: 4888043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1426594
End bp: 1428252
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 67%
IMG OID: 640131421
Product: putative halogenase
Protein accession: YP_001062478
Protein GI: 126442935
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
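
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration, not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1426594
END_BP = 1428252
GENE_LENGTH_BP = 1659
PROTEIN_LENGTH_AA = 552


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of three.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1659
print(expected_protein_length(GENE_LENGTH_BP))  # 552
```

Both results match the Gene Length (1659 bp) and Protein Length (552 aa) listed in the record.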


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.133442
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACA ATCAGGTCAG GAAATACGAC GTCGTCATCA TCGGGACGGG CATCGGCGGC 
ACGACGCTCG GCGCGATCCT CGCGCGGTAC GGGCTGCGGG TCGCGATGAT CGATTCCGGC
ACGCATCCGC GCTTTGCCGT CGGCGAATCG ACGATCGCCA CGACCACGCT GACGCTCGAG
CTGATGGCGA TGCGCTTCGA CGTGCCGGAG CTCAAGCACA TCACGTCGAT CGCCGAAGTG
AGCGAGAACG TGATGCCGTC GTGCGGCGTG AAGCGCAACT TCGGCTTCGT GTATCACCGC
GAGCACACCG AGCAGAATCC GCAGGAGGTC AATCAGGCGC TCGTCGTCAA CGAGGTGCAC
TATTTCCGGC AGGACATCGA CGCGTACATG CTTCACGTGG CCATTCGCTA CGGCTGCGAC
GCGTATCAGA ACACCGTCGT CGACGATATC CGGATCGACG CCGGCGGCGT GACGGTGACG
ACGCGCGGCG GCCTCACGTT CGAGGCGGAT TTCGTCGCCG ACGGCGCGGG GTACCGCTCG
GTGCTGGCCG ACAAGCTCGG CCTGCGCGAG ACGCCGTGCC GCGCGAAGAC GCATGCGCGC
GGCCTGTTCA CGCACATGAT CGACGTGAAG CCGTTCGACG CCTGCCGCGA GGTGCCCAAG
GCGCTGCAGC AGCCGGTGCC GTGGCATCAG GGGACGCTGC ACCACCTGTT CGACGGCGGC
TGGATGTGGG TGATTCCGTT CAACAACACG CCGGAATCGA AGAACCCGCT CGTGAGCGTC
GGCCTGATGC TCGATCCGCG CAAGCATCCG AAGCCGGACG TGCGGCCCGA GCAGGAATTC
GCCGATTTCA TCGCGAAGCA TCCGGACATG GCGCGGCAGT TCGCCGATGC GCGCGCGGTG
CGCGAATGGG TGTCCTCGGG CCGCATCCAG TACAGCGCGA GCGCATGCAC GGGCGACCGG
TTCTGCCTGC TCTCGCATGC GACGGGCTTC ATCGATCCGC TGTTCTCGCG CGGCCTGTTC
AACACGATGC AGACGACCAA CGCGCTCGCG GGGCTGCTGA TCGAAGCCGC GAAGGACCGC
GATTTCAGCA AGGCGCGCTT CGCGCCGGTC GAGAAGCTCC AGCAGGGCCT GATCGATTTC
AACGATCGGC TCGTCAACTG CTCGTACCTC TCGTGGGGCC ACTATCCGCT CTGGAACGCG
TGGTTCCGCC TGTGGCTGCT CACCGGCAAC TACGGCCAGC TTCACCTGCA GCGCGCGATG
ATGAAGTACC GGCAAACCGG CGACGCGCGC TGGCTCGAGC CGGCCGACGC GCTGTTGCCG
GGCGCGTTCA CCACGCTCGA GCCGATCATG CGGCTGTTCG AGGAGGCGGC GGTGTGCGTC
GAGCGGTACG GCGCGGGCGA ACTCTCGGGC GAGGCGGCCG AGCGGGCGAT CTACGCGCTG
CTCGAGGAGA ACGCCGCGCT GCTGCCGCCG TTCTTCGATT TCGTTTCGCC CGCCGAGCGG
ATCACCTGGC CGAGCACGCC CGAGAAGATC GCCGCGCTGC TGCTCGAGTG GGTCGAGCGG
CTGCCGGAGG ACGTGCGGGC GGAATACTTC GACTACGACG TGCGGGCGCT GCTCCAGCAG
CCGGTCGTCA AGGACACGAT CACCGCGGAC GTCGCGTGA
 
Protein sequence
MSNNQVRKYD VVIIGTGIGG TTLGAILARY GLRVAMIDSG THPRFAVGES TIATTTLTLE 
LMAMRFDVPE LKHITSIAEV SENVMPSCGV KRNFGFVYHR EHTEQNPQEV NQALVVNEVH
YFRQDIDAYM LHVAIRYGCD AYQNTVVDDI RIDAGGVTVT TRGGLTFEAD FVADGAGYRS
VLADKLGLRE TPCRAKTHAR GLFTHMIDVK PFDACREVPK ALQQPVPWHQ GTLHHLFDGG
WMWVIPFNNT PESKNPLVSV GLMLDPRKHP KPDVRPEQEF ADFIAKHPDM ARQFADARAV
REWVSSGRIQ YSASACTGDR FCLLSHATGF IDPLFSRGLF NTMQTTNALA GLLIEAAKDR
DFSKARFAPV EKLQQGLIDF NDRLVNCSYL SWGHYPLWNA WFRLWLLTGN YGQLHLQRAM
MKYRQTGDAR WLEPADALLP GAFTTLEPIM RLFEEAAVCV ERYGAGELSG EAAERAIYAL
LEENAALLPP FFDFVSPAER ITWPSTPEKI AALLLEWVER LPEDVRAEYF DYDVRALLQQ
PVVKDTITAD VA
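
The protein sequence follows from the gene sequence by codon-wise translation. The minimal sketch below is not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene are embedded to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied with their display spacing.
FIRST_CODONS = "ATGAGCAACA ATCAGGTCAG GAAATACGAC"

print(translate(FIRST_CODONS))  # MSNNQVRKYD
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein sequence and the listed GC content. The 30-base fragment alone is not representative of the overall GC content.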