Gene BURPS1106A_A1397 details

Gene Information

Locus tag: BURPS1106A_A1397
Symbol:
ID: 4903780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1362536
End bp: 1364194
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 67%
IMG OID: 640144503
Product: putative halogenase
Protein accession: YP_001075431
Protein GI: 126456710
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
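
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1362536
end_bp = 1364194

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1659 bp
print(protein_length, "aa")   # 552 aa
```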


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.666039
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACA ATCAGGTCAG GAAATACGAC GTCGTCATCA TCGGGACGGG CATCGGCGGC 
ACGACGCTCG GCGCGATCCT CGCGCGGCAC GGGCTGCGGG TCGCGATGAT CGATTCCGGC
ACGCATCCGC GCTTTGCCGT CGGCGAATCG ACGATCGCCA CGACCACGCT GACGCTCGAG
CTGATGGCGA TGCGCTTCGA TGTGCCGGAG CTCAAGCACA TCACGTCGAT CGCCGAAGTG
AGCGAGAACG TGATGCCGTC GTGCGGCGTG AAGCGCAACT TCGGCTTCGT GTATCACCGC
GAGCACACCG AGCAGAATCC GCAGGAGGTC AATCAGGCGC TCGTCGTCAA CGAGGTGCAC
TATTTCCGGC AGGACATCGA CGCGTACATG CTGCACGTGG CCATTCGCTA CGGCTGCGAC
GCGTATCAGA ACACCGTCGT CGACGATATC CGGATCGACG CCGGCGGCGT GACGGTGACG
ACGCGCGGCG GCCTCACGTT CGAGGCGGAT TTCGTCGCCG ACGGCGCGGG GTACCGCTCG
GTGCTGGCCG ACAAGCTCGG CCTGCGCGAG ACGCCGTGCC GCGCGAAGAC GCATGCGCGC
GGCCTGTTCA CGCACATGAT CGACGTGAAG CCGTTCGACG CCTGCCGCGA GGTGCCCAAG
GCGCTGCAGC AGCCGGTGCC GTGGCATCAG GGGACGCTGC ACCACCTGTT CGACGGCGGC
TGGATGTGGG TGATTCCGTT CAACAACACG CCGGAATCGA AGAACCCGCT CGTGAGCGTC
GGCCTGATGC TCGATCCGCG CAAGCATCCG AAGCCGGACG TGCGGCCCGA GCAGGAATTC
GCCGACTTCA TCGCGAAGCA TCCGGACATG GCGCGGCAGT TCGCCGATGC GCGCGCGGTG
CGCGAATGGG TGTCCTCGGG CCGCATCCAG TACAGCGCGA GCGCATGCAC GGGCGACCGG
TTCTGCCTGC TCTCGCATGC GACGGGCTTC ATCGATCCGC TGTTCTCGCG CGGCCTGTTC
AACACGATGC AGACGACCAA CGCGCTCGCG GGGCTGCTGA TCGAAGCCGC AAAGGACCGC
GATTTCAGCA AGGCGCGCTT CGCGCCGGTC GAGAAGCTCC AGCAGGGCCT GATCGATTTC
AACGATCGGC TCGTCAACTG CTCGTACCTC TCGTGGGGCC ACTATCCGCT CTGGAACGCG
TGGTTCCGCC TGTGGCTGCT CACCGGCAAC TACGGCCAGC TTCACCTGCA GCGCGTGATG
ATGAAGTACC GGCAAACCGG CGACGCGCGC TGGCTCGAGC CGGCCGACGC GCTGTTGCCG
GGCGCGTTCA CCACGCTCGA GCCGATCATG CGGCTGTTCG AGGAGGCGGC GGTGTGCGTC
GAGCGGTACG GCGCGGGCGA ACTCTCGGGC GAGGCGGCCG AGCGGGCGAT CTACGCGCTG
CTCGAGGAGA ACGCCGCGCT GCTGCCGCCG TTCTTCGATT TCGTTTCGCC CGCCGAGCGG
ATCACCTGGC CGAGCACGCC CGAGAAGATC GCCGCGCTGC TGCTCGAGTG GGTCGAGCGG
CTGCCGGAGG ACGTGCGGGC GGAATACTTC GACTACGACG TGCGGGCGCT GCTCCAGCAG
CCGGTCGTCA AGGACACGAT CACCGCGGAC GTCGCGTGA
 
Protein sequence
MSNNQVRKYD VVIIGTGIGG TTLGAILARH GLRVAMIDSG THPRFAVGES TIATTTLTLE 
LMAMRFDVPE LKHITSIAEV SENVMPSCGV KRNFGFVYHR EHTEQNPQEV NQALVVNEVH
YFRQDIDAYM LHVAIRYGCD AYQNTVVDDI RIDAGGVTVT TRGGLTFEAD FVADGAGYRS
VLADKLGLRE TPCRAKTHAR GLFTHMIDVK PFDACREVPK ALQQPVPWHQ GTLHHLFDGG
WMWVIPFNNT PESKNPLVSV GLMLDPRKHP KPDVRPEQEF ADFIAKHPDM ARQFADARAV
REWVSSGRIQ YSASACTGDR FCLLSHATGF IDPLFSRGLF NTMQTTNALA GLLIEAAKDR
DFSKARFAPV EKLQQGLIDF NDRLVNCSYL SWGHYPLWNA WFRLWLLTGN YGQLHLQRVM
MKYRQTGDAR WLEPADALLP GAFTTLEPIM RLFEEAAVCV ERYGAGELSG EAAERAIYAL
LEENAALLPP FFDFVSPAER ITWPSTPEKI AALLLEWVER LPEDVRAEYF DYDVRALLQQ
PVVKDTITAD VA
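
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the code below strips the block spacing and translates the first and last rows of the gene sequence. The helper names `ungroup` and `translate` are hypothetical and not part of any database tool. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard codon table is used here.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def ungroup(row: str) -> str:
    """Remove the spaces that split a sequence row into blocks of ten."""
    return "".join(row.split())


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGAGCAACA ATCAGGTCAG GAAATACGAC GTCGTCATCA TCGGGACGGG CATCGGCGGC"
last_row = "CCGGTCGTCA AGGACACGAT CACCGCGGAC GTCGCGTGA"

first_peptide = translate(ungroup(first_row))
last_peptide = translate(ungroup(last_row))

print(first_peptide)  # MSNNQVRKYDVVIIGTGIGG
print(last_peptide)   # PVVKDTITADVA*
```

The last row can be translated on its own because the rows before it total 1620 bases, a multiple of three, so the reading frame is preserved. The trailing `*` corresponds to the TGA stop codon, which is not included in the 552-residue protein sequence.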