Gene BURPS1106A_A0747 details


Gene Information

Locus tag: BURPS1106A_A0747
Symbol:
ID: 4903420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 738058
End bp: 739326
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 64%
IMG OID: 640143853
Product: Rieske family iron-sulfur cluster-binding protein
Protein accession: YP_001074783
Protein GI: 126457442
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.239242
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGCAA GGAATCCGGA GCAAACGATG AAAGTATCGG CAGACGTCCG CGCGCTGGTG 
GCGCGCCGCA AGGCAGGCTA CAGCCTCGAA GCCCCGTTCT ATCTGAGCGA CGAGATCTTT
GCGCTCGACA TGGACGCGAT CTTTCGGCGG CACTGGATCC AGGTGGGCGT CGAGCCGGAC
GTGCCCGAGC CCGGCGATTA CGTGACGGTG CAGCTCGGGG GCGATTCGAT CCTGATCGTG
CGCGACGACG ACATGCAGGT TCGCGCGTTC CACAACGTCT GCCGCCATCG CGGCGCGCGC
CTGTGCAACG AGGAAAAAGG GTCGGTCGGC AACATCGTGT GCCCGTATCA CAGCTGGACC
TACAACCTCA CGGGCCAGTT GATGTTCGCC GAGCACATGG GCGAGAAGTT CGACCGCTGC
AAGCACAGCC TGAAGCCCGT GCATCTGGAG AATCTCGCGG GGCTGCTGTT CGTGTGCCTC
GCCGACGAGC CGCCCGTCGA TTTCGCGACG ATGCGCGCGG CGATGGAGCC GTATCTGCTG
CCGCACGATC TGCCGAACAC GAAGATCGCC GCGCAGATCG ACATCGTCGA GAAAGGCAAC
TGGAAGCTGA CGATGGAGAA CAATCGCGAG TGCTATCACT GCGTCGCGAA CCATCCGGAG
TTGACCATTT CGTTGTACGA ATACGGCTTC GGCTATCAGC CATCGCCCGC GAACGCCGAA
GGCATGGCCG CGTTCGAGCG CACCTGCGTC GAGCGCGCCG CGCAGTGGGA AGCGCTGAAC
CTGCCGTCCG TCGAAGTGGA GCGCCTCACC GACGTGACGG GCTTTCGCAC GCAGCGTCTG
CCGCTCGACC GCAGCGGCGA ATCGCAAACG CTCGATGCGA AGGTCGCGTC GAAGAAGCTG
CTCGGCGAAT TCCGCCAGGC GGATCTCGGC GGCCTGTCGT TCTGGACGCA GCCGAATTCG
TGGCACCACT TCATGAGCGA TCACATCGTC ACGTTCTCGG TGATTCCGCT GTCGGCGGGC
GAGACGCTCG TGCGCACGAA ATGGCTCGTT CACAGGGACG CGAAGGAAGG CATCGACTAC
GACGTGAAGA ACCTCACGGC CGTCTGGAAC GCGACGAACG ATCAGGATCG CGCGCTCGTC
GAATTCTCGC AGCGCGGCGC GGCGAGCAGC GCCTACGAGC CCGGCCCGTA TTCGCCGTAC
ACCGAAGGGC TCGTCGAGAA GTTCTGCGAG TGGTACGTCG GCCGGCTGGC CGCGCATATC
GGCGCATAG
 
Protein sequence
MDARNPEQTM KVSADVRALV ARRKAGYSLE APFYLSDEIF ALDMDAIFRR HWIQVGVEPD 
VPEPGDYVTV QLGGDSILIV RDDDMQVRAF HNVCRHRGAR LCNEEKGSVG NIVCPYHSWT
YNLTGQLMFA EHMGEKFDRC KHSLKPVHLE NLAGLLFVCL ADEPPVDFAT MRAAMEPYLL
PHDLPNTKIA AQIDIVEKGN WKLTMENNRE CYHCVANHPE LTISLYEYGF GYQPSPANAE
GMAAFERTCV ERAAQWEALN LPSVEVERLT DVTGFRTQRL PLDRSGESQT LDAKVASKKL
LGEFRQADLG GLSFWTQPNS WHHFMSDHIV TFSVIPLSAG ETLVRTKWLV HRDAKEGIDY
DVKNLTAVWN ATNDQDRALV EFSQRGAASS AYEPGPYSPY TEGLVEKFCE WYVGRLAAHI
GA
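
Consistency check (illustrative)

The Start bp, End bp, Gene Length and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below is not part of the source record. It assumes that the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAG). The function names are hypothetical helpers chosen for illustration. The GC helper strips the spaces and line breaks used in the sequence layout before counting.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the layout."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(738058, 739326)
    print(length)                  # 1269
    print(protein_length(length))  # 422
```

Passing the full gene sequence from this record to gc_percent should give a value comparable to the reported GC content, subject to how the database rounds.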