Gene BURPS1106A_A0390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0390 
Symbol 
ID4905109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp371369 
End bp372361 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content71% 
IMG OID640143497 
ProductRieske family iron-sulfur cluster-binding protein 
Protein accessionYP_001074433 
Protein GI126455605 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.494645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGG TCGCTCAGAT TCGTTTCGAT TCCATTGCCC GCTGGATGCC GGTCGCGCTG 
TCCGAGCAGG TGAGCGGCAG GGCGGCGCTT GCCGTCATCT GCATGGAGCA GCCGCTCGTG
CTGTTTCGCG ACGCGTCGGG CGCCGTATGC GCGATGGAGG ATCGTTGCGC GCATCGCCGA
GCGCCGCTAT CGCTCGGGCG CGTCACGCCC GACGGCCGGC TGCAGTGCGC GTATCACGGC
TGGACCTACG ACGGCGCGAC GGGCGCCTGC GTGGCGATTC CGAATCTGTC GGCGAGCGAG
CGCGTGCCCG CGCACTATGC CGCGCATGCG TACAAGACGC TCGAACGCGA CGGCTTCATA
TGGGCCTGCG CGCGCGATGC ACCGCCACCC GCCGAAGCGA TCGCTCGCGA CGCCCGCAGC
GCCCGGCGAT TCGCGGGCTC GGTGACGGTC GCCATCGCGC GCGACGAATA CGTCGCCGCA
TTGGCCGACG GGCCGCATCT GACGATGCGC ATCGCCGGCC TGTACATCAC GGATTACGTG
ATCGCGGACG CGACGCCGCA CGACGGCGAC ATCGCGACGG AACGCGGCGT CACGTGGCTG
GCGCACATCG TCGACAGGCA CTTCGGCGTG CGTCATCCGT GGACGCTGCG CGTCACGTCG
CCGCGAGACG GTGCCCTCGC GTCGGTCGAA CTCGCATCGC GCGACGGCGC GACGGCGCTC
TGGGCGTCGA TCGCGATCAC GCCGGCGGCG CGCGGCGCGA CGAACGTACT GTGGCGCGGC
GGCGTCGCGG CCGACGCGAG CGGCTTCGGC GCAAAACTGT TTCGGACGTG GGCGCGCCTG
CACGCCGCGC CGTTCGCGAT GCTCGCGCAC GTCGACGGCC GCGCGCTGTC GACGCTCGAC
GCGCTCTATT CGCGGGCATG GCGCGGCCCG ATCCCGGAGG GCATCGCCCA CACGCGGCCG
ATGCCGGCCG ACTATCGCAC AAGGAGCCGA TGA
 
Protein sequence
MNTVAQIRFD SIARWMPVAL SEQVSGRAAL AVICMEQPLV LFRDASGAVC AMEDRCAHRR 
APLSLGRVTP DGRLQCAYHG WTYDGATGAC VAIPNLSASE RVPAHYAAHA YKTLERDGFI
WACARDAPPP AEAIARDARS ARRFAGSVTV AIARDEYVAA LADGPHLTMR IAGLYITDYV
IADATPHDGD IATERGVTWL AHIVDRHFGV RHPWTLRVTS PRDGALASVE LASRDGATAL
WASIAITPAA RGATNVLWRG GVAADASGFG AKLFRTWARL HAAPFAMLAH VDGRALSTLD
ALYSRAWRGP IPEGIAHTRP MPADYRTRSR