Gene BURPS1106A_3962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3962 
Symbol 
ID4899538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3866697 
End bp3867761 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content65% 
IMG OID640137188 
ProductRieske family iron-sulfur cluster-binding protein 
Protein accessionYP_001068182 
Protein GI126453073 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTCG ATACGGACCT CCTCCATCGC CACTGGCATC TCGGCTGTCA CCGCCGGGAG 
CTTCCGAACG ACGGCGATTT CGTGCGCTTC GACACCGCAA TCGGCGAAAT CGTGATCTTC
AACGATGCGG GCGAGCTCGT CGCGTTCGAC AACCGCTGCC CGCACCGCGG CGCCCGCATG
TATGTGGACG ACAGCGGCAA CCAGCCGGCG AGCTGCCCGT ACCACGGCTG GACGTATCGC
GAGGGCCGGC TGCTGATACC GGGCCGCGAG CGCTTCGACG GCTGCGCGCT CGAGCGCGCG
AAGCTGCGTA CGTTCGCCGT CGACTGGTGC GGCGACTTCC TGTTCTTTGC CGTCCACCCG
CAGACCGATC TCTACACGCA GCTCGGCAGA TTCGCCGAGG CCGTCGAGAA CATCTCGTTC
AACATCGATC GACGCCTCGA CTTGAACCGC TACGATTTCG AATGCTACTG GCCGCTCGCG
ATCGAGAACG CGCTCGAGCC GTACCACATC GGCGCCGTTC ATCCGCAGAC ACTCGCCACG
CTCGGGCTCG AAGACGGCGA GAACGTGTTC GACGGCGTCA ATTGCGCATG GTACGCCCCC
GTCGGCGCGA GCCGGCAGCG CAATCAGCTC GCCCGGCTCA AGCGCTTCTT CAATCTCGAT
TACCAATACG AAGGGTACGC GAGCATCTAT CTGTTTCCGT TCACGATGAT CTCGTCGACG
TACGGCTACT CGTATTCGCT GCAGCATTTT CTGCCCGCGG GCGGCGGCGG CGATCGCACG
CGCTTCACTA GCCGGCTTTA TGCGGCGCCC GCGGCGAGCG AACAGGCGGC GCAGGCGCTC
GGCGCCTTCT TCGAATCGAC GCGAGACGTC AATCGGCGGG TGTTCGAAGA GGACCACGCG
ATCTGCAAGC GAATGCCGAG GAACGCGTGG TCGATGGCGC CGCTCGCGTG CGCGGCCGAC
ACCGAAGCGA AAATCGATCA TTTCCGCCGC GCGTGCCGCA CGTTCGCCGC GTCGCGCGCC
GCGCTTCCCG TCGTCGACGC GACACGCGAG GCGGCGGCCG GGTAA
 
Protein sequence
MNFDTDLLHR HWHLGCHRRE LPNDGDFVRF DTAIGEIVIF NDAGELVAFD NRCPHRGARM 
YVDDSGNQPA SCPYHGWTYR EGRLLIPGRE RFDGCALERA KLRTFAVDWC GDFLFFAVHP
QTDLYTQLGR FAEAVENISF NIDRRLDLNR YDFECYWPLA IENALEPYHI GAVHPQTLAT
LGLEDGENVF DGVNCAWYAP VGASRQRNQL ARLKRFFNLD YQYEGYASIY LFPFTMISST
YGYSYSLQHF LPAGGGGDRT RFTSRLYAAP AASEQAAQAL GAFFESTRDV NRRVFEEDHA
ICKRMPRNAW SMAPLACAAD TEAKIDHFRR ACRTFAASRA ALPVVDATRE AAAG