Gene BURPS1710b_A2115 details


Gene Information

Locus tag: BURPS1710b_A2115
Symbol:
ID: 3694523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2576789
End bp: 2578030
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 64%
IMG OID: 637732369
Product: iron-sulphur Rieske protein
Protein accession: YP_337266
Protein GI: 76817951
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
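
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(2576789, 2578030)
print(length)                     # 1242
print(protein_length_aa(length))  # 413
```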


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTAT CGGCAGACGT CCGCGCGCTG GTGGCGCGCC GCAAGGCAGG CTACAGCCTC 
GAAGCCCCGT TCTATCTGAG CGACGAGATC TTTGCGCTCG ACATGGACGC GATCTTTCGG
CGGCACTGGA TCCAGGTGGG CGTCGAGCCG GACGTGCCCG AGCCCGGCGA TTACGTGACG
GTGCAGCTCG GGGGCGATTC GATCCTGATC GTGCGCGACG ACGACATGCA GGTTCGCGCG
TTCCACAACG TCTGCCGCCA TCGCGGCGCG CGCCTGTGCA ACGAGGAAAA AGGGTCGGTC
GGCAACATCG TGTGCCCGTA TCACAGCTGG ACCTACAACC TCACGGGCCA GTTGATGTTC
GCCGAGCACA TGGGCGAGAA GTTCGACCGC TGCAAGCACA GCCTGAAGCC CGTGCATCTG
GAGAATCTCG CGGGGCTGCT GTTCGTGTGC CTCGCCGACG AGCCGCCCGT CGATTTCGCG
ACGATGCGCG CGGCGATGGA GCCGTATCTG CTGCCGCACG ATCTGCCGAA CACGAAGATC
GCCGCGCAGA TCGACATCGT CGAGAAAGGC AACTGGAAGC TGACGATGGA GAACAATCGC
GAGTGCTATC ACTGCGTCGC GAACCATCCG GAGTTGACCA TTTCGTTGTA CGAATACGGC
TTCGGCTATC AGCCATCGCC CGCGAACGCC GAAGGCATGG CCGCGTTCGA GCGCACCTGC
GTCGAGCGCG CCGCGCAGTG GGAAGCGCTG AACCTGCCGT CCGTCGAAGT GGAGCGCCTC
ACCGACGTGA CGGGCTTTCG CACGCAGCGT CTGCCGCTCG ACCGCAGCGG CGAATCGCAA
ACGCTCGATG CGAAGGTCGC GTCGAAGAAG CTGCTCGGCG AATTCCGCCA GGCGGATCTC
GGCGGCCTGT CGTTCTGGAC GCAGCCGAAT TCGTGGCACC ACTTCATGAG CGATCACATC
GTCACGTTCT CGGTGATTCC GCTGTCGGCG GGCGAGACGC TCGTGCGCAC GAAATGGCTC
GTTCACAGGG ACGCGAAGGA AGGCATCGAC TACGACGTGA AGAACCTCAC GGCCGTCTGG
AACGCGACGA ACGATCAGGA TCGCGCGCTC GTCGAATTCT CGCAGCGCGG CGCGGCGAGC
AGCGCCTACG AGCCCGGCCC GTATTCGCCG TACACCGAAG GGCTCGTCGA GAAGTTCTGC
GAGTGGTACG TCGGCCGGCT GGCCGCGCAT ATCGGCGCAT AG
 
Protein sequence
MKVSADVRAL VARRKAGYSL EAPFYLSDEI FALDMDAIFR RHWIQVGVEP DVPEPGDYVT 
VQLGGDSILI VRDDDMQVRA FHNVCRHRGA RLCNEEKGSV GNIVCPYHSW TYNLTGQLMF
AEHMGEKFDR CKHSLKPVHL ENLAGLLFVC LADEPPVDFA TMRAAMEPYL LPHDLPNTKI
AAQIDIVEKG NWKLTMENNR ECYHCVANHP ELTISLYEYG FGYQPSPANA EGMAAFERTC
VERAAQWEAL NLPSVEVERL TDVTGFRTQR LPLDRSGESQ TLDAKVASKK LLGEFRQADL
GGLSFWTQPN SWHHFMSDHI VTFSVIPLSA GETLVRTKWL VHRDAKEGID YDVKNLTAVW
NATNDQDRAL VEFSQRGAAS SAYEPGPYSP YTEGLVEKFC EWYVGRLAAH IGA
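
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, using only the Python standard library, of how one might strip that layout, compute GC content, and translate the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the code does not handle ambiguous bases.

```python
from itertools import product

# Codon table in the conventional NCBI ordering (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAAGTAT CGGCAGACGT CCGCGCGCTG"))  # MKVSADVRAL
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree, and `gc_fraction` on the full gene sequence can be compared with the GC content field.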