Gene BURPS1710b_A2342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2342 
Symbol 
ID3692350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2848029 
End bp2849651 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content71% 
IMG OID637732597 
Productdi-haem cytochrome c peroxidase family protein 
Protein accessionYP_337494 
Protein GI76819839 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.349952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCCGCG CACGCTCGGC GCGGCGCCGC CCGCACGCGA CGCGCGCCCG ATACATCACC 
GATGCACGGC CGGCCGGCGC ACGCAACGCC GGCCGTCATT CCGGCGCGAC GCACGCCGGA
TTTTTCTTTT GCATGATCAC GACCGAACGC TCCAGCATGG CCGAACCGCT TTGCGCGCAA
CCCGCTCCGT CCACCCGATC CGACGCATGC GCGCCGGCCG CGCTTGCCAC CGTCTCGCGC
CGCCGCGGCC GCCGCAACGC GTGCGCGATG CGCCACGCGC CGGCGGCCGC CGCGTTCGGC
GCGCTCGGCT TCGCCGCGTT CGCGCTGGCG TTTCCCGAGC ACGTGCCGAA CGCGGTCGGC
GCGATCGTCG AAAACCTCAC GGGCGCGAAT CCGCAGCCGG TCGCGCTGCG CCGCCCGGGC
GCCGAGCCGC TGAGCGCGGT CGCGCAGCTC GGCCGCGCGC TGTTCTTCGA TCCCGCGCTG
TCCGCGTCGG GCCGGCAATC GTGTGCGTCG TGCCACAGCC CCGATCATGC GTACGGCCCG
CCGAACGATC TGGACGTGCA ACTGGGCGGC GCCGCGCTGA CGCAGCCCGG CTATCGGCCG
CCGCCGTCGC TGATGTATCT GTACCGGCAG CCGAACTTCA GCATCGGCCC GGACTCGTCC
GAGAACGACG ACGCGGCGAG CGTCGCGCAA CAGGCCGCAT CCCAGGCCGC ATCCGCGGCG
GGCGCCGTGC GCGCGCGGAA GAGCGCCGGC GCGGCGGCCG CGCCGCAGCT CGTGCCGCAG
GGCGGGATGT TCTGGGACGG CCGCGCGGAT ACGCTGCAGC AGCAGGCGTT CGGCCCGTTG
ATGAATCCGG TCGAGATGGC GAACGCGAGC ACCGGCGACG TCGCGCGCAA GCTCGCGCAC
GCGCGCTACG CGCCGCGGTT CCGGCAGTTG TTCGGCCCGC GCATCTTCGA CGACGCACGT
CTTGCGGTGT CCGAAGCGAT GTTCGCGATC GCGCGCTACC AGGTGGAGGA CCCGTCGTTC
CATCCGTATT CGAGCAAGTA CGACCGCTGG CTCGAAGGCG ACGCGCGGCT CACGCAGGCG
GAGCTGCGCG GCATGCGGCG CTTCAACGAT CCGAACAAGG CGAATTGCGC GGGCTGCCAC
CTGTCGAAGC CGAGCGCGGA CGGTCTGCCG CCGATGTTCA CCGATTTCCA GTACGAGGCG
CTCGGCGTGC CGCGCAACCG CGCGCTCGCG CAGAACCGCA ATCCGGCGTT CCACGATCTC
GGCATCTGCG GGCCGTTTCG CGACAACTTG AAGACGCAGA CGCAATACTG CGCGATGTTC
GCGACGCCTT CGCTGCGCAA CGTCGCGACG CGCCGCGTGT TCTTCCACAA CGGCGTCTAT
CATTCGCTCG ACCGGGTGCT CGCGTTCTAC AACCTGCGCA GCGTCGATCC GGGCAAGATC
TATCCGCGCG ACGCAAGCGG CCGGGTGCTG CAATACGACG ACATCCCGAG CGCGTATCGC
GCGAACGTCG ACGTCGCCGA TGCGCCGTTC GACCGCAAGC CGGGCGACGC GCCCGCGATG
ACCGAGCAGG ACATGCGCGA CATCGTTGCG TTTCTGAACA CGCTGACCGA CGAGAAGCGC
TGA
 
Protein sequence
MRRARSARRR PHATRARYIT DARPAGARNA GRHSGATHAG FFFCMITTER SSMAEPLCAQ 
PAPSTRSDAC APAALATVSR RRGRRNACAM RHAPAAAAFG ALGFAAFALA FPEHVPNAVG
AIVENLTGAN PQPVALRRPG AEPLSAVAQL GRALFFDPAL SASGRQSCAS CHSPDHAYGP
PNDLDVQLGG AALTQPGYRP PPSLMYLYRQ PNFSIGPDSS ENDDAASVAQ QAASQAASAA
GAVRARKSAG AAAAPQLVPQ GGMFWDGRAD TLQQQAFGPL MNPVEMANAS TGDVARKLAH
ARYAPRFRQL FGPRIFDDAR LAVSEAMFAI ARYQVEDPSF HPYSSKYDRW LEGDARLTQA
ELRGMRRFND PNKANCAGCH LSKPSADGLP PMFTDFQYEA LGVPRNRALA QNRNPAFHDL
GICGPFRDNL KTQTQYCAMF ATPSLRNVAT RRVFFHNGVY HSLDRVLAFY NLRSVDPGKI
YPRDASGRVL QYDDIPSAYR ANVDVADAPF DRKPGDAPAM TEQDMRDIVA FLNTLTDEKR