Gene BURPS1106A_A1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1042 
Symbol 
ID4906091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1008763 
End bp1010373 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content71% 
IMG OID640144148 
Productdi-haem cytochrome c peroxidase family protein 
Protein accessionYP_001075078 
Protein GI126457905 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.023459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCCGCG CACGCTCGGC GCGGCGCCGC CCGCACGCGA CGCGCGCCCG ATACATCACC 
GATGCACGGC CGGCCGGCGC ACGCAACGCC GGCCGTCATT CCGGCGCGAC GCACGCCGGA
TTTTTCTTTT GCATGATCAC GACCGAACGC TCCAGCATGG CCGAACCGCT TTGCGCGCAA
CCCGCTCCGT CCACCCGATC CGACGCATGC GCGCCGGCCG CGCTTGCCAC CGTCTCGCGC
CGCCGCGGCC GCCGCAACGC GCGCGCGATG CGCCACGCGC CGGCGGCCGC CGCGTTCGGC
GTGCTCGGCT TCGCCGCGTT CGCGCTGGCG TTTCCCGAGC ATGTGCCGAA CGCGGTCGGC
GCGATCGTCG AAAACCTCAC GGGCGCGAAT CCGCAGCCGG TCGCGCTGCG CCGCCCGGGC
GCCGAGCCGC TGAGCGCGGT CGCGCAGCTC GGCCGCGCGC TGTTCTTCGA TCCCGCGCTG
TCCGCGTCGG GCCGGCAATC GTGTGCGTCG TGCCACAGCC CCGATCATGC GTACGGCCCG
CCGAACGATC TGGACGTGCA ACTGGGCGGC GCCGCGCTGA CGCAGCCCGG CTATCGGCCG
CCGCCGTCGC TGATGTATCT GTACCGGCAG CCGAACTTCA GCATCGGCCC GGACTCGTCC
GAGAACGACG ACGCGGCGAG CGTCGCGCAA CAGGCCGCAT CCGCGGCGGG CGCCGTGCGC
GCGCGGAAGA GCGCCGGCGC GGCGGCCGCG CCGCAGCTCG TGCCGCAGGG CGGGATGTTC
TGGGACGGCC GCGCGGATAC GCTGCAGCAG CAGGCGTTCG GCCCGTTGAT GAATCCGGTC
GAGATGGCGA ACGCGAGCAC CGGCGACGTC GCGCGCAAGC TCGCGCACGC GCGCTACGCG
CCGCGGTTCC GGCAGTTGTT CGGCCCGCGC ATCTTCGACG ACGCACGTCT TGCGGTGTCC
GAAGCGATGT TCGCGATCGC GCGCTACCAG GTGGAGGACC CGTCGTTCCA TCCGTATTCG
AGCAAGTACG ACCGCTGGCT CGAAGGCGAC GCGCGGCTCA CGCAGGCGGA GCTGCGCGGC
ATGCGGCGCT TCAACGATCC GAACAAGGCG AATTGCGCGG GCTGCCACCT GTCGAAGCCG
AGCGCGGACG GTCTGCCGCC GATGTTCACC GATTTCCAGT ACGAGGCGCT CGGCGTGCCG
CGCAACCGCG CGCTCGCGCA GAACCGCAAT CCGGCGTTCC ACGATCTCGG CATCTGCGGG
CCGTTTCGCG ACGACTTGAA GACGCAGACG CAATACTGCG CGATGTTCGC GACGCCTTCG
CTGCGCAACG TCGCGACGCG CCGCGTGTTC TTCCACAACG GCGTCTATCA TTCGCTCGAC
CGGGTGCTCG CGTTCTACAA CCTGCGCAGC GTCGATCCGG GCAAGATCTA TCCGCGCGAC
GCAAGCGGCC GGGTGCTGCA ATACGACGAC ATCCCGAGCG CGTATCGCGC GAACGTCGAC
GTCGCCGATG CGCCGTTCGA CCGCAAGCCG GGCGACGCGC CCGCGATGAC CGAGCAGGAC
ATGCGCGACA TCGTTGCGTT TCTGAACACG CTGACCGACG AGAAGCGCTG A
 
Protein sequence
MRRARSARRR PHATRARYIT DARPAGARNA GRHSGATHAG FFFCMITTER SSMAEPLCAQ 
PAPSTRSDAC APAALATVSR RRGRRNARAM RHAPAAAAFG VLGFAAFALA FPEHVPNAVG
AIVENLTGAN PQPVALRRPG AEPLSAVAQL GRALFFDPAL SASGRQSCAS CHSPDHAYGP
PNDLDVQLGG AALTQPGYRP PPSLMYLYRQ PNFSIGPDSS ENDDAASVAQ QAASAAGAVR
ARKSAGAAAA PQLVPQGGMF WDGRADTLQQ QAFGPLMNPV EMANASTGDV ARKLAHARYA
PRFRQLFGPR IFDDARLAVS EAMFAIARYQ VEDPSFHPYS SKYDRWLEGD ARLTQAELRG
MRRFNDPNKA NCAGCHLSKP SADGLPPMFT DFQYEALGVP RNRALAQNRN PAFHDLGICG
PFRDDLKTQT QYCAMFATPS LRNVATRRVF FHNGVYHSLD RVLAFYNLRS VDPGKIYPRD
ASGRVLQYDD IPSAYRANVD VADAPFDRKP GDAPAMTEQD MRDIVAFLNT LTDEKR