Gene BURPS1106A_A1805 details


Gene Information

Locus tag: BURPS1106A_A1805
Symbol:
ID: 4905495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1777106
End bp: 1778728
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 74%
IMG OID: 640144911
Product: di-haem cytochrome c peroxidase
Protein accession: YP_001075839
Protein GI: 126456552
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
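
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1777106
END_BP = 1778728


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1623
print(protein_length(length_bp))  # 540
```

Both results agree with the Gene Length and Protein Length fields of the record.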


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.231553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGGGCG GCAAGCGAAG CGGCGCGGCA CGGCGCGCGG GGGCGGCGTT CGCCGCGCGC 
GGGCCCGCCC TCGCGGGCGC GCCCGGCGCG AGGAATGCAC GCCGCATCGT TTCGACCCTC
GCCATCGCGG CCGTCACGCG GGCGGCCGGC GGCGCGCTCG CCGCGTGCGC ATCGGCGATC
GCCTTCGCGT CCGGCGCCGC CGCGCCGGGC GCGCTCGACG CAGTGCGCGC GGCACACCCG
GCGAGTTCGC TGAGCCCGGC GCGCACGCCG GGCGCCGGCG GCGCGGCCCA TGTGAAGCCC
GTGCAGGAAG CGCTCCGGGC AAAGGCCGCC TCGCCTTCGC CCTCGCTTTT GCCTTCCCCC
CCGTCGCCGA CGACGTCCCT GCTTCCCGGC GCGCCGCCGC AGCGCGTCGT CGCCACGATC
GGCCGAGGCA CGCCGCAGGT CGCGTCGAAA GTCGACCCGA CCGCGGCCGC GTTCCATCCG
GACCCGGCGC TCGCCGCCCT CGGCAAGCGC GTGTTCTTCG ATCCGGCGTT ATCGGAGCCG
CGCGGCACGT CGTGCGCGAG CTGCCACGAT CCGGGCCGCG CATTCGCGCC GACGCTCTCG
CGCGCGGCGC TCGCCGGCCC GCGCGTGCCG CAGGGCAGCC GCCCCGGGCA TTTCAGCCGC
CGCAACGCGC CGTCGCTGCT GTACGTGCGC TACGTGCCGC GCCGCCATTT CTATCAGGAC
GACGACGCGC TCGCCCCCGC CCCGTTCGGC GGCTTGTTCT CGGACGGCCG CGCCGACACG
CTCGCCGAGC AGTTGCGCGG CCCGCTCTTC GATCCGGACG AGATGAACAA CGCGTCGCCC
GCGGCGCTCA CCCGCAAGAT CGGCGGCACC GCACTCGGCG CGGCGCTCGC CGAACGCTTC
GGCCCGTCGG TGCGCCGCGA TCCCGAACGC ATGGTGCGCG CGCTCGGCGA AGCGATGCAG
GCGTACCTGC AAAGCGACGA GATGGCGCCG TTCTCGTCGC GCTACGACGC GTACGTGATG
CGACGCGCGC CGCTCACGCC GCAGGAGAGG CGCGGGCTCG CGCTCTTCAG GAATCCGGAC
AAAGGCAACT GCATGAGTTG CCACACGCTG TCGGACACCG CGAGCCGGCC CGAGCGCTCG
CTCTTCACCG ACTTCGGCTA CGACGCGATC GCGGTGCCGC GCAATCGCGC GCTGCGTGCG
AACCGCGACC CGCGCCACTT CGACAACGGC CTGTGCGACA CCGCCGCGAA GCTGCGCTGG
CCCGAGCCGG CGCAATGGTG CGGCTATCTG CGCACGCCCG GCCTGCGCAA CGTCGCGATC
AAGGAGTCGT TCATGCACAA CGGCGTGTTC GACACGCTGC GCGATGCGGT GGCGTTCTAC
AACACGCGCT CGACGGATCC GAAGCGCTGG TATCACGGCC GCGATACGTT CGACGACGTG
CCGGCCGCGT ACCGCGGCAA CATCAACGTG AACTCGACGC CGATGAACCG CCGAGCCGGC
ACGCCACCCG CGATGACGGA CGCCGACGTC GACGACATCG TCGCGTTCCT GCGCACGCTG
ACGGACGCCC GCTACGTCGG GCTGATGCCC GCGGCGCCCG ACGGCAAGGC GGCGCGACCG
TGA
 
Protein sequence
MTGGKRSGAA RRAGAAFAAR GPALAGAPGA RNARRIVSTL AIAAVTRAAG GALAACASAI 
AFASGAAAPG ALDAVRAAHP ASSLSPARTP GAGGAAHVKP VQEALRAKAA SPSPSLLPSP
PSPTTSLLPG APPQRVVATI GRGTPQVASK VDPTAAAFHP DPALAALGKR VFFDPALSEP
RGTSCASCHD PGRAFAPTLS RAALAGPRVP QGSRPGHFSR RNAPSLLYVR YVPRRHFYQD
DDALAPAPFG GLFSDGRADT LAEQLRGPLF DPDEMNNASP AALTRKIGGT ALGAALAERF
GPSVRRDPER MVRALGEAMQ AYLQSDEMAP FSSRYDAYVM RRAPLTPQER RGLALFRNPD
KGNCMSCHTL SDTASRPERS LFTDFGYDAI AVPRNRALRA NRDPRHFDNG LCDTAAKLRW
PEPAQWCGYL RTPGLRNVAI KESFMHNGVF DTLRDAVAFY NTRSTDPKRW YHGRDTFDDV
PAAYRGNINV NSTPMNRRAG TPPAMTDADV DDIVAFLRTL TDARYVGLMP AAPDGKAARP
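
The gene and protein sequences above can be cross-checked with a short script. The sketch below is a minimal illustration, not the tool that produced this record. It strips the 10-character block spacing, computes the GC fraction, and translates a CDS with the amino-acid assignments of NCBI translation table 11. It assumes the first codon is read as Met whenever it is one of the table 11 start codons. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in NCBI tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an alternative start codon still initiates with Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGGGCG GCAAGCGAAG CGGCGCGGCA"))  # MTGGKRSGAA
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` allows comparison against the GC content field and the protein sequence listed in this record.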