Gene BURPS668_A1890 details


Gene Information

Locus tag: BURPS668_A1890
Symbol:
ID: 4887905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1843316
End bp: 1844938
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 74%
IMG OID: 640131828
Product: di-haem cytochrome c peroxidase
Protein accession: YP_001062885
Protein GI: 126443375
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
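
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1843316
END_BP = 1844938


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp, has_stop_codon=True):
    """Number of residues encoded by a CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # Assumption: the terminal stop codon is part of the CDS but not the protein.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1623 540
```

Both results match the Gene Length and Protein Length values in the record.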


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGGGCG GCAAGCGAAG CGGCGCGGCG CGGCGCGCGG GGGCGGCGTT CGCCGCGCGC 
GGGGCCGCCC TCGCGGGCGC GCCCGGCGCG AGGAATGCAC GCCGCATCGT TTCGACCCTC
GCCATCGCGG CCGTCACGCG GGCGGCCGGC GGCGCGCTCG CCGCGTGCGC ATCGGCGATC
GCCTTCGCGT CCGGCGCCGC CGCGCCGGGC GCGCTCGACG CAGTGCGCGC GGCACACCCG
GCGAGTTCGC TGAGCCCGGC GCGCACGCCG GGCGCCGGCG GCGCGGCTCA TGTGAAGCCC
GTGCAGGAGG CGCTCCGGGC AAAGGCCGCC TCGCCTTCGC CCTCGCTTTT GCCTTCCCCC
CCGTCGCCGA CGACGTCCCT GCTTCCCGGC GCGCCGCCGC AGCGCGTCGT CGCCACGATC
GGCCGAGGCA CGCCGCAGGT CGCGTCGAAA GTCGACCCGA CCGCGGCCGC GTTCCATCCG
GACCCGGCGC TCGCCGCCCT CGGCAAGCGC GTGTTCTTCG ATCCGGCGTT ATCGGAGCCG
CGCGGCACGT CGTGCGCGAG CTGCCACGAT CCGGGCCGCG CATTCGCGCC GACGCTCTCG
CGCGCGGCGC TCGCCGGCCC GCGCGTGCCG CAGGGCAGCC GCCCCGGGCA TTTCAGCCGC
CGCAACGCGC CGTCGCTGCT GTACGTGCGC TACGTGCCGC GCCGCCATTT CTATCAGGAC
GACGACGCGC TCGCCCCCGC CCCGTTCGGC GGCTTGTTCT CGGACGGCCG CGCCGACACG
CTCGCCGAGC AGTTGCGCGG CCCGCTCTTC GATCCGGACG AGATGAACAA CGCGTCGCCC
GCGGCGCTCA CCCGCAAGAT CGGCGGCACC GCACTCGGCG CGGCGCTCGC CGAACGCTTC
GGCCCGTCGG TGCGCCGCGA TCCCGAACGC ATGGTGCGCG CGCTCGGCGA AGCGATGCAG
GCGTACCTGC AAAGCGACGA GATGGCGCCG TTCTCGTCGC GCTACGACGC GTACGTGATG
CAACGCGCGC CGCTCACGCC GCAGGAGAAG CGCGGGCTCG CGCTCTTCAG GAATCCGGAC
AAAGGCAACT GCATGAGTTG CCACACGCTG TCGGACACCG CGAGCCGGCC CGAGCGCTCG
CTCTTCACCG ACTTCGGCTA CGACGCGATC GCGGTGCCGC GCAATCGCGC GCTGCGTGCG
AACCGCGACC CGCGCCACTT CGACAACGGC CTGTGCGACA CCGCCGCGAA GCTGCGCTGG
CCCGAGCCGG CGCAATGGTG CGGCTATCTG CGCACGCCCG GCCTGCGCAA CGTCGCGATC
AAGGAGTCGT TCATGCACAA CGGCGTGTTC GACACGCTGC GCGATGCGGT GGCGTTCTAC
AACACGCGCT CGACGGATCC GAAGCGCTGG TATCACGGCC GCGATACGTT CGACGACGTG
CCGGCCGCGT ACCGCAGCAA CATCAACGTG AACTCGACGC CGATGAACCG CCGAGCCGGC
ACGCCACCCG CGATGACGGA CGCCGACGTC GACGACATCG TCGCGTTCCT GCGCACGCTG
ACGGACGCCC GCTACGTCGG GCTGATGCCC GCGGCGCCCG ACGGCAAGGC GGCGCGACCG
TGA
 
Protein sequence
MTGGKRSGAA RRAGAAFAAR GAALAGAPGA RNARRIVSTL AIAAVTRAAG GALAACASAI 
AFASGAAAPG ALDAVRAAHP ASSLSPARTP GAGGAAHVKP VQEALRAKAA SPSPSLLPSP
PSPTTSLLPG APPQRVVATI GRGTPQVASK VDPTAAAFHP DPALAALGKR VFFDPALSEP
RGTSCASCHD PGRAFAPTLS RAALAGPRVP QGSRPGHFSR RNAPSLLYVR YVPRRHFYQD
DDALAPAPFG GLFSDGRADT LAEQLRGPLF DPDEMNNASP AALTRKIGGT ALGAALAERF
GPSVRRDPER MVRALGEAMQ AYLQSDEMAP FSSRYDAYVM QRAPLTPQEK RGLALFRNPD
KGNCMSCHTL SDTASRPERS LFTDFGYDAI AVPRNRALRA NRDPRHFDNG LCDTAAKLRW
PEPAQWCGYL RTPGLRNVAI KESFMHNGVF DTLRDAVAFY NTRSTDPKRW YHGRDTFDDV
PAAYRSNINV NSTPMNRRAG TPPAMTDADV DDIVAFLRTL TDARYVGLMP AAPDGKAARP
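
The protein sequence follows from the gene sequence by codon-by-codon translation with translation table 11. As a minimal sketch, the code below translates the first 30 bases of the gene and also shows how a GC fraction could be computed. The helper names are hypothetical, and `START_CODONS` is a simplified, assumed subset of the start codons allowed in bacteria. A start codon in the first position is read as M, which is why the leading GTG yields methionine rather than valine.

```python
# Sketch only: helper names are hypothetical, not part of any database API.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (shared by NCBI tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a simplified subset of the start codons used in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Drop the spaces and line breaks used in the listing above."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS, reading a leading start codon as M and stopping at '*'."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "GTGACGGGCG GCAAGCGAAG CGGCGCGGCG"
print(translate(first_30))  # MTGGKRSGAA
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.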