Gene BURPS668_0771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0771 
Symbol 
ID4884028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp750989 
End bp752134 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content75% 
IMG OID640126699 
Productcytochrome c oxidase, subunit II 
Protein accessionYP_001057823 
Protein GI126438974 
COG category[C] Energy production and conversion 
COG ID[COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2 
TIGRFAM ID[TIGR02866] cytochrome c oxidase, subunit II 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0315373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGCG CAAACATGTC GGTGAAGCCC GCACAGCGGC CCGAGCGCGG GTTCGCGAAG 
ACGCGGCGGC GCACCGCCGC GTATCGCCCC GCCGCGACGC GCGCGGCGGG CGTCGCGCGC
GTCGCGCGCG CGGCGGGCGC GGCGACACCG TCACTCGCCG CCGCGCTGCA CGCCGCCGCC
GCGCACGCGC AAGCGCACGC CGCCCAGCCC GCCGTGCTGC CGCTCGCGTA CGTGTTCGAC
AGTGCGGGCC CCGCCGCGCG GCCCGTGCTG ATCCTCGGCT GGGCGCTGCT CGCGCTGTGC
ACCTCGGTCT GCGTCGTGAT CGCGGTCCTG CTCGCACTCG CGTTGTTCAG GCGGCGCGCC
GCGACGGCCG GCCTCACCGA GCGCGGCGGG CTCGGCTTCG TCTACGTCGG CACCGCGATC
TCGACCGCGC TGCTGCTCGC CGCGCTCGTC TACATGCTGT GGGTGCTCGC CGCGGTCGCG
AAGCCGCCGC GCCCGCCCGC GGTGACGATC GCGGTCACGG CGTACGACTG GTGGTGGAAG
GCCGACTACG GCGGCGGCCC GCCCGACGGC TTCACGACCG CGAACGAACT GCACGTGCCC
GTCGGCGAAC CGGTGCTGAT CGAGCTGCGC AGCGCCGACG TGATTCATGC GTTCTGGGCG
CCGCAACTCG CGGGCAAGAC GCAGGCGATT CCCGGCCAGA TCAATCGTCA ATGGATGCAG
GCGGACCGGC CGGGCGTCTA TCGCGGGCAG TGCACGCAGT TCTGCGGCGC GCAGCACGCG
CAGATGGGCT TCGAAATCGT CGCCGAACCG CCCGACGCGT ACCGGCGCTG GTACGCGTCG
CAGCGGCGCG GCGCCGAAGC GCCGCGCACG GCCGACGCGC TGCGCGGCCA GCGAATCTTC
GCCGATCGCT GCGCGGGCTG CCACGCGGTG CGCGGCACCG GCGCGGCGGG CACGCAGGCG
CCCGATCTCA CGCATGTCGG CGCGCGCCGC CTGCTCGCGG CGGGCGCGCT CGCGAACACG
CCGGACGAGC TGCGCCGCTG GATCGCCGAT GCGCAGCAGG TGAAGCCACA GTCACTGATG
CCGTCGATCC GGCTCGACCC CGCGCAGCAG CGCGACCTGT CCGCGTATCT GGCAACGCTG
CGATGA
 
Protein sequence
MNGANMSVKP AQRPERGFAK TRRRTAAYRP AATRAAGVAR VARAAGAATP SLAAALHAAA 
AHAQAHAAQP AVLPLAYVFD SAGPAARPVL ILGWALLALC TSVCVVIAVL LALALFRRRA
ATAGLTERGG LGFVYVGTAI STALLLAALV YMLWVLAAVA KPPRPPAVTI AVTAYDWWWK
ADYGGGPPDG FTTANELHVP VGEPVLIELR SADVIHAFWA PQLAGKTQAI PGQINRQWMQ
ADRPGVYRGQ CTQFCGAQHA QMGFEIVAEP PDAYRRWYAS QRRGAEAPRT ADALRGQRIF
ADRCAGCHAV RGTGAAGTQA PDLTHVGARR LLAAGALANT PDELRRWIAD AQQVKPQSLM
PSIRLDPAQQ RDLSAYLATL R