Gene BURPS668_2697 details


Gene Information

Locus tag: BURPS668_2697
Symbol: hemN
ID: 4885466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2669535
End bp: 2670986
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 66%
IMG OID: 640128625
Product: coproporphyrinogen III oxidase
Protein accession: YP_001059721
Protein GI: 126439546
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
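
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2669535
end_bp = 2670986

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 1452 bp
print(protein_length, "aa")   # 483 aa
```

Both results match the Gene Length and Protein Length fields of the record.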


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATGA CAACGCAATC GAACGACGGC GCGGATACGA CCGCGCCCGC TCCCCAACAC 
GACGTGTCCG CGTTCGCGGA CGTTCAGATC TCCGAAGCGC TGATTCGGCG CTTCGACCGG
CAAGGGCCGC GCTATACGTC CTATCCGACG GCCGACCGGT TCTCCGACGC ATTCGACGAG
CGCGCGTACC GCGAACATCT GTCGCGCCGC GCGTCAGCCG AGCGTAATCC GCCGCTGTCG
GTCTATCTGC ATCTGCCGTT TTGCGAGTCG CTCTGCTACT TCTGCGCGTG CAACAAGATC
ATCACGCAGG ATCACACCCG CACGAGCGCG TACGTCGACT ATCTGATCCG CGAAATGGAG
CTCGTCGCGC CGGATCTCGG CCGCGATCGG CGGACGACGC AACTGCATCT GGGCGGCGGC
TCGCCGACGT TCTTCGCGAT CGACGAGCTC GCGCGCCTGA TGCGCGCGCT GCGCGAGCAC
TTCGACTTCG CGCCGCACGC GGAGCTCGGC GTCGAGATCG ATCCGCGCAC GGTCAACGAG
CGCACCCTGC AGTCGCTCGC GGCGCTCGGC TTCAACCGGA CGAGCTTCGG CGTGCAGGAC
TTCGATCCGT CGGTGCAGGA GGCGGTGCAT CGGATCCAGC CGCTGCCGAT GGTCGAGCGC
GCGCTCGAGG CGAGCCGCGC GGCCGGTTTC GAATCGGTCA ACATCGATCT GATCTACGGG
TTGCCGCGGC AGACGCCCGC GAGCTTTTCG CGCACGCTCG ACGAGGTGAT CCGGCTGTCG
CCCGAGCGCA TCGCGGTCTA CAACTACGCG CATTTGCCGA GCCGCTTCAA GGCGCAGCGC
CTGATCGTCG AAGCGCAGCT GCCGCCCGCG GAAGACCGGC TGCGGATCTT CATCGAATCG
ACGCGGCGGC TGCTCGACGC GGGCTACGTG TACATCGGGC TCGATCACTT CGCGAAGCCG
AACGACGAGC TCGGCAACGC GCTGCGCGAG CGCAGCCTGC ACCGCAATTT CCAGGGCTAT
ACGACGCAGG CCGAATGCGA TCTCGTCGGC TTCGGCGTAT CGGCGATCGG CAAGGTCGGC
GCCTCGTACA GCCAGTCGAC GCGCTCGCTG AAGACCTACT ACCGCCAGCT CGACGCGGGG
CGCCTGCCGA TCGAGCGGGG CTTCGCGCTG ACAGCCGACG ATTTGCTGCG CCGCGAAGTC
ATCATGACGG TGATGTGCAG CACACCCGTC GATTTCGCGG AGATCGGCCA CAGGCACGGC
ATCGATTTCG CCCGGTATTT CGCGCCCGAG CTCGCGCAGC TCGAGCCGTA TCGCGACGCG
GGGCTGCTCA CGATCGATGC GCAGCGCATC GCCGTCACGC CGAAGGGGCG CATGTTCGTG
CGCGCGATCG GCATGGTGTT CGACGCGTAT CTCGGCCGCA GCGCCGCGGC GTCTTATTCG
AAATTGATCT AG
 
Protein sequence
MDMTTQSNDG ADTTAPAPQH DVSAFADVQI SEALIRRFDR QGPRYTSYPT ADRFSDAFDE 
RAYREHLSRR ASAERNPPLS VYLHLPFCES LCYFCACNKI ITQDHTRTSA YVDYLIREME
LVAPDLGRDR RTTQLHLGGG SPTFFAIDEL ARLMRALREH FDFAPHAELG VEIDPRTVNE
RTLQSLAALG FNRTSFGVQD FDPSVQEAVH RIQPLPMVER ALEASRAAGF ESVNIDLIYG
LPRQTPASFS RTLDEVIRLS PERIAVYNYA HLPSRFKAQR LIVEAQLPPA EDRLRIFIES
TRRLLDAGYV YIGLDHFAKP NDELGNALRE RSLHRNFQGY TTQAECDLVG FGVSAIGKVG
ASYSQSTRSL KTYYRQLDAG RLPIERGFAL TADDLLRREV IMTVMCSTPV DFAEIGHRHG
IDFARYFAPE LAQLEPYRDA GLLTIDAQRI AVTPKGRMFV RAIGMVFDAY LGRSAAASYS
KLI
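
The protein sequence follows from the gene sequence by codon-wise translation, and the GC content is the fraction of G and C bases. The following is a minimal sketch of both calculations in plain Python, not the method used to produce this record. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code (the two tables differ in permitted start codons, which this sketch does not model), and the function names `clean`, `gc_content` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table built in TCAG order. Assumption: table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGGACATGA CAACGCAATC GAACGACGGC GCGGATACGA CCGCGCCCGC TCCCCAACAC"
last_line = "AAATTGATCT AG"

print(translate(first_line))  # MDMTTQSNDGADTTAPAPQH
print(translate(last_line))   # KLI
```

The two printed fragments correspond to the first twenty and last three residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives the value to compare against the GC content field.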