Gene BURPS1106A_A2567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2567 
SymbolcatA 
ID4904748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2522274 
End bp2523188 
Gene Length915 bp 
Protein Length304 aa 
Translation table11 
GC content68% 
IMG OID640145670 
Productcatechol 1,2-dioxygenase 
Protein accessionYP_001076597 
Protein GI126458110 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3485] Protocatechuate 3,4-dioxygenase beta subunit 
TIGRFAM ID[TIGR02439] catechol 1,2-dioxygenase, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAGACGC AAATGAACAA GGAAGCCATC GACGCCCTGC TGAAATCGTT CGACGACGCC 
GCGACGCGGC CCGGCAACCC GCGCGTGCGC GCGATCGTCA ACCGAATCGT GAAGGACCTG
TTCTACACGA TCGAGGATTT CGACGTGCAG CCGAGCGAAT TCTGGACCGC GCTCAACTAC
CTGAACGATG CCGGCAAGGA ATTCGGCCTC ATCGCGGCTG GCCTCGGCTT CGAGCGCTTC
CTCGACGTGC GGATGGACGA AGCCGAAGCG AAGGCGGGCC TGCAGGGCGG CACGCCGCGC
ACGATCGAGG GGCCGCTGTA CGTCGCGGGC GCGCCGGAGT CGGTCGGCCA CGCGCGGCTC
GACGACGGCA CCGATCCGGG CGAGACGCTC GTGATGCGCG GCCGCGTGCT CGGCGGCAAC
GGCGCGCCGC TTGCGAACGC GCTCGTCGAA GTCTGGCATG CGAACCATCT CGGCAATTAT
TCGTACTTCG ATCCGTCTCA GCCCGCGTTC AACCTGCGCC GCTCGATCCG CACCGACGCC
GAAGGCCGCT ACAGCTTCCG CAGCGTGCTG CCCGTCGGCT ACAGCGTGCC GCCCGGCAGC
AAGACCGAGC AATTGCTCGA CCAGCTCGGC CGCCACGGCC ACCGCCCCGC GCACATCCAC
TTCTTCGTGT CGGCGCCCGG CCATCGCAAG CTGACGACGC AGATCAACAT CGAGGGCGAT
CCGCACATCT GGGACGACTT CGCGTTCGCG ACGCGCGAGG GGCTGATTCC GAAGATCGCG
CAGGCGGAAG GCGCGCAGGG CAAGCCGTAC GGCATCGACG GCCGGTTCGC GCTGATCGAC
TTCGATTTCA CGCTGACGCG CGAGCGCGGC GACGTGCCGG CGAGCGAAGT CGAGCGCGTG
CGCGCGCAAG CGTGA
 
Protein sequence
METQMNKEAI DALLKSFDDA ATRPGNPRVR AIVNRIVKDL FYTIEDFDVQ PSEFWTALNY 
LNDAGKEFGL IAAGLGFERF LDVRMDEAEA KAGLQGGTPR TIEGPLYVAG APESVGHARL
DDGTDPGETL VMRGRVLGGN GAPLANALVE VWHANHLGNY SYFDPSQPAF NLRRSIRTDA
EGRYSFRSVL PVGYSVPPGS KTEQLLDQLG RHGHRPAHIH FFVSAPGHRK LTTQINIEGD
PHIWDDFAFA TREGLIPKIA QAEGAQGKPY GIDGRFALID FDFTLTRERG DVPASEVERV
RAQA