Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2711 |
Symbol | catA |
ID | 4887483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2595079 |
End bp | 2595993 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132647 |
Product | catechol 1,2-dioxygenase |
Protein accession | YP_001063703 |
Protein GI | 126443868 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | [TIGR02439] catechol 1,2-dioxygenase, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.999962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGACGC AAATGAACAA GGAAGCCATC GACGCCCTGC TGAAATCGTT CGACGACGCC GCGACGCGGC CCGGCAACCC GCGCGTGCGC GCGATCGTCA ACCGAATCGT GAAGGACCTG TTCTACACGA TCGAGGATTT CGACGTGCAG CCGAGCGAAT TCTGGACCGC GCTCAACTAC CTGAACGATG CCGGCAAGGA ATTCGGCCTC ATCGCGGCCG GCCTCGGCTT CGAGCGCTTC CTCGACGTGC GGATGGACGA AGCCGAAGCG AAGGCGGGCC TGCAGGGCGG CACGCCGCGC ACGATCGAGG GGCCGCTGTA CGTCGCGGGC GCGCCGGAGT CGGTCGGCCA CGCGCGGCTC GACGACGGCA CCGATCCGGG CGAGACGCTC GTGATGCGCG GCCGCGTGCT CGGCGGCAAC GGCGCGCCGC TTGCGAACGC GCTCGTCGAA GTCTGGCATG CGAACCATCT CGGCAATTAT TCGTACTTCG ATCCGTCTCA GCCCGCGTTC AACCTGCGCC GCTCGATCCG CACCGACGCC GAAGGCCGCT ACAGCTTCCG CAGCGTGCTG CCCGTCGGCT ACAGCGTGCC GCCCGGCAGC AAGACCGAGC AATTGCTCGA CCAGCTCGGC CGCCACGGCC ACCGCCCCGC GCACATCCAC TTCTTCGTAT CGGCGCCCGG CCATCGCAAG CTGACGACGC AGATCAACAT CGAGGGCGAT CCGCACATCT GGGACGACTT CGCGTTCGCG ACGCGCGAGG GGCTGATTCC GAAGATCGCG CAGGCGGAAG GCGCGCAGGG CAAGCCGTAC GGCATCGACG GCCGGTTCGC GCTGATCGAC TTCGATTTCA CGCTGACGCG CGAGCGCGGC GACGTGCCGG CGAGCGAAGT CGAGCGCGTG CGCGCGCAAG CGTGA
|
Protein sequence | METQMNKEAI DALLKSFDDA ATRPGNPRVR AIVNRIVKDL FYTIEDFDVQ PSEFWTALNY LNDAGKEFGL IAAGLGFERF LDVRMDEAEA KAGLQGGTPR TIEGPLYVAG APESVGHARL DDGTDPGETL VMRGRVLGGN GAPLANALVE VWHANHLGNY SYFDPSQPAF NLRRSIRTDA EGRYSFRSVL PVGYSVPPGS KTEQLLDQLG RHGHRPAHIH FFVSAPGHRK LTTQINIEGD PHIWDDFAFA TREGLIPKIA QAEGAQGKPY GIDGRFALID FDFTLTRERG DVPASEVERV RAQA
|
| |