Gene BURPS1106A_A2566 details


Gene Information

Locus tag: BURPS1106A_A2566
Symbol: catB
ID: 4903647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2521118
End bp: 2522251
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 72%
IMG OID: 640145669
Product: muconate cycloisomerase
Protein accession: YP_001076596
Protein GI: 126457415
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID: [TIGR02534] muconate and chloromuconate cycloisomerases
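
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The function names are illustrative.

```python
# Values copied from the record above; names are illustrative only.
START_BP = 2521118
END_BP = 2522251


def gene_length(start, end):
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1134 377
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.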


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGCAA CAGGCATCAC GATCGACCGG ATCGACACGC TGCTCGTCGA CGTGCCGACA 
GTCCGGCCGC ACAAGCTTTC GGTGGCGACG ATGAACTGCC AGACGCTCGT GCTCGTGCGC
GTCCGATGCT CGGACGGTAT CGAGGGCGTC GGCGAAGGCA CGACGATCGG CGGTCTCGCG
TACGGCGAAG AAAGCCCCGA GAGCATCAAG ACGAACATCG ACGCCTATTT CGCGCCGATG
CTGCGAGGCG CGGACGCGAG CCGCCCGGGC GCCGCGATGG CGCGCGTGCG CAAGCTGCTC
CAGGGCAACC GCTTCGCGAA GTGCGCGCTC GAGACCGCGC TGTTCGACGC GCACGCGCGC
CGGCTCGGCG TGCCGCTGTC CGAATTGCTC GGCGGCAGGA CGACCGACGC GCTCGACGTC
GCGTGGACGC TCGCGAGCGG CGACACCGCG CGCGACATCG CGGAGGCTGA GGCGATGCTC
GAAGCGCGCC GCCATCGCGC GTTCAAGCTG AAGATCGGCG CGCGCGCGGT GGCCGACGAC
GTCGCGCATG TCGTCGCGAT CAAGCGCGCG CTCGGCGAGC GCGGCGACGT GCGCGTCGAC
GTGAACCAGG CATGGACCGA AAGCGAGGCC GTGTGGGCCG GCGCGCGGCT CGCGGACGCG
GGCGTGAGCC TCGTCGAGCA GCCGATCGCC GCGGCCAATC GCGCGGGCCT GAAGCGCCTC
ACCGCGCTCG CGCACATCCC GATCATGGCC GACGAGGCGC TGCACGGCCC CGTCGACGCA
TTCGCGCTCG CGCGCGAGCG CGCGGCCGAC GTGTTCGCGG TGAAGATCGC ACAATCGGGC
GGCCTGCAGG GCGCGGCCGC CGTCGCGGCG ATCGCCGCCG CGGCCGGCAT CGAACTGTAC
GGCGGCACGA TGCTCGAAGG CGCGGCCGGC ACGATCGCGT CCGCGCAACT GTTCAGCACG
TTCGGCGCGC TCGAGTGGGG CACCGAGCTG TTCGGCCCGC TGCTGCTGAC CGAGGAGATC
CTCGTCGAGC CGCTGCGCTA CGAGGATTTC AAGCTGCACC TGCCGAGCGC CCCCGGCCTC
GGCATCGCTT TCGACTGGGC CCGTATCGAG CGGATGCAAC GCCGGGCCCG CTGA
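
A GC content figure such as the one reported in the gene information can be recomputed from a sequence laid out in 10-base groups like the block above. This is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips whitespace and counts G and C bases. Only the first line of the gene sequence is used here for brevity, so the full sequence must be pasted in to compare against the reported value.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) is ignored so the 10-base grouped
    layout can be pasted in directly.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First line of the gene sequence only; not the whole gene.
first_line = "ATGATAGCAA CAGGCATCAC GATCGACCGG ATCGACACGC TGCTCGTCGA CGTGCCGACA"
print(round(gc_content(first_line), 1))
```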
 
Protein sequence
MIATGITIDR IDTLLVDVPT VRPHKLSVAT MNCQTLVLVR VRCSDGIEGV GEGTTIGGLA 
YGEESPESIK TNIDAYFAPM LRGADASRPG AAMARVRKLL QGNRFAKCAL ETALFDAHAR
RLGVPLSELL GGRTTDALDV AWTLASGDTA RDIAEAEAML EARRHRAFKL KIGARAVADD
VAHVVAIKRA LGERGDVRVD VNQAWTESEA VWAGARLADA GVSLVEQPIA AANRAGLKRL
TALAHIPIMA DEALHGPVDA FALARERAAD VFAVKIAQSG GLQGAAAVAA IAAAAGIELY
GGTMLEGAAG TIASAQLFST FGALEWGTEL FGPLLLTEEI LVEPLRYEDF KLHLPSAPGL
GIAFDWARIE RMQRRAR
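
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record. It builds the codon table from the usual TCAG-ordered amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which matters little here because this gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGATAGCAA CAGGCATCAC GATCGACCGG"))  # prints: MIATGITIDR
print(translate("CGGATGCAAC GCCGGGCCCG CTGA"))        # prints: RMQRRAR
```

The two outputs correspond to the first ten and last seven residues of the protein sequence listed above.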