Gene BURPS668_A1461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1461 
SymbolkatB 
ID4885695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1357843 
End bp1359378 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content66% 
IMG OID640131400 
Productcatalase 
Protein accessionYP_001062458 
Protein GI126444043 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.627836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCCC GTCATTCCAT GCGCTGGTTG AGCCGCGCGC TCGTTGCGGT GTCGGCGTCG 
GGCGCGCTCG CCGCGCATGC GTCGACGCTC ACGCGCGACA ACGGCGCTCC CGTCGGCGAC
AACCAGAATT CGCAGACCGC GGGCGCGAAC GGGCCGGTCC TGTTGCAGGA TGGCCACCTG
ATCCAGAAGC TGCAGCGTTT CGATCGTGAG CGCATTCCCG AGCGCGTCGT GCATGCGCGG
GGCACGGGCG CGCACGGCGT GTTCGTCGCG ACCCGCGACA TCTCGGATCT CACGCGGGCG
AAGGTGTTCG AGCCCGGCAC GCAGACGCCC GTTTTCGTGC GCTTTTCCAG CGTGATCCAC
GGCGGCACGT CGCCCGAAAC GCTGCGGGAC CCGCGCGGTT TCGCCACGAA GTTCTACACG
GCGGAGGGCA ACTGGGATCT CGTCGGCAAT AATCTGCCGG TTTTCTTCAT CCGCGACGCG
ATGAAGTTCC CGGACATGGT GCATTCGCTG AAGCCGGCGC CGGACACGAA CATTCAGGAC
CCCGACCGAT TCTTCGATTT CTTCTCGCAC CAGCCGGAGG CGACGCACAT GATCACGCGC
GTCTATTCGG ACGCCGGCAC GCCGGCGAGT TACCGCGAGA TGGACGGCAA CAGCGTGCAC
GCGTACAAGT TCGTCAACGC CCGTGGCGGC GTGACCTACG TGAAGTTCCA CTGGAAGAGC
CTCCAGGGGC AAAAGAACCT GACTGCCGCG CAGGCCGAAG CGATCCAGGG CAAGGATTTC
AACCACATGA CGCGCGACCT GATCGCGGCG ATCGATGCCG GCCGGTATCC GAAGTGGGAT
CTCTATGTCC AGACGCTGAA GCCCGATCAG CTCGACCAGT TCGCGTTCGA TCCGCTCGAC
GCGACGAAAG TCTGGCCCGG CGTGCCCGAG GTGAAGATCG GCACGATGAC GCTCAACAGG
AATCCCGGCA ACGTGTTCCA GGAAACCGAG CAGGCGGCGT TTGCGCCGTC GAATCTCGTG
CCGGGCATCG AGCCGTCCGA GGACCGGCTG CTGCAAGGGC GCCTGTTCGC GTATGCGGAT
ACGCAGCTTC ATCGCGTCGG CGTGAACGGG GCGCAACTGC CGGTGAACCG GCCGCGCGCG
CCCGTCACCA ACTACAACCG GGACGGCGCG ATGAACGGCG GCGCGGCGCG CGGCACGGTC
AATTACGAGC CCGGCGCGCA AGCCGCGTTG GCCGCCGATC CGGCGTTCGC GGCGAGCCGC
GCGCCGCTGG CGGGCTCGAC CCAGCAGGCG CGCATCGCGA AGACGCGCAA TTTCGATCAG
GCGGGCGCGT TCTACCGTTC ACTGAGCGCG AGCGAGCGTG CGAACCTGGT CGCCAATCTG
GCCGGCGATC TGAAGCAGGT GCGAAACGAC GGCGTCAAAT ACACGATGCT GTCGTATTTC
CAGAAGGCCG ATGCGGAATA TGGGCGCAAG GTGACGGCGG CGCTCGGCGC GGACCAGGGC
CGTGTCGATG CGCTGACCGC CAAGCTCGCC GATTGA
 
Protein sequence
MISRHSMRWL SRALVAVSAS GALAAHASTL TRDNGAPVGD NQNSQTAGAN GPVLLQDGHL 
IQKLQRFDRE RIPERVVHAR GTGAHGVFVA TRDISDLTRA KVFEPGTQTP VFVRFSSVIH
GGTSPETLRD PRGFATKFYT AEGNWDLVGN NLPVFFIRDA MKFPDMVHSL KPAPDTNIQD
PDRFFDFFSH QPEATHMITR VYSDAGTPAS YREMDGNSVH AYKFVNARGG VTYVKFHWKS
LQGQKNLTAA QAEAIQGKDF NHMTRDLIAA IDAGRYPKWD LYVQTLKPDQ LDQFAFDPLD
ATKVWPGVPE VKIGTMTLNR NPGNVFQETE QAAFAPSNLV PGIEPSEDRL LQGRLFAYAD
TQLHRVGVNG AQLPVNRPRA PVTNYNRDGA MNGGAARGTV NYEPGAQAAL AADPAFAASR
APLAGSTQQA RIAKTRNFDQ AGAFYRSLSA SERANLVANL AGDLKQVRND GVKYTMLSYF
QKADAEYGRK VTAALGADQG RVDALTAKLA D