Gene BMAA0334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA0334 
Symbolgcp 
ID3087205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp321404 
End bp322444 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content71% 
IMG OID637564259 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_105141 
Protein GI53716439 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTTC TCGGCATCGA AAGCTCCTGC GACGAAACCG GCCTCGCGCT CTACGACACC 
GAGCGCGGCC TGCTCGCGCA CGCGCTTCAC TCGCAGATCG CGATGCACCG CGAATACGGC
GGTGTCGTTC CCGAGCTCGC GTCGCGCGAC CACATTCGCC GCGCGCTGCC GCTGCTCGAA
GAGGTGCTCG CCGCAAGCGG CGCGCGCCGC GACGACATCG ACGCGATCGC GTTCACGCAG
GGGCCCGGCC TCGCGGGCGC GCTGCTCGTC GGCGCGAGCA TCGCGAACGC GCTCGCGTTC
GCGTGGGACA AGCCGACCAT CGGCATCCAC CACCTCGAAG GGCATCTGCT GTCGCCGCTG
CTCGTCGCCG AGCCGCCGCC GTTTCCGTTC GTCGCGCTGC TCGTGTCGGG CGGCCATACG
CAACTGATGC GCGTGAGCGA CGTCGGCGTC TACGAGACGC TCGGCGAGAC GCTCGACGAT
GCAGCCGGCG AAGCGTTCGA CAAGACCGCG AAGCTGCTCG GCCTCGGCTA TCCGGGCGGG
CCGGAGGTAT CGAGGCTCGC GGAAGCCGGC ACCCCGGGCG CGGTCGTGCT GCCGCGGCCG
ATGCTTCATT CGGGGGATCT CGACTTCAGC TTCAGCGGGC TGAAGACCGC CGTGCTCACG
CAAATGAAGA AGCTCGAAGC GGCGCACGCG GGCGGCGCCG TGCTCGAGCG GGCGAAGGCG
GATCTCGCGC GCGGCTTCGT CGACGCGGCC GTCGACGTGC TCGTCGCGAA GTCGCTCGCC
GCGTTGAAGG CGACGCGGCT CAAGCGGCTC GTCGTCGCCG GCGGCGTGGG CGCGAACCGG
CAATTGCGCG CGGCGCTGTC GGCCGCCGCC CAAAAGCGCG GCTTCGACGT CCATTATCCC
GATCTCGCGC TCTGCACCGA CAACGGCGCG ATGATCGCGC TCGCGGGCGC GCTGCGGCTC
GCGCGCTGGC CGTCGCAGGC GAGCCGCGAT TACGCGTTCA CGGTGAAGCC GCGCTGGGAT
CTCGCGTCGC TCGCGCGATA G
 
Protein sequence
MLVLGIESSC DETGLALYDT ERGLLAHALH SQIAMHREYG GVVPELASRD HIRRALPLLE 
EVLAASGARR DDIDAIAFTQ GPGLAGALLV GASIANALAF AWDKPTIGIH HLEGHLLSPL
LVAEPPPFPF VALLVSGGHT QLMRVSDVGV YETLGETLDD AAGEAFDKTA KLLGLGYPGG
PEVSRLAEAG TPGAVVLPRP MLHSGDLDFS FSGLKTAVLT QMKKLEAAHA GGAVLERAKA
DLARGFVDAA VDVLVAKSLA ALKATRLKRL VVAGGVGANR QLRAALSAAA QKRGFDVHYP
DLALCTDNGA MIALAGALRL ARWPSQASRD YAFTVKPRWD LASLAR