Gene BURPS1106A_A2389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2389 
Symbolgcp 
ID4905698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2366206 
End bp2367285 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content71% 
IMG OID640145494 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001076421 
Protein GI126457675 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCACGC GCGCGTCGAT GCGGCCGCCG CACACCATCA TGCTCGTTCT CGGCATCGAA 
AGCTCCTGCG ACGAAACCGG CCTCGCGCTC TACGACACCG AGCGCGGCCT GCTCGCGCAC
GCGCTTCACT CGCAGATCGC GATGCACCGC GAATACGGCG GTGTCGTTCC CGAGCTCGCG
TCGCGCGACC ACATTCGCCG CGCGCTGCCG CTGCTCGAAG AGGTGCTCGC CGCAAGCGGC
GCGCGCCGCG ACGACATCGA CGCGATCGCG TTCACGCAGG GGCCCGGCCT CGCGGGCGCG
CTGCTCGTCG GCGCGAGCAT CGCGAACGCG CTCGCGTTCG CGTGGGACAA GCCGACCATC
GGCATCCACC ACCTCGAAGG GCATCTGCTG TCGCCGCTGC TCGTCGCCGA GCCGCCGCCG
TTTCCGTTCG TCGCGCTGCT CGTGTCGGGC GGCCATACGC AACTGATGCG CGTGAGCGAC
GTCGGCGTCT ACGAGACGCT CGGCGAGACG CTCGACGATG CCGCCGGCGA AGCGTTCGAC
AAGACCGCGA AGCTGCTCGG CCTCGGCTAT CCGGGCGGGC CGGAGGTATC GAGGCTCGCG
GAAGCCGGCA CCCCGGGCGC GGTCGTGCTG CCGCGGCCGA TGCTTCATTC GGGGGATCTC
GACTTCAGCT TCAGCGGGCT GAAGACCGCC GTGCTCACGC AAATGAAGAA GCTCGAAGCG
GCGCACGCGG GCGGCGCCGT GCTCGAACGG GCGAAGGCGG ATTTCGCGCG CGGCTTCGTC
GACGCGGCCG TCGACGTGCT CGTCGCGAAG TCGCTCGCCG CGTTGAAGGC GACGCGGCTC
AAGCGGCTCG TCGTCGCCGG CGGCGTGGGC GCGAACCGGC AATTGCGCGC GGCGCTGTCG
GCCGCCGCCC AAAAGCGCGG CTTCGACGTC CATTATCCCG ATCTCGCGCT CTGCACCGAC
AACGGCGCGA TGATCGCGCT CGCGGGCGCG CTGCGGCTCG CGCGCTGGCC GTCGCAGGCG
AGCCGCGATT ACGCGTTCAC GGTGAAGCCG CGCTGGGATC TCGCGTCGCT CGCGCGATAG
 
Protein sequence
MRTRASMRPP HTIMLVLGIE SSCDETGLAL YDTERGLLAH ALHSQIAMHR EYGGVVPELA 
SRDHIRRALP LLEEVLAASG ARRDDIDAIA FTQGPGLAGA LLVGASIANA LAFAWDKPTI
GIHHLEGHLL SPLLVAEPPP FPFVALLVSG GHTQLMRVSD VGVYETLGET LDDAAGEAFD
KTAKLLGLGY PGGPEVSRLA EAGTPGAVVL PRPMLHSGDL DFSFSGLKTA VLTQMKKLEA
AHAGGAVLER AKADFARGFV DAAVDVLVAK SLAALKATRL KRLVVAGGVG ANRQLRAALS
AAAQKRGFDV HYPDLALCTD NGAMIALAGA LRLARWPSQA SRDYAFTVKP RWDLASLAR