Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2389 |
Symbol | gcp |
ID | 4905698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2366206 |
End bp | 2367285 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640145494 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_001076421 |
Protein GI | 126457675 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.158561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCACGC GCGCGTCGAT GCGGCCGCCG CACACCATCA TGCTCGTTCT CGGCATCGAA AGCTCCTGCG ACGAAACCGG CCTCGCGCTC TACGACACCG AGCGCGGCCT GCTCGCGCAC GCGCTTCACT CGCAGATCGC GATGCACCGC GAATACGGCG GTGTCGTTCC CGAGCTCGCG TCGCGCGACC ACATTCGCCG CGCGCTGCCG CTGCTCGAAG AGGTGCTCGC CGCAAGCGGC GCGCGCCGCG ACGACATCGA CGCGATCGCG TTCACGCAGG GGCCCGGCCT CGCGGGCGCG CTGCTCGTCG GCGCGAGCAT CGCGAACGCG CTCGCGTTCG CGTGGGACAA GCCGACCATC GGCATCCACC ACCTCGAAGG GCATCTGCTG TCGCCGCTGC TCGTCGCCGA GCCGCCGCCG TTTCCGTTCG TCGCGCTGCT CGTGTCGGGC GGCCATACGC AACTGATGCG CGTGAGCGAC GTCGGCGTCT ACGAGACGCT CGGCGAGACG CTCGACGATG CCGCCGGCGA AGCGTTCGAC AAGACCGCGA AGCTGCTCGG CCTCGGCTAT CCGGGCGGGC CGGAGGTATC GAGGCTCGCG GAAGCCGGCA CCCCGGGCGC GGTCGTGCTG CCGCGGCCGA TGCTTCATTC GGGGGATCTC GACTTCAGCT TCAGCGGGCT GAAGACCGCC GTGCTCACGC AAATGAAGAA GCTCGAAGCG GCGCACGCGG GCGGCGCCGT GCTCGAACGG GCGAAGGCGG ATTTCGCGCG CGGCTTCGTC GACGCGGCCG TCGACGTGCT CGTCGCGAAG TCGCTCGCCG CGTTGAAGGC GACGCGGCTC AAGCGGCTCG TCGTCGCCGG CGGCGTGGGC GCGAACCGGC AATTGCGCGC GGCGCTGTCG GCCGCCGCCC AAAAGCGCGG CTTCGACGTC CATTATCCCG ATCTCGCGCT CTGCACCGAC AACGGCGCGA TGATCGCGCT CGCGGGCGCG CTGCGGCTCG CGCGCTGGCC GTCGCAGGCG AGCCGCGATT ACGCGTTCAC GGTGAAGCCG CGCTGGGATC TCGCGTCGCT CGCGCGATAG
|
Protein sequence | MRTRASMRPP HTIMLVLGIE SSCDETGLAL YDTERGLLAH ALHSQIAMHR EYGGVVPELA SRDHIRRALP LLEEVLAASG ARRDDIDAIA FTQGPGLAGA LLVGASIANA LAFAWDKPTI GIHHLEGHLL SPLLVAEPPP FPFVALLVSG GHTQLMRVSD VGVYETLGET LDDAAGEAFD KTAKLLGLGY PGGPEVSRLA EAGTPGAVVL PRPMLHSGDL DFSFSGLKTA VLTQMKKLEA AHAGGAVLER AKADFARGFV DAAVDVLVAK SLAALKATRL KRLVVAGGVG ANRQLRAALS AAAQKRGFDV HYPDLALCTD NGAMIALAGA LRLARWPSQA SRDYAFTVKP RWDLASLAR
|
| |