Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A0839 |
Symbol | gcp |
ID | 3692124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 1096116 |
End bp | 1097156 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637731094 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_335998 |
Protein GI | 162210103 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGTTC TCGGCATCGA AAGCTCCTGC GACGAAACCG GCCTCGCGCT CTACGACACC GAGCGCGGCC TGCTCGCGCA CGCGCTTCAC TCGCAGATCG CGATGCACCG CGAATACGGC GGTGTCGTTC CCGAGCTCGC GTCGCGCGAC CACATTCGCC GCGCGCTGCC GCTGCTCGAA GAGGTGCTCG CCGCAAGCGG CGCGCGCCGC GACGACATCG ACGCGATCGC GTTCACGCAG GGGCCCGGCC TCGCGGGCGC GCTGCTCGTC GGCGCGAGCA TCGCGAACGC GCTCGCGTTC GCGTGGGACA AGCCGACCAT CGGCATCCAC CACCTCGAAG GGCATCTGCT GTCGCCGCTG CTCGTCGCCG AGCCGCCGCC GTTTCCGTTC GTCGCGCTGC TCGTGTCGGG CGGCCATACG CAACTGATGC GCGTGAGCGA CGTCGGCGTC TACGAGACGC TCGGCGAGAC GCTCGACGAT GCCGCCGGCG AAGCGTTCGA CAAGACCGCG AAGCTGCTCG GCCTCGGCTA TCCGGGCGGG CCGGAGGTAT CGAGGCTCGC GGAAGCCGGC ACCCCGGGCG CGGTCGTGCT GCCGCGGCCG ATGCTTCATT CGGGGGATCT CGACTTCAGC TTCAGCGGGC TGAAGACCGC CGTGCTCACG CAAATGAAGA AGCTCGAAGC GGCGCACGCG GGCGGCGCCG TGCTCGAACG AGCGAAGGCG GATTTCGCGC GCGGCTTCGT CGACGCGGCC GTCGACGTGC TCGTCGCGAA GTCGCTCGCC GCGTTGAAGG CGACGCGGCT CAAGCGGCTC GTCGTCGCCG GCGGCGTGGG CGCGAACCGG CAATTGCGCG CGGCGCTGTC GGCCGCCGCC CAAAAGCGCG GCTTCGACGT CCATTATCCC GATCTCGCGC TCTGCACCGA CAACGGCGCG ATGATCGCGC TCGCGGGCGC GCTGCGGCTC GCGCGCTGGC CGTCGCAGGC GAGCCGCGAT TACGCGTTCA CGGTGAAGCC GCGCTGGGAT CTCGCGTCGC TCGCGCGATA G
|
Protein sequence | MLVLGIESSC DETGLALYDT ERGLLAHALH SQIAMHREYG GVVPELASRD HIRRALPLLE EVLAASGARR DDIDAIAFTQ GPGLAGALLV GASIANALAF AWDKPTIGIH HLEGHLLSPL LVAEPPPFPF VALLVSGGHT QLMRVSDVGV YETLGETLDD AAGEAFDKTA KLLGLGYPGG PEVSRLAEAG TPGAVVLPRP MLHSGDLDFS FSGLKTAVLT QMKKLEAAHA GGAVLERAKA DFARGFVDAA VDVLVAKSLA ALKATRLKRL VVAGGVGANR QLRAALSAAA QKRGFDVHYP DLALCTDNGA MIALAGALRL ARWPSQASRD YAFTVKPRWD LASLAR
|
| |