Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2256 |
Symbol | |
ID | 4881852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2243646 |
End bp | 2244566 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640128184 |
Product | Ser/Thr protein phosphatase family protein |
Protein accession | YP_001059291 |
Protein GI | 126440025 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0412874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAATC CCATCCAACC CTTGAAACGG CGCGATTTCC TGCGCCTTGC CGCCTGCGGC GGCGGCGTCG CCTTCGCTTC CGCGCTGCCC GGCTGGAGCT TCGCCGCGAA CGCCGGCGCG GATTTCTTCT TCGTCCAGCT CTCCGACGCG CACTGGGGCT TCACCGGCCC CGCGATCAAC CCCGACGCGC GCGGCACGCT GCCGAAGGCG ATCGAGGCCG TCAACGCGCT GCCCGTCGCG CCCGACTTCG TGATGTTCAC GGGCGATCTG ACGCACACGA CCGACGATCC GGCCGAGCGC CGCGCACGGA TGCGGCAGTT CCAGTCGATC GTCGCGCAAC TGCGGGCGAA GCCGCTGCAC CTGATGCCGG GCGAGCACGA CGCGAGCCTC GATGCAGGCG CCGCGTACCG CGAGATCTTC GGCGACACCC ACTACGCGTT CGATCACAAG GGCGTGCATT TCGTCGTCGT CGACAACGTG TCGGATCCGG CCGGCCGCGT CGGCGACGCG CAGATCGAAT GGCTCGCGCG GGATCTCGCG CGACAGCCGA AGGACGCGCG CATCGTCGTC TTCACGCACC GGCCGCTCTT CGATCTCGCG CCGCAATGGG ACTGGGCCAC GCGCGACGGC GCGAAGGTCG TCGACGTGCT GATGCCCTAT CCGAACGTCA CCGTGTTCTA CGGACACATC CATCAGGAGC ACCACGCGAT GACGGGCCAC ATCGCGCACC ACGCGGCGCG CTCGCTGATG TTTCCGCTGC CCGCGCCCGG CTCGCAGGAC AAGCGCCTGC CGGTGCCGTG GGACGCCGCC GCGCCGTATC GCGGGCTCGG CTGGCGCGAA GTGCGCGTCG GCGACGCGGC GCGCGCGCCC GCGTTGACCG AGATGCCCGT CGCCGCGCCC CAACCGCAAC CGCGCGCCTG A
|
Protein sequence | MPNPIQPLKR RDFLRLAACG GGVAFASALP GWSFAANAGA DFFFVQLSDA HWGFTGPAIN PDARGTLPKA IEAVNALPVA PDFVMFTGDL THTTDDPAER RARMRQFQSI VAQLRAKPLH LMPGEHDASL DAGAAYREIF GDTHYAFDHK GVHFVVVDNV SDPAGRVGDA QIEWLARDLA RQPKDARIVV FTHRPLFDLA PQWDWATRDG AKVVDVLMPY PNVTVFYGHI HQEHHAMTGH IAHHAARSLM FPLPAPGSQD KRLPVPWDAA APYRGLGWRE VRVGDAARAP ALTEMPVAAP QPQPRA
|
| |