Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0824 |
Symbol | |
ID | 4885657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 802228 |
End bp | 803946 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640126752 |
Product | hypothetical protein |
Protein accession | YP_001057875 |
Protein GI | 126442235 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGTCA AATCGTCGTT TGCCGTCCTG CTGTGCGCGG CGCTCGCCTT GCCGCCCGGC GGCCACGCGC AGTCGCGCGG CGATGCGCCG CCGCTCGAAT CCGCGCGCGC CGCCGGCGCC GAGGACGCCG CGGCGCGCGC GCGCGATGCG CTGTCCACGG TGCCGTCCGG CATCGCGCCC GGCGTGTTCG GCATGTACGG CGGCGCGCAG AGCCGGCTTG CCGATCCGGC GTCGGGCACG CCCAGTTTGC GCGCGCCGCT TCGCTCGTTG CAACTGCCCG ATCTCGGCGA CGGCTCGGGC GGCTCGCTGA CGCCGCAAGC GGAGCGCCGG CTCGGCGAGC GCGTGATGCG CGAGGTGCGG CGCGATCCCG ACTATCTCGA CGACTGGCTC GTGCGCGACT ACCTGAATTC GGTCGCGGCG AAGCTCTCCG CGGCCGCCGC CGCGCAGTTC ATCGGCGGCT ACATGCCCGA TTTCGAGCTG TTCGCGATGC GCGATCCGCA GATCAACGCG TTCTCGCTGC CGGGCGGTTT CATCGGCATC AACAGCGGGC TCGTCGCGGC GACGCAGACG GAGTCCGAAC TCGCGTCGGT GATTGGCCAT GAGATGGGGC ACGTGCTGCA GCGGCACATC GCGCGGATGA TCGGCGCGAG CGAGAAGAGC GGCTATGCGG CGCTCGCGAC GATGCTGTTC GGCGTGCTCG CGGGCATTCT CGCGCGCAGC GGCGATCTCG GCAGCGCGAT CGCGATGGGC GGCCAGGCGT TCGCGGTCGA CAGCCAGCTC AGGTTCTCGC GCTCGGCCGA GCGCGAGGCG GACCGCGTCG GCTTCCAGTT GCTCGCGGGC GCCGGCTACG ATCCGTACGG CATGCCGGGC TTCTTCGAGC GGCTCGAGCG TGCGTCGGTG GGCGACGCGG GCGTGCCCGC GTACGCGCGC ACGCACCCGC TGACGGGCGA GCGGATCGCG GACATGGACG ACCGCGCGCG GCGCGCGCCG TACCGGCAGC CGCGGCAATC GGCGGAATAC GGTTTCGTGC GCGCGCGCCT GCGGATGCTG CAGAACCGCG CGCCGACCGA TTACGCGAAC GAGGCAAGAC GAATGCGCGC GGAGCTCGAC GATCGCGTCG CGCCGAATGT CGCGGCGAAC TGGTATGGGA TCGCGCTCGG CGAGATGCTG GGCGGCCGCT ACGATGACGC GGACCGCGCG CTCGCCGCGG CGCGCGATGC GTTCGCGCGC ACGGCCGCGC GCGAGGGCGA GGCGGCGCGC ACGTCGCCGA GCCTCGACGT GCTCGCGGCG GAGATCGCGC GTCGCGCCGG CCGCGGGGAC GACGCGGTGC GGCTCGCCGC CGCCGCGCAG GCGCGCTGGC CGGGTTCGCA CGCGGCTATC GCCGCGCATT TGCAGGCGCT TCTCGCCGCG CGGCGTTACG GGCAGGCGCA GGCGCTCGCA CAAGCGGAGG CGAACGCGGC CCCCCGCCAG CCCGATTGGT GGAACTATCT CGCGCAGGCG AGCCTTGGCC GGGGCGATGC GCTCACGCAG CGCCGCGCGC TCGCGGAGAA GTTCGCGCTC GAAGGCGCGT GGCCGTCGGC GATCCGGCAA CTGCGCGAGG CGCGCGATCT CAAGTCGGCC GGTTTCTACG AGCAGTCGAT CATCAGCGCG CGGCTGCACG AATTCGAGGC ACGCTACAAG GAAGAGCGGG AAGAGGACAA GGACGATCGG CGCGGTTGA
|
Protein sequence | MRVKSSFAVL LCAALALPPG GHAQSRGDAP PLESARAAGA EDAAARARDA LSTVPSGIAP GVFGMYGGAQ SRLADPASGT PSLRAPLRSL QLPDLGDGSG GSLTPQAERR LGERVMREVR RDPDYLDDWL VRDYLNSVAA KLSAAAAAQF IGGYMPDFEL FAMRDPQINA FSLPGGFIGI NSGLVAATQT ESELASVIGH EMGHVLQRHI ARMIGASEKS GYAALATMLF GVLAGILARS GDLGSAIAMG GQAFAVDSQL RFSRSAEREA DRVGFQLLAG AGYDPYGMPG FFERLERASV GDAGVPAYAR THPLTGERIA DMDDRARRAP YRQPRQSAEY GFVRARLRML QNRAPTDYAN EARRMRAELD DRVAPNVAAN WYGIALGEML GGRYDDADRA LAAARDAFAR TAAREGEAAR TSPSLDVLAA EIARRAGRGD DAVRLAAAAQ ARWPGSHAAI AAHLQALLAA RRYGQAQALA QAEANAAPRQ PDWWNYLAQA SLGRGDALTQ RRALAEKFAL EGAWPSAIRQ LREARDLKSA GFYEQSIISA RLHEFEARYK EEREEDKDDR RG
|
| |