Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0079 |
Symbol | |
ID | 4888477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 66772 |
End bp | 67809 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640130020 |
Product | hypothetical protein |
Protein accession | YP_001061085 |
Protein GI | 126444298 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0781813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCCT TGAAAAAACG CCGCATCGGC ATTGCCGCGC TGCTCGCATT GTCCGCCGCG CTCGCTCTAT CCGGGCTCGC CGCGCACCGT GCGCTGCGGG CCGCACGTCC TGCGCCCGCC TTGCTCGCGC TGAAGCAGGT GGCCGATATT CCACTTCCGG GCGGCGCAAC GCGCTTCGAC TACGAGAGCA TCGATCCGAA CCGCCGTCTC CTTTACATCG CGCATCTCGG CGACGCGGAG ATCGTCGTCT TCGATCTGCG CGCGTCGCAA GTCACGGCGC GCATCGGCGA CATATCGTCC GTGCATGGCG TGCTTGCCGT TACCGAGCTA TCGCGCGTGT ACGCATCGGC CACGGGAACC GACGAAGTCG TCGCCATCGA CGCGCGGACA CGGAAGATCG TCGCGCGCAT TCCGGGCGGG CGCTACCCGG ACGGAATGGC CTACGCGCCC GAAGCATTCA AGCTGTACGT TTCGGACGAA TATGGGGAAA CCGAGACGGT CATCGACACG CGGACGAACC GGCGCATCGC GACGATTGCG CTCGGCGGCG AAGCAGGCAA CACGCAATAC GATCCGTCCT CCCGCCACGT GTTCGTGAAC GACCAGACGC ATGCGCGGCT TGTCGAGATC GATCCGGCAC TGGACCGGAT CGTCAATCGA TTCGATCTTC CTGGCGCCAA GGGCAACCAT GGGTTGCTGA TCGACCCGCG CGATCGGCTC GCGTTCATCG CTTGCGAAGG AAACGACAAA CTTCTGATTC TCGATATGCG CTCGATGCGA ATCGTCCAAT CGTTCGACAT CGGCGGGAGC CCGGATGTGC TTGCATTCGA TCCGTCGCTC GCGACGCTGT ACGTCGCGGG CGAGGCCGGC GTCATATCGA GGTTCCGCGT CGAGGCGAGC GGCGTCCGAA AGATCGACGA GGGCCGGCTC GCGGCCCATG CCCATGTCGT TGCGGTGGAC CCGTCCACGC ACCGGTCGTA CTTCCCATTG AAAAACATCG GTGGACGGCC CGTGTTGCGC GTCATGCGGC CTGCGTGA
|
Protein sequence | MNSLKKRRIG IAALLALSAA LALSGLAAHR ALRAARPAPA LLALKQVADI PLPGGATRFD YESIDPNRRL LYIAHLGDAE IVVFDLRASQ VTARIGDISS VHGVLAVTEL SRVYASATGT DEVVAIDART RKIVARIPGG RYPDGMAYAP EAFKLYVSDE YGETETVIDT RTNRRIATIA LGGEAGNTQY DPSSRHVFVN DQTHARLVEI DPALDRIVNR FDLPGAKGNH GLLIDPRDRL AFIACEGNDK LLILDMRSMR IVQSFDIGGS PDVLAFDPSL ATLYVAGEAG VISRFRVEAS GVRKIDEGRL AAHAHVVAVD PSTHRSYFPL KNIGGRPVLR VMRPA
|
| |