Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_1682 |
Symbol | |
ID | 3691572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 1801228 |
End bp | 1802484 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637728138 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_333085 |
Protein GI | 76811214 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCACA TGAACGAACC GCGACAGTTC GGCCGCAAGA GCGGCGGCGA CTCGCACCCG GAGCAAGTGC TCGAAACCGT CACGAAGGAA CTCAAGCGCA TCGGTGACGA AGTGAAATCC GCCGGCGAGA AGGCGCTCGC TGAAGCAAAG AGGGCTGGCG ATCTGGGCGT AGAAACGAAA GCCACGGTCG ACGAGCTGTT GATCAAGCAG GGCGAACTCC AAGCCCGTCT GCTGGAGGCC GAGCAAAAGC TGGCTCGCGG CGGCGGTAGC GCCGAACTCG AAACGCCGAA GACCCTCGGT CAACTTGTGA CCGAATCGGA GGAGATGAAG GGGATGGACG GAAGCGCGCG CAAATCAGTG CGCGTTCGCG TCGATCGCAA GAGCATCATG AACGTGCCTG CGACGGTCGG CAGCGGCGTG AGCGGCAGCA ACTCGCTGGT CGTCGCGGAC CGTCAAGCGG GGATCATTGC GCCGCCGCAA CGGAAGATGA CGATTCGCGA TTTGCTCATG CCGGGCCAGA CGTCGTCGAG TAGCATCGAG TACACCGTCG AAACCGGCTT CACGAACAAC GCAGCGGCAG TAGCCGAAGG CGCACAGAAA CCGACTTCGG ATCTGAAGTT CAATCTGAAG AACCAGCCGG TTCGCACGAT CGCACATCTG TTCAAGGCGT CGCGTCAGAT TCTCGACGAT GCGCCGGCAC TGCAATCGTA TATCGACGGC CGTGCCCGGT ACGGGCTTCA ACTCACCGAG GAAGGCCAAA TTCTGAAGGG CGATGGTACC GGGGCGAACA TTCTCGGCAT CTTGCCCCAA GCGTCAGCGT TCATGCCATC CATCACGCTC GCGAATGCGA CGCCGATCGA CAAGATCCGT CTGGCACTGT TGCAAGCAGT TCTCGCCGAA TTTCCGGCGA CCGGGATCGT CCTGAATCCG ATCGACTGGG CGTCGATCGA GTTGACGAAG GACAGCCAGG GGCGATACAT CGTCGGCAAT CCGGTCAACG GTACGACGCC GCGCCTGTGG AATCTGCCGG TCGTTGAAAC GCAGGCGATG ACTGCGAACG AATTCCTCGT CGGCGCCTTC TCGATGGCGG CTCAGATCTT CGACCGCATG GAGATCGAGG TTCTTCTGTC GACCGAGAAC GTCGACGACT TCGAAAAGAA CATGGTGTCG ATCCGTGCGG AGGAACGCCT CGCGCTTGCC GTCTATCGTC CGGAATCGTT TGTGACCGGT GCATTAGTCG AGCAAGCCGG CGGCTGA
|
Protein sequence | MSHMNEPRQF GRKSGGDSHP EQVLETVTKE LKRIGDEVKS AGEKALAEAK RAGDLGVETK ATVDELLIKQ GELQARLLEA EQKLARGGGS AELETPKTLG QLVTESEEMK GMDGSARKSV RVRVDRKSIM NVPATVGSGV SGSNSLVVAD RQAGIIAPPQ RKMTIRDLLM PGQTSSSSIE YTVETGFTNN AAAVAEGAQK PTSDLKFNLK NQPVRTIAHL FKASRQILDD APALQSYIDG RARYGLQLTE EGQILKGDGT GANILGILPQ ASAFMPSITL ANATPIDKIR LALLQAVLAE FPATGIVLNP IDWASIELTK DSQGRYIVGN PVNGTTPRLW NLPVVETQAM TANEFLVGAF SMAAQIFDRM EIEVLLSTEN VDDFEKNMVS IRAEERLALA VYRPESFVTG ALVEQAGG
|
| |