Gene Information |
Locus tag | BURPS1106A_A0562 |
Symbol | |
ID | 4903372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 551988 |
End bp | 554975 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640143668 |
Product | putative lipoprotein |
Protein accession | YP_001074598 |
Protein GI | 126457912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
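The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 551988
end_bp = 554975
protein_length_aa = 995

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the last one is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 2988
print(remainder)            # 0
print(expected_protein_aa)  # 995
```

Under these assumptions the computed values agree with the listed Gene Length (2988 bp) and Protein Length (995 aa).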
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAGC GTGAGGCGAC GGGCCGCAAG CGGCGCGGCG GCACATGGCT GGCCGTCGTG CTCGCATGCG CCCGTCCTTT CGACGCCGCC GCGCACGTCG AAGAAACGAT CGCCGCGCCG ACGCATCTCG GCGACTGGCT CGCCGCCCAT CAGTCGACGC CCGCCGTCGG AACGCCGCCG TCCGGCGCGC CTTCGCCGTA CCTCGGCGGC CTGAGCTGGC GCTCGAACCG CGAAGTCGCC GCACAGCAGG CGAGCAAGCG CCGCCTGCTC GCCGGCATCG ACGCGCTGCC CGCGCTCACG CCGGCCGCGC AGGCGGCGCG GGCACGCCTC TCGGCGATGA TCGCCGCGCG TGCCGCAACC GGCCGCGTGA TCGTCGCGCG AAGCGATGCG CGCTGGCTGC AGGCCAATCC CGCCCACGAT CCCTATCTCG AAGCCGGCGA CGTCGTGACG ATTCCCGAGC GCCCGTCGAG CGTCGCCGTC GTGCGCGCGG ACGGCTCGAT CTGCACGGTC GCTCACGTGC AGGACGTCGA AGCATTGCCG TACGTGCTCG CGTGCGACCC CGACGCGGCG CCCGATCTCG CGTGGATCGC GCAACCCGAC GGCACGGTCA GCGAAAGCAA GGTGGCGATG TGGAATCGCG ACGTGCAGGA CACGCCGGCG CCCGGCAGTT GGATCTGGGC GCCCGATCGG GGCAGCCGAT GGCCGCCGGC CCTGTCGCGC GCCCTGGCGG AATTCATGGC GACGCAGGGC GTATCCGGGC TCGCGGACGA CGGCTCGCCG CTGCCCGCGC CTCCCATTGC GCCCGTCCAC CAGACCGCGT TTCCGAGCGG CGCGCCCGGC CGGTCCGCAG CGTTCCCGGT AACGGGCGGC GACTGGGGCA CGGCGGGCAT TCTGCAAACG CCGACCGCGC GAATGAACGA CGCGGGCGAA GCATCGCTCA GCATGAGCCA CGTGAGCCCG TACACGCGCC TGAACTTCAC GCTGCAGCCG CTCGATTGGC TCGAAATCGG GTTCCGCTAC ACCGACGTCA GCAATCAGCC GTACGGCCCC GTCTCGCTGA GCGGCACCCA GTCGTACAAG GACAAGAGCA TCGACGCGAA GCTCAGGCTG TGGCGCGAAT CCGCCTATCT GCCCGACGTG GCCGTCGGCT TTCGCGACAT CGCCGGCTCG GGCCTGTTCT CCGGCGAGTA CCTGGTGGCC AGCAAGCGAA CCGGGCCGTT CGACTGGAGC GTCGGCCTCG GCTGGGGTTA CGTGGGCGCG CGCGGCAATC TGCGCAACCC GCTGGCGGTG ATCAGCCGGC GGTTCGACGA TCGCACGAAC AGCGCGACAC CGAACGGCGG CGAGCTCGGC TACAGCTCAT GGTTTCGCGG CCGCGTCTCG CCGTTCGGCG GCGTGCAATA CCAGACGCCG CACGAGCGCC TCATCCTGAA AGCCGAATAC GACGGCAACG ACTATCGGCA CGAACCGTTC GGTCAAGTGC TGAAGGCGCG ATCGCCATTC AACTTCGGCG CCGTCTATCG CGCGACGCGC AACATCGACT TGAGCCTCGG CTTCGAGCGA GGCGCGCGCG TGATGTTCGG CGTCTCGCTG CACGGCAATC TGAAGCGCGC GTCGATGCCC AAGCTCGGCA ATCCGCCGGC TCCGCCGGTG ACGCAACCGG CCGCGAACGC CGGGCCGCCC CCGCCGGCCG CCGATCCGGC ATCGGGCGAC GCGCAAGCGG CGACCGCGCA GGCATCGCGC ATCGGACGCG CGTCGCCGTC GCCGTTCGAT CGCGACTGGT CCGGCACCGT CGCGCAATTG 
CAGGCGCAAA CGCATTGGCA CGTGCGCAGC ATCCGTGCGC TCGGCATGGA TCTCGTCGTC GAGTTCGACG ACGTCGACGC GTTCTACCTG CAGGACCCGC TCGAGCGCAT CGCGACGATC CTGAACCGTG ACGCGCCGCT CAACGTGCGC ACGTTCCATG TCGTCGCGCT CGTGCACGGC GTGCCGGTTG CCGACTATCA GGTGCAGCGC ACGCAGTGGT TCGCGAGCCG CACCCGCGCC CTCACGCCGA GCGAGGCTGC GCCCGACACG GCGCTCGGCC GGCCGCTCAC GCGACAGTCG ATCGACATGC TGCCCTCTCT ATTCGAGCAG CGGCCCAAGG CCTTCGTGGC GTCGGTCGGC CCGGGCTACC GGCAAACCCT TGGCGGTCCG AACGGTTTCC TGCTCTACCA GATCTCCGCC GATGCATACG GCGAGGTGAG ACTGCCCGGC GGCGCATGGC TCGGCGGCGA ACTGAACGTG GGGCTCGTCG ACAACTACGG CAAGTTCACC TACACGGCGG ATAGCAAGCT GCCGCGCGTG CGCACGTATC TGCGCGAGTA CCTGACGACG TCGCACGTCA CGCTGCCGCT GCTGCAACTG ACGAAGATGG GACGCCTCGG CAACGATCAG TTCTACAGCG TATACGGCGG GCTGCTCGAA AGCATGTTCG CGGGCGTCGG GGCCGAATGG CTGTATCGCC CGGCGGATAG CCGCCTCGCG ATCGGCGTCG ACGTGAACGC GGTGCGGCAG CGCGGCTTCC GCCAGGATTT CTCGATGCGC GACTATCGGA CGCTCACCGG ACACGTGACG GCGTATTGGA ACACCGGATG GCAAGGCATC CAAATCAATC TGAGCGTCGG CCAGTATCTG GCGAAGGACA AGGGCGCGAC GCTCGACATT TCGCGGCGCT TTCGCAACGG CGTCGTGATC GGCGCCTATG CGACGAAGAC GAACATATCG GCGGCCCAAT TCGGCGAAGG CAGCTTCGAC AAGGGCATCT ACCTGACGAT TCCGTTCGAC GCGATGATGA CGCGCTCGAG CGGCAGCGTG GCGAATCTGC GCTGGAACCC CGTGACGCGC GACGGCGGCG CGAAGCTGGA TCGCAAATAT CCGCTGTACG ATCTCACCGA CATGGGCGAG CGCCGCAGCT TGTGGTACGC GCCGCCGGAT GGCGCATTGT CGCCGTGA
|
Protein sequence | MPKREATGRK RRGGTWLAVV LACARPFDAA AHVEETIAAP THLGDWLAAH QSTPAVGTPP SGAPSPYLGG LSWRSNREVA AQQASKRRLL AGIDALPALT PAAQAARARL SAMIAARAAT GRVIVARSDA RWLQANPAHD PYLEAGDVVT IPERPSSVAV VRADGSICTV AHVQDVEALP YVLACDPDAA PDLAWIAQPD GTVSESKVAM WNRDVQDTPA PGSWIWAPDR GSRWPPALSR ALAEFMATQG VSGLADDGSP LPAPPIAPVH QTAFPSGAPG RSAAFPVTGG DWGTAGILQT PTARMNDAGE ASLSMSHVSP YTRLNFTLQP LDWLEIGFRY TDVSNQPYGP VSLSGTQSYK DKSIDAKLRL WRESAYLPDV AVGFRDIAGS GLFSGEYLVA SKRTGPFDWS VGLGWGYVGA RGNLRNPLAV ISRRFDDRTN SATPNGGELG YSSWFRGRVS PFGGVQYQTP HERLILKAEY DGNDYRHEPF GQVLKARSPF NFGAVYRATR NIDLSLGFER GARVMFGVSL HGNLKRASMP KLGNPPAPPV TQPAANAGPP PPAADPASGD AQAATAQASR IGRASPSPFD RDWSGTVAQL QAQTHWHVRS IRALGMDLVV EFDDVDAFYL QDPLERIATI LNRDAPLNVR TFHVVALVHG VPVADYQVQR TQWFASRTRA LTPSEAAPDT ALGRPLTRQS IDMLPSLFEQ RPKAFVASVG PGYRQTLGGP NGFLLYQISA DAYGEVRLPG GAWLGGELNV GLVDNYGKFT YTADSKLPRV RTYLREYLTT SHVTLPLLQL TKMGRLGNDQ FYSVYGGLLE SMFAGVGAEW LYRPADSRLA IGVDVNAVRQ RGFRQDFSMR DYRTLTGHVT AYWNTGWQGI QINLSVGQYL AKDKGATLDI SRRFRNGVVI GAYATKTNIS AAQFGEGSFD KGIYLTIPFD AMMTRSSGSV ANLRWNPVTR DGGAKLDRKY PLYDLTDMGE RRSLWYAPPD GALSP
|
| |
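The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The `translate` helper is hypothetical and written for this illustration. It assumes the sequence is already in coding orientation, as the listed gene sequence appears to be, since it begins with ATG and ends with TGA. Only the first and last few blocks of the sequence are used here.

```python
from itertools import product

# Standard codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence listed above.
first_blocks = "ATGCCGAAGC GTGAGGCGAC GGGCCGCAAG"
last_blocks = "GGCGCATTGT CGCCGTGA"

print(translate(first_blocks))  # MPKREATGRK
print(translate(last_blocks))   # GALSP
```

Both results match the start (MPKREATGRK) and the end (GALSP) of the listed protein sequence. The last 18 bases are in frame because the full gene length, 2988 bp, is a multiple of three.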