Gene BURPS1106A_A0562 details

Gene Information

Locus tag: BURPS1106A_A0562
Symbol:
ID: 4903372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 551988
End bp: 554975
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 69%
IMG OID: 640143668
Product: putative lipoprotein
Protein accession: YP_001074598
Protein GI: 126457912
COG category:
COG ID:
TIGRFAM ID:
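
The coordinates and lengths listed above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions and the CDS ends in a single stop codon. Both points are assumptions about this record's conventions; the page does not state them. A minimal Python sketch of the check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields above.
length_bp = gene_length(551988, 554975)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the computed values match the Gene Length and Protein Length fields.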


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAAGC GTGAGGCGAC GGGCCGCAAG CGGCGCGGCG GCACATGGCT GGCCGTCGTG 
CTCGCATGCG CCCGTCCTTT CGACGCCGCC GCGCACGTCG AAGAAACGAT CGCCGCGCCG
ACGCATCTCG GCGACTGGCT CGCCGCCCAT CAGTCGACGC CCGCCGTCGG AACGCCGCCG
TCCGGCGCGC CTTCGCCGTA CCTCGGCGGC CTGAGCTGGC GCTCGAACCG CGAAGTCGCC
GCACAGCAGG CGAGCAAGCG CCGCCTGCTC GCCGGCATCG ACGCGCTGCC CGCGCTCACG
CCGGCCGCGC AGGCGGCGCG GGCACGCCTC TCGGCGATGA TCGCCGCGCG TGCCGCAACC
GGCCGCGTGA TCGTCGCGCG AAGCGATGCG CGCTGGCTGC AGGCCAATCC CGCCCACGAT
CCCTATCTCG AAGCCGGCGA CGTCGTGACG ATTCCCGAGC GCCCGTCGAG CGTCGCCGTC
GTGCGCGCGG ACGGCTCGAT CTGCACGGTC GCTCACGTGC AGGACGTCGA AGCATTGCCG
TACGTGCTCG CGTGCGACCC CGACGCGGCG CCCGATCTCG CGTGGATCGC GCAACCCGAC
GGCACGGTCA GCGAAAGCAA GGTGGCGATG TGGAATCGCG ACGTGCAGGA CACGCCGGCG
CCCGGCAGTT GGATCTGGGC GCCCGATCGG GGCAGCCGAT GGCCGCCGGC CCTGTCGCGC
GCCCTGGCGG AATTCATGGC GACGCAGGGC GTATCCGGGC TCGCGGACGA CGGCTCGCCG
CTGCCCGCGC CTCCCATTGC GCCCGTCCAC CAGACCGCGT TTCCGAGCGG CGCGCCCGGC
CGGTCCGCAG CGTTCCCGGT AACGGGCGGC GACTGGGGCA CGGCGGGCAT TCTGCAAACG
CCGACCGCGC GAATGAACGA CGCGGGCGAA GCATCGCTCA GCATGAGCCA CGTGAGCCCG
TACACGCGCC TGAACTTCAC GCTGCAGCCG CTCGATTGGC TCGAAATCGG GTTCCGCTAC
ACCGACGTCA GCAATCAGCC GTACGGCCCC GTCTCGCTGA GCGGCACCCA GTCGTACAAG
GACAAGAGCA TCGACGCGAA GCTCAGGCTG TGGCGCGAAT CCGCCTATCT GCCCGACGTG
GCCGTCGGCT TTCGCGACAT CGCCGGCTCG GGCCTGTTCT CCGGCGAGTA CCTGGTGGCC
AGCAAGCGAA CCGGGCCGTT CGACTGGAGC GTCGGCCTCG GCTGGGGTTA CGTGGGCGCG
CGCGGCAATC TGCGCAACCC GCTGGCGGTG ATCAGCCGGC GGTTCGACGA TCGCACGAAC
AGCGCGACAC CGAACGGCGG CGAGCTCGGC TACAGCTCAT GGTTTCGCGG CCGCGTCTCG
CCGTTCGGCG GCGTGCAATA CCAGACGCCG CACGAGCGCC TCATCCTGAA AGCCGAATAC
GACGGCAACG ACTATCGGCA CGAACCGTTC GGTCAAGTGC TGAAGGCGCG ATCGCCATTC
AACTTCGGCG CCGTCTATCG CGCGACGCGC AACATCGACT TGAGCCTCGG CTTCGAGCGA
GGCGCGCGCG TGATGTTCGG CGTCTCGCTG CACGGCAATC TGAAGCGCGC GTCGATGCCC
AAGCTCGGCA ATCCGCCGGC TCCGCCGGTG ACGCAACCGG CCGCGAACGC CGGGCCGCCC
CCGCCGGCCG CCGATCCGGC ATCGGGCGAC GCGCAAGCGG CGACCGCGCA GGCATCGCGC
ATCGGACGCG CGTCGCCGTC GCCGTTCGAT CGCGACTGGT CCGGCACCGT CGCGCAATTG
CAGGCGCAAA CGCATTGGCA CGTGCGCAGC ATCCGTGCGC TCGGCATGGA TCTCGTCGTC
GAGTTCGACG ACGTCGACGC GTTCTACCTG CAGGACCCGC TCGAGCGCAT CGCGACGATC
CTGAACCGTG ACGCGCCGCT CAACGTGCGC ACGTTCCATG TCGTCGCGCT CGTGCACGGC
GTGCCGGTTG CCGACTATCA GGTGCAGCGC ACGCAGTGGT TCGCGAGCCG CACCCGCGCC
CTCACGCCGA GCGAGGCTGC GCCCGACACG GCGCTCGGCC GGCCGCTCAC GCGACAGTCG
ATCGACATGC TGCCCTCTCT ATTCGAGCAG CGGCCCAAGG CCTTCGTGGC GTCGGTCGGC
CCGGGCTACC GGCAAACCCT TGGCGGTCCG AACGGTTTCC TGCTCTACCA GATCTCCGCC
GATGCATACG GCGAGGTGAG ACTGCCCGGC GGCGCATGGC TCGGCGGCGA ACTGAACGTG
GGGCTCGTCG ACAACTACGG CAAGTTCACC TACACGGCGG ATAGCAAGCT GCCGCGCGTG
CGCACGTATC TGCGCGAGTA CCTGACGACG TCGCACGTCA CGCTGCCGCT GCTGCAACTG
ACGAAGATGG GACGCCTCGG CAACGATCAG TTCTACAGCG TATACGGCGG GCTGCTCGAA
AGCATGTTCG CGGGCGTCGG GGCCGAATGG CTGTATCGCC CGGCGGATAG CCGCCTCGCG
ATCGGCGTCG ACGTGAACGC GGTGCGGCAG CGCGGCTTCC GCCAGGATTT CTCGATGCGC
GACTATCGGA CGCTCACCGG ACACGTGACG GCGTATTGGA ACACCGGATG GCAAGGCATC
CAAATCAATC TGAGCGTCGG CCAGTATCTG GCGAAGGACA AGGGCGCGAC GCTCGACATT
TCGCGGCGCT TTCGCAACGG CGTCGTGATC GGCGCCTATG CGACGAAGAC GAACATATCG
GCGGCCCAAT TCGGCGAAGG CAGCTTCGAC AAGGGCATCT ACCTGACGAT TCCGTTCGAC
GCGATGATGA CGCGCTCGAG CGGCAGCGTG GCGAATCTGC GCTGGAACCC CGTGACGCGC
GACGGCGGCG CGAAGCTGGA TCGCAAATAT CCGCTGTACG ATCTCACCGA CATGGGCGAG
CGCCGCAGCT TGTGGTACGC GCCGCCGGAT GGCGCATTGT CGCCGTGA
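
The gene sequence is printed in space-separated 10-base blocks, so it has to be joined before any per-base statistic, such as the GC content reported in Gene Information, can be computed. The following Python sketch shows one way to do that. The function names are hypothetical, and the rounding rule behind the page's 69% figure is not stated, so the sketch returns an unrounded fraction.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# Illustration only: the first two 10-base blocks of the gene sequence above.
first_blocks = "ATGCCGAAGC GTGAGGCGAC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` gives the value to compare with the GC content field.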
 
Protein sequence
MPKREATGRK RRGGTWLAVV LACARPFDAA AHVEETIAAP THLGDWLAAH QSTPAVGTPP 
SGAPSPYLGG LSWRSNREVA AQQASKRRLL AGIDALPALT PAAQAARARL SAMIAARAAT
GRVIVARSDA RWLQANPAHD PYLEAGDVVT IPERPSSVAV VRADGSICTV AHVQDVEALP
YVLACDPDAA PDLAWIAQPD GTVSESKVAM WNRDVQDTPA PGSWIWAPDR GSRWPPALSR
ALAEFMATQG VSGLADDGSP LPAPPIAPVH QTAFPSGAPG RSAAFPVTGG DWGTAGILQT
PTARMNDAGE ASLSMSHVSP YTRLNFTLQP LDWLEIGFRY TDVSNQPYGP VSLSGTQSYK
DKSIDAKLRL WRESAYLPDV AVGFRDIAGS GLFSGEYLVA SKRTGPFDWS VGLGWGYVGA
RGNLRNPLAV ISRRFDDRTN SATPNGGELG YSSWFRGRVS PFGGVQYQTP HERLILKAEY
DGNDYRHEPF GQVLKARSPF NFGAVYRATR NIDLSLGFER GARVMFGVSL HGNLKRASMP
KLGNPPAPPV TQPAANAGPP PPAADPASGD AQAATAQASR IGRASPSPFD RDWSGTVAQL
QAQTHWHVRS IRALGMDLVV EFDDVDAFYL QDPLERIATI LNRDAPLNVR TFHVVALVHG
VPVADYQVQR TQWFASRTRA LTPSEAAPDT ALGRPLTRQS IDMLPSLFEQ RPKAFVASVG
PGYRQTLGGP NGFLLYQISA DAYGEVRLPG GAWLGGELNV GLVDNYGKFT YTADSKLPRV
RTYLREYLTT SHVTLPLLQL TKMGRLGNDQ FYSVYGGLLE SMFAGVGAEW LYRPADSRLA
IGVDVNAVRQ RGFRQDFSMR DYRTLTGHVT AYWNTGWQGI QINLSVGQYL AKDKGATLDI
SRRFRNGVVI GAYATKTNIS AAQFGEGSFD KGIYLTIPFD AMMTRSSGSV ANLRWNPVTR
DGGAKLDRKY PLYDLTDMGE RRSLWYAPPD GALSP
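
The protein sequence can be cross-checked against the gene sequence by translating it with the codon assignments of translation table 11, the table named in Gene Information. The Python sketch below is a simplified illustration with hypothetical names. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

# Codon assignments of NCBI translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCCGAAGC GTGAGGCGAC GGGCCGCAAG"))  # MPKREATGRK
```

The output matches the first block of the protein sequence, and the same function can be applied to the full gene sequence text.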