Gene BURPS668_A0655 details


Gene Information

Locus tag: BURPS668_A0655
Symbol:
ID: 4886507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 626180
End bp: 629167
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 69%
IMG OID: 640130595
Product: putative lipoprotein
Protein accession: YP_001061654
Protein GI: 126443604
COG category:
COG ID:
TIGRFAM ID:
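
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(626180, 629167)
print(length)                  # 2988
print(protein_length(length))  # 995
```

Both values agree with the Gene Length and Protein Length fields listed in the record.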


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAAGC GTGAGGCGAC GGGCCGCAAG CGGCGCGGCT GCACATGGCT GGCTGTCGTG 
CTCGCATGCG CCCGTCCTTT CGACGCCGCC GCGCACGTCG AAGAAACGAT CGCCGCGCCG
ACGCATCTCG GCGACTGGCT CGCCGCCCAT CAGTCGACGC CCGCCGTCGG AACGCCGCCG
TCCGGCGCGC CTTCGCCGTA CCTCGGCGGC CTGAGCTGGC GCTCGAACCG CGAAGTCGCC
GCACAGCAGG CGAGCAAGCG CCGCCTGCTC GCCGGCATCG ACGCGCTGCC CGCGCTCACG
CCGGCCGCGC AGGCGGCGCG GGCACGCCTC TCGGCGATGA TCGCCGCGCG TGCCGCAACC
GGCCGCGTGA TCGTCGCGCG AAGCGATGCG CGCTGGCTGC AGGCCAATCC CGCCCACGAT
CCCTATCTCG AAGCCGGCGA CGTCGTGACG ATTCCCGAGC GCCCGTCGAG CGTCGCCGTC
GTGCGCGCGG ACGGCTCGAT CTGCACGGTC GCTCACGTGC AGGATGTCGA AGCATTGCCG
TACGTGCTCG CGTGCGACCC CGACGCGGCG CCCGATCTCG CGTGGATCGC GCAACCCGAC
GGCACGGTCA GCGAAAGCAA GGTGGCGATG TGGAATCGCG ACGTGCAGGA CACGCCGGCG
CCCGGCAGTT GGATCTGGGC GCCCGATCGG GGCAGCCGAT GGCCGCCGGC CCTGTCGCGC
GCCCTGGCGG AATTCATGGC GACGCAGGGC GTATCCGGGC TCGCGGACGA CGGCTCGCCG
CTGCCCGCGC CTCCCATTGC GCCCGTCCAC CAGACCGCGT TTCCGAGTGG CGCGCCCGGC
CGGTCCGCAG CGTTCCCGGT AACGGGCGGC GACTGGGGCA CGGCGGGCAT TCTGCAAACG
CCGACCGCGC GAATGAACGA CGCCGGCGAA GCATCGCTCA GCATGAGCCA CGTGAGCCCG
TACACGCGCC TGAACTTCAC GCTGCAGCCG CTCGATTGGC TCGAAATCGG GTTCCGCTAC
ACCGACGTCA GCAATCAGCC GTACGGCCCC GTCTCGCTGA GCGGCACCCA GTCGTACAAG
GACAAGAGCA TCGACGCGAA GCTCAGGCTG TGGCGCGAAT CCGCCTATCT GCCCGACGTG
GCCGTCGGCT TTCGCGACAT CGCCGGCTCG GGCCTGTTCT CCGGCGAGTA CCTGGTGGCC
AGCAAGCGAA CCGGGCCGTT CGACTGGAGC GTCGGCCTCG GCTGGGGTTA CGTGGGCGCG
CGCGGCAATC TGCGCAACCC GCTGGCGGTG GTCAGCCGGC GGTTCGACGA TCGCGCGAAC
AGCGCGACAC CGAACGGCGG CGAGCTCGGC TACAGCTCAT GGTTTCGCGG CCGCGTCTCG
CCGTTCGGCG GCGTGCAGTA CCAGACGCCG CACGAGCGCC TCATCCTGAA AGCCGAATAC
GACGGCAACG ACTATCGGCA CGAACCGTTC GGTCAAGTGC TGAAGGCGCG ATCGCCATTC
AACTTCGGCG CCGTCTATCG CGCGACGCGC AACATCGACT TGAGCCTCGG CTTCGAGCGA
GGCGCGCGCA TGATGTTCGG CGTCTCGCTG CACGGCAATC TGAAGCGCGC GTCGATGCCC
AAGCTCGGCA ATCCGCCGGC TCCGCCGGTG ACGCAACCGG CCTCGAACGC CGGGCCGCCC
CCGCCGGCCG CCGATCCGGC ATCGGGCGAC GCGCAAGCGG CGACTGCGCC GGCATCGCGC
ATCGGACGCG CGTCGCCGTC GCCGTTCGAT CGCGACTGGT CCGGCACCGT CGCGCAATTG
CAGGCGCAAA CGCATTGGCA CGTGCGCAGC ATCCGTGCGC TCGGCATGGA TCTCGTCGTC
GAGTTCGACG ACGTCGACGC GTTCTACCTG CAGGACCCGC TCGAGCGCAT CGCGACGATC
CTGAACCGTG ACGCGCCGCT CAACGTGCGC ACGTTCCATG TCGTCGCGCT CGTGCACGGC
GTGCCGGTTG CCGACTATCA GGTGCAGCGC ACGCAGTGGT TCGCGAGCCG CACCCGCGCC
CTCACGCCGA GCGAGGCTGC GCCCGACACG GCGCTCGGCC GGCCGCTCAC GCGACAGTCG
ATCGACATGC TGCCCTCTCT ATTCGAGCAG CGGCCCAAGG CCTTCGTGGC GTCGGTCGGC
CCGGGCTACC GGCAAACCCT TGGCGGTCCG AACGGTTTCC TGCTCTACCA GATCTCCGCC
GATGCATACG GCGAGGTGAG ACTGCCCGGC GGCGCATGGC TCGGCGGCGA ACTGAACGTG
GGGCTCGTCG ACAACTACGG CAAGTTCACC TACACGGCGG ACAGCAAGCT GCCGCGCGTG
CGCACGTATC TGCGCGAGTA CCTGACGACG TCGCGCGTCA CGCTGCCGCT GCTGCAACTG
ACGAAGATGG GACGCCTCGG CAACGATCAG TTCTACAGCG TATACGGCGG GCTGCTCGAA
AGCATGTTCG CGGGCGTCGG GGCCGAATGG CTGTATCGCC CGGCGGATAG CCGCCTCGCG
ATCGGCGTCG ACGTGAACGC GGTGCGGCAG CGCGGCTTCC GCCAGGATTT CTCGATGCGC
GACTATCGGA CGCTCACCGG ACACGTGACG GCGTATTGGA ACACCGGATG GCAAGGCGTC
CAGATCAATC TGAGCGTCGG CCAGTATCTG GCGAAGGACA AGGGCGCGAC GCTCGACATT
TCGCGGCGCT TTCGCAACGG CGTCGTGATC GGCGCCTATG CGACGAAGAC GAACATATCG
GCGGCCCAAT TCGGCGAAGG CAGCTTCGAC AAGGGCATCT ACCTGACGAT TCCGTTCGAC
GCGATGATGA CGCGCTCGAG CGGCAGCGTG GCAAATCTGC GCTGGAACCC CGTGACGCGC
GACGGCGGCG CGAAGCTGGA TCGCAAATAT CCGCTGTACG ATCTCACCGA CATGGGCGAG
CGCCGCAGCT TGTGGTACGC GCCGCCGGAT GGCGCATTGT CGCCGTGA
 
Protein sequence
MPKREATGRK RRGCTWLAVV LACARPFDAA AHVEETIAAP THLGDWLAAH QSTPAVGTPP 
SGAPSPYLGG LSWRSNREVA AQQASKRRLL AGIDALPALT PAAQAARARL SAMIAARAAT
GRVIVARSDA RWLQANPAHD PYLEAGDVVT IPERPSSVAV VRADGSICTV AHVQDVEALP
YVLACDPDAA PDLAWIAQPD GTVSESKVAM WNRDVQDTPA PGSWIWAPDR GSRWPPALSR
ALAEFMATQG VSGLADDGSP LPAPPIAPVH QTAFPSGAPG RSAAFPVTGG DWGTAGILQT
PTARMNDAGE ASLSMSHVSP YTRLNFTLQP LDWLEIGFRY TDVSNQPYGP VSLSGTQSYK
DKSIDAKLRL WRESAYLPDV AVGFRDIAGS GLFSGEYLVA SKRTGPFDWS VGLGWGYVGA
RGNLRNPLAV VSRRFDDRAN SATPNGGELG YSSWFRGRVS PFGGVQYQTP HERLILKAEY
DGNDYRHEPF GQVLKARSPF NFGAVYRATR NIDLSLGFER GARMMFGVSL HGNLKRASMP
KLGNPPAPPV TQPASNAGPP PPAADPASGD AQAATAPASR IGRASPSPFD RDWSGTVAQL
QAQTHWHVRS IRALGMDLVV EFDDVDAFYL QDPLERIATI LNRDAPLNVR TFHVVALVHG
VPVADYQVQR TQWFASRTRA LTPSEAAPDT ALGRPLTRQS IDMLPSLFEQ RPKAFVASVG
PGYRQTLGGP NGFLLYQISA DAYGEVRLPG GAWLGGELNV GLVDNYGKFT YTADSKLPRV
RTYLREYLTT SRVTLPLLQL TKMGRLGNDQ FYSVYGGLLE SMFAGVGAEW LYRPADSRLA
IGVDVNAVRQ RGFRQDFSMR DYRTLTGHVT AYWNTGWQGV QINLSVGQYL AKDKGATLDI
SRRFRNGVVI GAYATKTNIS AAQFGEGSFD KGIYLTIPFD AMMTRSSGSV ANLRWNPVTR
DGGAKLDRKY PLYDLTDMGE RRSLWYAPPD GALSP
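
The sequence blocks above are grouped in runs of 10 characters, so they need whitespace removed before any computation. The sketch below, in Python, shows one way to clean a pasted block, compute its GC percentage, and translate it. The helper names are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code (the tables differ in permitted start codons, which this sketch ignores) and that translation stops at the first stop codon.

```python
from itertools import product

# Standard codon order (T, C, A, G) paired with its amino acids; "*" marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a pasted sequence block."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_bases = clean_sequence("ATGCCGAAGC GTGAGGCGAC GGGCCGCAAG")
print(translate(first_bases))  # MPKREATGRK
```

The printed residues match the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.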