Gene BURPS668_A2873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2873 
Symbol 
ID4887988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2729255 
End bp2732119 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content67% 
IMG OID640132809 
Productputative lipoprotein 
Protein accessionYP_001063865 
Protein GI126444258 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATAGAC GTTTCCTGCT CGTTGGGATG ACGGCGCTCG CGCTCGCGGC GTGCAACAGC 
GATTCCGTCG ATACGCCCGC GCTCGGCGAC ACCCGCACGA CGCTGCTTGC GACGGGCCAG
TTGATTTCGC CCGCGGCCGC ACCCGGTGCC GCGTTCGTGC CGCTGAATCC GGGGCTTGCG
GATGCGCCGG GTTACATCGC GGGCCAACCG GTTTCGGAGG CGCTCAGCCC CGACCGCAAG
ACGCTCCTCG TGCTGACGAG CGGCTACAAC AACGTGCTCG ACGCGAACGG CAAGATGATC
AAGCCGGATT CGAACGAATA CGTGTTCGTG TTCGACGTGT CCGGCGGCGC GCCCGTGCGC
AAGCAGGTGC TGCAGGTGAG CGATACCTAC GTCGGCATCG CGTTCGCGCC GGACGGTCAG
CATTTCTACG TGACGGGCGG CGGCGACGAC AACCTGCATC GCTTCGCGTT CTCGCAGGGC
GCGTGGGGCG AGGTGGGCGC GCCGATCGCG CTGAATCACA AGGCGGGCAA CGGCATCGGC
AGCACGCCGC TTGCGACGGG CGTCGACGTG ACGGCGGACG GCAAGCGCGC GGTCGTCGCG
AATCGCTACA ACGATTCGAT CACGCTCATC GATCTCGGCG CGGGCGCGGT GCTCGCCGAG
CGCGATCTGC GGCCAGGCAA GAGCGGCGGC GCATCGGGCG CGCCGGGCGG CGAGTATCCG
AACGCGGTGC GCATCGTCGG CAACAAGACG GCCTATGTGT CGAGCGAGCG TGATCGCGAG
ATCGTCGTCG TCGATCTGTC GACGAACGCG CCGCAGGTCG TCACGCGCAT TCCCGTCAAG
GGCAATCCGA ACAAGATGGT GCTGAATGCC GCGCAATCGC GCCTGTACGT GGCGTCCGAC
AACGCGGACC TCGTGTCGGT GATCGATACG GCCGCGAACA AGGTGGTGTC CACCGTGTCG
ACGGTCGCGC CGGCGGGCCT CGTCACCGAG ATGCAGTATC GCGGCGCGTC GCCGAACGGG
CTCGCGCTAT CGGCCGACGA GCGCACGCTG TACGTGACGA ACCGCGGCAC GAACGACGTG
GCGGTGATTT CGCTTGCCGG CGCGTCGCCC GCCGTCACCG GCCTGATTCC GACCGGCTGG
TATCCGTCCG ACGTCGCGCT CGGCGCGACG AACGCGATGT ACGTCGTCTA CACGAAGAAC
ATGCCGGGGC CGAATCCGGG CAACTGCAAG GACAGCGGTC GCACGGTGCC GTGCCCGGTG
AAGAACACGC CGGTGAAGCT CGTCGAGAAT CAGTACATCG AACAACTGAG CAAGGGCGGC
CTGATGTGGA TGCCGGCGCC CGGCAGCAAG ACGCTCGACC TGTTGACGAC CCAGGTGGCG
AACAACAACA GCTTCAACGC GGCGCTCACG CCGAACGACA TCGCGACGAT GGCCGCGCTG
CGCAAGAAGA TCAAGCACGT GATCTATATC GTCAAGGAGA ACCGCACGTA CGACCAGATT
CTCGGCGACA TCGGCCGCGG CAACAGCGAT CCGTCGCTCG CGGAGTTTCC CGACGCGACG
ACGCCGAGCA TGCACGCGCT CGCGAAGACG TTCGTCACGC TCGACAACTT CTACGATTCC
GGCGACGTGA GCGGCGACGG CTGGCCGTGG AGCACGGGCG CGCGCGAATC GGACGCGGGC
GCGAAGATGC TGCCCGTCAA CTACGCGAAC AGCCCCGCGC GCGCCGGCGC GCGCGGCGGC
TCGTACGACT GGGAAGGCGC GAACCGCAAC GTCAACGTGG GCCTGACGGG CGCGCAACGC
GCGGCCGCGA ATCCGTCGCT GCCGTCCGAT CCGGACCTGC TGCCCGGCAA CGCCGACGTG
TCGGCTCCCG ACGGCCCGTC CGACGCGGTT CAGCAGGGCT ACATCTGGAA CGCCGCGCTG
CGCGCGGGGC TGACGGTGCG CAACTACGGT TTCTTCGCGG ATCTCGCGCG CTACAGCGGC
CCTGGCGCGA TCGCGCCGGA TCGCACGCCG TTCGTCGATC ACGCGGTGCA AACCTACGCG
ACGAGCCCCG CGCTCGTCGA TCGCACCGAT CCGTATTTCC GCGGCTATGA CAACGCATAT
CCGGATTTCT ATCGCGAACT CGAGTGGGAG CGCGAGTTCA ACGGCTTCGT CGCGAACGGC
CAGATGCCGT CGCTCACGCT GCTGCGCCTG CCGCACGATC ATACGGGTTC CTATTCGAGC
GCGCTCGACG GCGTCAACAC GCCGGAGATC CAGATGGCCG ACAACGACTA CGCGGTCGGC
CGCGTCGCGC AGGCCGTCGC GAACAGCCCG TATGCGGCCG ATACGCTGAT CTTCGTCGTC
GAGGACGACG CGCAGGACGG GCCCGATCAC GTCGATGCGC ATCGCAGCAC GGCGTTCGTG
ATCGGGCCTT ACGTGAAGCA GAACGCGGTC GTCAGCACGC ACTACACGAC GGTCAACATG
ATCCGCACGA TCACCGAAGT GCTCGGCCTC GATCACTTGG GGCTGTTCGA CGCGACGCAA
GGCCCGATGA CCGACGTATT CGATCTGAAC CAGTCGAAGT GGAGCTTCAA GGCGGTGGCG
TCCGGCTTGC TCGCGAACAC GCAACTGCCG GTTCCGTCGG GCGACATCAA GACGGCGGCG
TTCAAGCCGA CGCACGGGAT GCGCTACTGG GCGCTCGCGA CGCGCGGCAT GGATTTCTCG
GTCGAGGATC GGCTCGATGC GGTTGCGTAC AACAAGCTGC TGTGGAAGGG GTTGATGAGC
GGGCGCGCGT ACCCGGTGCG CGTCGGCGAG CGTGCGGCGC CGCATCGTCG CGACGACGAT
GACGACGTGA AGGTGTCGCA GGCGAGGAGC GCGGCGCACG GATGA
 
Protein sequence
MYRRFLLVGM TALALAACNS DSVDTPALGD TRTTLLATGQ LISPAAAPGA AFVPLNPGLA 
DAPGYIAGQP VSEALSPDRK TLLVLTSGYN NVLDANGKMI KPDSNEYVFV FDVSGGAPVR
KQVLQVSDTY VGIAFAPDGQ HFYVTGGGDD NLHRFAFSQG AWGEVGAPIA LNHKAGNGIG
STPLATGVDV TADGKRAVVA NRYNDSITLI DLGAGAVLAE RDLRPGKSGG ASGAPGGEYP
NAVRIVGNKT AYVSSERDRE IVVVDLSTNA PQVVTRIPVK GNPNKMVLNA AQSRLYVASD
NADLVSVIDT AANKVVSTVS TVAPAGLVTE MQYRGASPNG LALSADERTL YVTNRGTNDV
AVISLAGASP AVTGLIPTGW YPSDVALGAT NAMYVVYTKN MPGPNPGNCK DSGRTVPCPV
KNTPVKLVEN QYIEQLSKGG LMWMPAPGSK TLDLLTTQVA NNNSFNAALT PNDIATMAAL
RKKIKHVIYI VKENRTYDQI LGDIGRGNSD PSLAEFPDAT TPSMHALAKT FVTLDNFYDS
GDVSGDGWPW STGARESDAG AKMLPVNYAN SPARAGARGG SYDWEGANRN VNVGLTGAQR
AAANPSLPSD PDLLPGNADV SAPDGPSDAV QQGYIWNAAL RAGLTVRNYG FFADLARYSG
PGAIAPDRTP FVDHAVQTYA TSPALVDRTD PYFRGYDNAY PDFYRELEWE REFNGFVANG
QMPSLTLLRL PHDHTGSYSS ALDGVNTPEI QMADNDYAVG RVAQAVANSP YAADTLIFVV
EDDAQDGPDH VDAHRSTAFV IGPYVKQNAV VSTHYTTVNM IRTITEVLGL DHLGLFDATQ
GPMTDVFDLN QSKWSFKAVA SGLLANTQLP VPSGDIKTAA FKPTHGMRYW ALATRGMDFS
VEDRLDAVAY NKLLWKGLMS GRAYPVRVGE RAAPHRRDDD DDVKVSQARS AAHG