Gene BURPS1106A_A2720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2720 
Symbol 
ID4903299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2655936 
End bp2658800 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content67% 
IMG OID640145823 
Productbeta-propeller repeat-containing protein 
Protein accessionYP_001076750 
Protein GI126458596 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.501527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATAGAC GTTTCCTGCT CGTTGGGATG ACGGCGCTCG CGCTCGCGGC GTGCAACAGC 
GATTCCGTCG ATACGCCCGC GCTCGGCGAC ACCCGCACGA CGCTGCTTGC GACGGGCCAG
TTGATTTCGC CCGCGGCCGC ACCCGGTGCC GCTTTCGTGC CGCTGAATCC GGGGCTTGCG
GATGCGCCGG GTTACATCGC GGGCCAACCG GTTTCGGAGG CGCTCAGCCC CGACCGCAAG
ACGCTCCTCG TGCTGACGAG CGGCTACAAC AACGTGCTCG ACGCGAACGG CAAGATGATC
AAGCCGGATT CGAACGAATA CGTGTTCGTG TTCGACGTGT CCGGCGGCGC GCCCGTGCGC
AAGCAGGTGC TGCAGGTGAG CGATACCTAC GTCGGCATCG CGTTCGCGCC GGACGGTCAG
CATTTCTACG TGGCGGGCGG CGGCGACGAC AACCTGCATC GCTTCGCGTT CTCGCAGGGC
GCGTGGGGCG AGGTGGGCGC GCCGATCGCG CTGAATCACA AGGCGGGCAA CGGCATCGGC
AGCACGCCGC TTGCGACGGG CGTCGACGTG ACGGCGGACG GCAAGCGCGC GGTCGTCGCG
AATCGCTACA ACGATTCGAT CACGCTCGTC GATCTCGGCG CGGGCGCGGT GCTCGCCGAG
CGCGATCTGC GGCCGGGCAA GAGCGGCGGC GCATCGGGCG CGCCGGGCGG CGAGTATCCG
AACGCGGTGC GCATCGTCGG CAACAAGACG GCCTATGTGT CGAGCGAGCG TGATCGCGAG
ATCGTCGTCG TCGATCTGTC GACGAACGCG CCGCAGGTCG TCACGCGCAT TCCCGTCAAG
GGCAATCCGA ACAAGATGGT GCTGAATGCC GCGCAGTCGC GCCTGTACGT GGCGTCCGAC
AACGCGGACC TCGTGTCGGT GATCGATACG GCCGCGAACA AGGTGGTGTC CACCGTGTCG
ACGGTCGCGC CGGCGGGCCT CGTCACCGAG ATGCAGTATC GCGGCGCGTC GCCGAACGGG
CTCGCGCTAT CGGCCGACGA GCGCACGCTG TACGTGACGA ACCGCGGCAC GAACGACGTG
GCGGTGATTT CGCTTGCCGG CGCGTCGCCC GCCGTCACCG GCCTGATTCC GACCGGCTGG
TATCCGTCCG ACGTCGCGCT CGGCGCGACG AACGCGATGT ACGTCGTCTA CACGAAGAAC
ATGCCGGGGC CGAATCCGGG CAACTGCAAG GACAGCGGTC GCACGGTGCC GTGCCCGGTG
AAGAACACGC CGGTGAAGCT CGTCGAGAAT CAGTACATCG AACAACTGAG CAAGGGCGGC
CTGATGTGGA TGCCGGCGCC CGGCAGCAAG ACGCTCGACC TGTTGACGAC CCAGGTGGCG
AACAACAACA GCTTCAACGC GGCGCTCACG CCGAACGACA TCGCGACGAT GGCCGCGCTG
CGCAAGAAGA TCAAGCACGT GATCTATATC GTCAAGGAGA ACCGCACGTA CGACCAGATT
CTCGGCGACA TCGGCCGCGG CAACAGCGAT CCGTCGCTCG CGGAGTTTCC CGACGCGACG
ACGCCGAGCA TGCACGCGCT CGCGAAGACG TTCGTCACGC TCGACAACTT CTACGATTCC
GGCGACGTGA GCGGCGACGG CTGGCCGTGG AGCACGGGCG CGCGCGAATC GGACGCGGGC
GCGAAGATGC TGCCCGTCAA CTACGCGAAC AGCCCCGCGC GCGCCGGCGC GCGCGGCGGC
TCGTACGACT GGGAAGGCGC GAACCGCAAC GTCAACGTGG GCCTGACGGG CGCGCAACGC
GCGGCCGCGA ATCCGTCGCT GTCGTCCGAT CCGGACCTGC TGCCCGGCAA CGCCGACGTG
TCGGCTCCCG ACGGCCCGTC CGACGCGGTT CAGCAGGGCT ACATCTGGAA CGCCGCGCTG
CGCGCGGGGC TGACGGTGCG CAACTACGGC TTCTTCGCGG ATCTCGCGCG CTACAGCGGC
CCTGGCGCGA TCGCGCCGGA TCGCACGCCG TTCGTCGATC ACGCGGTGCA AACCTATGCG
ACGAGCCCCG CGCTCGTCGA TCGCACCGAT CCGTATTTCC GCGGCTATGA CAACGCATAT
CCGGATTTCG ATCGCGAACT CGAGTGGGAG CGCGAGTTCA ACGGCTTCGT CGCGAACGGC
CAGATGCCGT CGCTCACGCT GCTGCGCCTG CCGCACGATC ATACGGGTTC CTATTCGAGC
GCGCTCGACG GCGTCAACAC GCCGGAGATC CAGATGGCCG ACAACGACTA CGCGGTCGGC
CGCGTCGCGC AGGCCGTCGC GAACAGCCCG TATGCGGCCG ATACGCTGAT CTTCGTCGTC
GAGGACGACG CGCAGGACGG GCCCGATCAC GTCGATGCGC ATCGCAGCAC GGCGTTCGTG
ATCGGGCCTT ACGTGAAGCA GAACGCGGTC GTCAGCACGC ACTACACGAC GGTCAACATG
ATCCGCACGA TCACCGAAGT GCTCGGCCTC GATCACTTGG GGCTGTTCGA CGCGACGCAA
GGCCCGATGA CCGACGTATT CGATCTGAAC CAGTCGAAGT GGAGCTTCAA GGCAGTGGCG
TCCGGCTTGC TCGCGAACAC GCAACTGCCG GTTCCGTCGG GCGACATCAA GACGGCGGCG
TTCAAGCCGA CGCACGGGAT GCGCTACTGG GCGCTCGCGA CGCGCGGCAT GGATTTCTCG
GTCGAGGATC GGCTCGATGC GGTTGCGTAC AACAAGCTGC TGTGGAAGGG GTTGATGAGC
GGGCGCGCGT ACCCGGTGCG CGTCGGCGAG CGTGCGGCGC CGCATCGTCG CGACGACGAT
GACGACGTGA AGGTGTCGCA GGCGAGGAGC GCGGCGCACG GATGA
 
Protein sequence
MYRRFLLVGM TALALAACNS DSVDTPALGD TRTTLLATGQ LISPAAAPGA AFVPLNPGLA 
DAPGYIAGQP VSEALSPDRK TLLVLTSGYN NVLDANGKMI KPDSNEYVFV FDVSGGAPVR
KQVLQVSDTY VGIAFAPDGQ HFYVAGGGDD NLHRFAFSQG AWGEVGAPIA LNHKAGNGIG
STPLATGVDV TADGKRAVVA NRYNDSITLV DLGAGAVLAE RDLRPGKSGG ASGAPGGEYP
NAVRIVGNKT AYVSSERDRE IVVVDLSTNA PQVVTRIPVK GNPNKMVLNA AQSRLYVASD
NADLVSVIDT AANKVVSTVS TVAPAGLVTE MQYRGASPNG LALSADERTL YVTNRGTNDV
AVISLAGASP AVTGLIPTGW YPSDVALGAT NAMYVVYTKN MPGPNPGNCK DSGRTVPCPV
KNTPVKLVEN QYIEQLSKGG LMWMPAPGSK TLDLLTTQVA NNNSFNAALT PNDIATMAAL
RKKIKHVIYI VKENRTYDQI LGDIGRGNSD PSLAEFPDAT TPSMHALAKT FVTLDNFYDS
GDVSGDGWPW STGARESDAG AKMLPVNYAN SPARAGARGG SYDWEGANRN VNVGLTGAQR
AAANPSLSSD PDLLPGNADV SAPDGPSDAV QQGYIWNAAL RAGLTVRNYG FFADLARYSG
PGAIAPDRTP FVDHAVQTYA TSPALVDRTD PYFRGYDNAY PDFDRELEWE REFNGFVANG
QMPSLTLLRL PHDHTGSYSS ALDGVNTPEI QMADNDYAVG RVAQAVANSP YAADTLIFVV
EDDAQDGPDH VDAHRSTAFV IGPYVKQNAV VSTHYTTVNM IRTITEVLGL DHLGLFDATQ
GPMTDVFDLN QSKWSFKAVA SGLLANTQLP VPSGDIKTAA FKPTHGMRYW ALATRGMDFS
VEDRLDAVAY NKLLWKGLMS GRAYPVRVGE RAAPHRRDDD DDVKVSQARS AAHG