Gene BURPS1106A_A2806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2806 
Symbol 
ID4905472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2739817 
End bp2744475 
Gene Length4659 bp 
Protein Length1552 aa 
Translation table11 
GC content61% 
IMG OID640145909 
ProductYD repeat-/RHS repeat-containing protein 
Protein accessionYP_001076835 
Protein GI126455872 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGCAG CTGAATGGGG AGGGGGCGTG GCACTACCCG CTGTCAAGCA TCTGGACCCC 
GTGGTCGGGA TCGATGTTCA TTCGGTACTG GTCACACCCG GTACGCCGCC GGTGTTCCTG
CCGCATCCCT ACGTCGGTTT CATGCTCGAT CGCCGCGAGT ACATCGACGC CGCGCTGGGC
GTGATCGGCA GTATCGTGTT TACGTTTACA CCCCTCGGCA AGGAGGCGGA AGCCGGTCTG
AAGTGGGCAA AGAAGCAGGC GAAGGCCGAA CTGATGAGCG ACCCGCTGAT TGCCGAAGGC
GTGAAGCTCG GCAAGGAAGC GGCCCCGGTT GCTGGCGACA TCGCCGGCGC GGTCGGCGCT
GGTGTGGGGA TGGGAAGCAT GATGGGCCGC CCCATCTTCG TAAACGGCTT CTTGCGGGCC
ACGGCGGGTA CCCACTCGTT TCATGTACCG GGTCTGCACT TCCCGTTGGG CGAATCGTTC
GCACCGCCGC CGGAGGATTT TGAGCCTTCG AACGACGGGG AATCGTACAT GGGCAGCAGA
ACGGTGCTCG CCAACAACGA TCCAATGTCG TTCATGGCGC TGCCGGCGAT GAGCTGCTGG
TCGATCGGGC TGGAGCCGCC ACCGCATAAC GGTGCGCACA CGCAGAGAAC GGCGCTGTCG
ATGCCAAGTT CGGTGATGCT GCCGATACCG GTGGGGCGGC CGGTGCTGGT GGGTGGGCCA
CCGGTCATGA ACATGGCGGC ACTGGCTTCG TCGCTGTTCA AGGCCTTTCG CGGAAGCGAC
TGGGCCAAGG CGCTGGCCGA CAAGCTGCAT CTGAAACCCG GTTTCCTGCG CTGCAATGTG
CTGCGCGCGG AGCCGGTGGA CGCGATCACG GGTGAGGTCG TGGTTCAGCA GCATGACTTC
ACGGTCCCCG GCCGCCTGCC GCTCGTGTGG AACCGGTATT ACACCAGCCA CGACACCTAT
GTCGGTGCGG TTGGTATTGG CTGGCAGACG CCTGCCGATA TCCGGCTCGA ACTGACCCGA
CATGAAGGCG CGGTGGGGGT AGTGGCGTAT TTCCCGGATC ACGCGACGGC TTTCGATGCA
ATACCGGATG CAGCCGGGTG GCCGGCGCGT GTGTATGACT GGCAGCACGG ACAGGCGCTA
TATCGGCAGG ACGACCGGCT GGTCCTGCGC ACGCGTGCCG GGTTCGAATA TGAGTTTGCG
TTGCCCGCAC ACTGGCAACA CGCGCTGGAG AGGCTCGCGG AAGACGCCAC GTTGACGCTT
CCTGTCGAAC GGATGGCCGA CCTCAACGGC AATGCGTGGG TGTTCGAGCG GGGACTGGAC
AGCAGCCTGA CGCGTCTGGT CGAGTGGAAG GCCGAGGAGG CGACCGGCCG CATGATCGCG
TGCGCGGCGA GCGCAAGCAG CCGGGCGGGA GACCACGTAA GTCTCCTCAC CGCGCTGACC
CTGATCGATG CCGGCGGCCG CGCGCATCCG CTTGTCAGTT ACCAGCACGA CCGAAACGTC
GATCTGGTCG CCGCGATCGA TGCGATGGGC CAGCCGCATC GTTACGCCTA CACCGATGGG
CACCGGATGA TTAGCCACAC GAGTGCCCGG GGGATATCTT TCTACTACAG CCACCGTCGG
CATGACGACG GCGTATGGCG GGTCGACCAT GCGTGGGGGG ACAACGGTCT CTTCGACTAC
CGCTTTATCT ATGATCCCGC GCGTCATGAA ACCCGCATCA CTGATTCCCT GGGGCACACG
ACGCTCCTGC AATCAGACGA ACGTGGCATG CCGGTCGCCC GGATCGATCC GCTCGGTGGT
GTGACGAGTT ACCGGTACGA CGCTCAGGGG CGCACCAACG CGGAAACCGA CCCTGCGGGA
CGCACCACCG CGTGGGAATA CGACACGTTC GCCAATCTGC TCATGCGGAC CTTGCCGGAT
GGCACTGCGT TACGCACCGA GTACAACACC GGCCACAAGC CTGTGTGCGT GGCGGTGCCC
GGAGGCGGGC AATGGCGCTA CACGTGGGAC GAACGAGGTA ACCTGCTCAT GCAAACGACG
CCATCGCAAG CCAGCGTTCG CTACGCATAC GATCAGTATG GCCAACTCAT AGCGCACACA
GGGCCACGTG GCACGGTGAC GCGGTTCGAC TACGACCGGA GTGGCCTTCT CGCGGTGCGG
ACGGATGCAC TCGGCCATCG CACGCAGTAC ACGCACGACG CGCTTGGGAA TCTGGTGCAT
GTTATCAACG CACTAGGGCA GACGAGCCGT TACGAATACG ACAACATCGG CAATCTGACG
CGCGCCATCG AACCGGGCGG GCGCGAGGTC CACTGTGTCT ATGATGCCGA TGGCAATCTG
GTGCGCTACC GCGATGCGAC CGGCAACGTG ACTCAACTGG TCTATACGGC GCTTGGGCAT
ATCAGCAAAA GACAGACGCC GGACGGAAAC GTCGTCGAAT ACCGCTACGA CACCGAGGAA
CAACTGGTGG GCGTGGTCAA TGGGCGCGGC GAGATCTACG AAATCAAGCG CGACGAGCTG
GGCCGGATTG TCGAGGAAAC GGATTACTGG GGCCAACCGA GGCATTACCG GTACGGCGCC
GCAGGCGAAC TGCTCGATAT TACTGATCCT TTGGGACAAA CCATCGAGTA TCGCTGCGAT
CGGCTCGGCC GCGTCGTGGA AAAGCGCATG CCGGACCCCC TGCACCCCGA CGGCGTGCGC
ATCGACCGCT TCGTATATGA CCAGCATGGT GACCTCGTGC TTGCGGAGAA CCCGTCGAGT
CGCGTCGAGT TCCGCTACGA TGCAGACGGC CGGGTGATCG CGGAAAGGCA GGGCGACGAT
TTCACGATTG CGAGCACCTA CGACGCAAGC GGCAACCGGA CCGAGCGCAA AACCCGGCTC
GTCGCCGACA GTGATGTGGT CGAACATACC GTGCGGTATG AATATGACGC GCTGGACGCG
GTGATAGCGA TCCAGATCGA CGATGCGGCG CCTATTGTCC TTGAACGCGA TGCAGTCGGT
CAGGTTTGTG TCGAGCAATT GAGCCCAGAG TTGCGGCGCG AACTGTCGTA CGAAGCGGGT
GGGCAACTAG CGAAGCAGAC GCTGCTAGGT GGTACCGGCC TGCTGTTCGC AAGCGAATAC
GCGTACGATG CGAATGACGA GTTGGTAGAG AAGCGCGACT CGCGCACCGG CGTTGAGCGC
TTTCAGTACG ACCCGGTAGG CCGGATAGTC GCGCATACCG ACCCTACGGG CCGGCTGCGC
AGTTACGTGT ACGATCCTGC GGGCGACCTG CTGAAAACGC ACATCCACGA ACGCCGCACG
GCAGGCGCAA CTGAGATGAC GCAGACTGGC ACATGGGTTC GAGAAGGCGA ATTCGAAGGC
CATTACCACG CCTACGATCG CGCGGGTAAC CTGATACGCA GGCAGGACGT CGGGCAGGAC
CTTACCCTGC GTTGGGATGC GGCCGGCCAA CTGGCCGAAG CGGTAGCCGT GCGGCCGGCC
ATTGCTGGAG CGGGCGGGGG GCAGGTCCGT ATCGGCACGC AGTACGAATA TGATGCGTTC
CGTCGCCGGG TGGGCAAGCT TGTGCTTACC CGCGCGGCAG GCAGTGCGGA GCTGGTATTG
TCGCGCATCA GTCGTTTCTT CTGGGACGGC AATACGCTGG TGGGCCAGTG CACGAGAGGT
GGCGGCGAGG GAGGCGGTAC GGCGATACCG GATGCGGGCG ATCGAATGCC AGCAGCGCAG
TTCATCCCGG TCCGTGACAA TAGGGATGAC GCACCGGTGT CCGGGTTTGA GCATGCGTAT
GAGTGGGTCT ACTATCCAGG AATGTTCCGG CCGCTCGCGG TAGTGCATTG TGATCTGGCA
GCGACCAGAG CATCTGTGCC GACGACAACA AATTTGCTGC CGCTCATCGG GGCAGTGTAC
TTCTTCCAGA GCGATCTGAA CGGAGCGCCG GTCCAGATGC ATGCACCCGG TGGAAGAGTC
GTATGGGAAG CGCGTTACGA CCCAATCGGA AGAAGCGAGC AACTCGGACT TCAACTGGTC
GAGCAGCCGG TTAGGTTGCA GGGGCAGTAT TTCGATGCAG AAACTGGCCT CAATTACAAT
CGGCATCGAT ATTTTGATCC GAATTCTGGG ATCTTTATTA GCCAGGATCC AATTCGACTG
TCTGGTGGTT TAAATCTCTA TCAATATGCG CCGGAGACAA ACAATTGGAT AGATCCTCTT
GGTTGTTCTG GGCACAGGCG ACGACATGAA AAAATGCCTC CCGAGGGCGC GCCATTGACC
ACAGGAAATC TATTTCGCCA TGCACAAGAC GAAGGGATTC CCCCCATATT CAGTCGCAAG
GATGACGAAT TTTATCACAG GCTTATTGAG ATATATGCTG GAACTGGTGT GTTGAACGCC
TATATCAGGG GCATAGCAAG TCATGTCGAA CCAAAGGCGG GCTTGATTTT GAACGAGGAT
GGAAATGGCT GGAAGATCGG ATCATTGTAC ATTAACTATC CCGATGGTCC GTGTCCAGGT
TGTCGGAGAC TCATGCCGTT TATACTCAAT GACGGCAGCA TTCTTTACGT AACATTTCCC
ACGCTAGGAT TGGACGGGTA TTCTTACGGC CATTTCCATG GCGGAGTGTC AGGATTTTTC
AGGGAGGGCA CACCATGCAA CTTACGCCAT CCTGAATGA
 
Protein sequence
MAAAEWGGGV ALPAVKHLDP VVGIDVHSVL VTPGTPPVFL PHPYVGFMLD RREYIDAALG 
VIGSIVFTFT PLGKEAEAGL KWAKKQAKAE LMSDPLIAEG VKLGKEAAPV AGDIAGAVGA
GVGMGSMMGR PIFVNGFLRA TAGTHSFHVP GLHFPLGESF APPPEDFEPS NDGESYMGSR
TVLANNDPMS FMALPAMSCW SIGLEPPPHN GAHTQRTALS MPSSVMLPIP VGRPVLVGGP
PVMNMAALAS SLFKAFRGSD WAKALADKLH LKPGFLRCNV LRAEPVDAIT GEVVVQQHDF
TVPGRLPLVW NRYYTSHDTY VGAVGIGWQT PADIRLELTR HEGAVGVVAY FPDHATAFDA
IPDAAGWPAR VYDWQHGQAL YRQDDRLVLR TRAGFEYEFA LPAHWQHALE RLAEDATLTL
PVERMADLNG NAWVFERGLD SSLTRLVEWK AEEATGRMIA CAASASSRAG DHVSLLTALT
LIDAGGRAHP LVSYQHDRNV DLVAAIDAMG QPHRYAYTDG HRMISHTSAR GISFYYSHRR
HDDGVWRVDH AWGDNGLFDY RFIYDPARHE TRITDSLGHT TLLQSDERGM PVARIDPLGG
VTSYRYDAQG RTNAETDPAG RTTAWEYDTF ANLLMRTLPD GTALRTEYNT GHKPVCVAVP
GGGQWRYTWD ERGNLLMQTT PSQASVRYAY DQYGQLIAHT GPRGTVTRFD YDRSGLLAVR
TDALGHRTQY THDALGNLVH VINALGQTSR YEYDNIGNLT RAIEPGGREV HCVYDADGNL
VRYRDATGNV TQLVYTALGH ISKRQTPDGN VVEYRYDTEE QLVGVVNGRG EIYEIKRDEL
GRIVEETDYW GQPRHYRYGA AGELLDITDP LGQTIEYRCD RLGRVVEKRM PDPLHPDGVR
IDRFVYDQHG DLVLAENPSS RVEFRYDADG RVIAERQGDD FTIASTYDAS GNRTERKTRL
VADSDVVEHT VRYEYDALDA VIAIQIDDAA PIVLERDAVG QVCVEQLSPE LRRELSYEAG
GQLAKQTLLG GTGLLFASEY AYDANDELVE KRDSRTGVER FQYDPVGRIV AHTDPTGRLR
SYVYDPAGDL LKTHIHERRT AGATEMTQTG TWVREGEFEG HYHAYDRAGN LIRRQDVGQD
LTLRWDAAGQ LAEAVAVRPA IAGAGGGQVR IGTQYEYDAF RRRVGKLVLT RAAGSAELVL
SRISRFFWDG NTLVGQCTRG GGEGGGTAIP DAGDRMPAAQ FIPVRDNRDD APVSGFEHAY
EWVYYPGMFR PLAVVHCDLA ATRASVPTTT NLLPLIGAVY FFQSDLNGAP VQMHAPGGRV
VWEARYDPIG RSEQLGLQLV EQPVRLQGQY FDAETGLNYN RHRYFDPNSG IFISQDPIRL
SGGLNLYQYA PETNNWIDPL GCSGHRRRHE KMPPEGAPLT TGNLFRHAQD EGIPPIFSRK
DDEFYHRLIE IYAGTGVLNA YIRGIASHVE PKAGLILNED GNGWKIGSLY INYPDGPCPG
CRRLMPFILN DGSILYVTFP TLGLDGYSYG HFHGGVSGFF REGTPCNLRH PE