Gene BURPS668_A2951 details


Gene Information

Locus tag: BURPS668_A2951
Symbol:
ID: 4888730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2796668
End bp: 2801287
Gene Length: 4620 bp
Protein Length: 1539 aa
Translation table: 11
GC content: 61%
IMG OID: 640132887
Product: RhsD protein
Protein accession: YP_001063942
Protein GI: 126445357
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
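
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though one consistent with the listed gene length) and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(2796668, 2801287)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values from this record, the sketch yields 4620 bp and 1539 aa, matching the Gene Length and Protein Length fields.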


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGCTGC CGGCCGTCAA GCACATGGAC CCCGTCGTCG GGATCGACGT GCATTCCGTG 
ATCGTCACGC CGGGGCTGCC GCCGGTGTTC CTGCCGCATC CGCATGTCGG CTTCATGCTC
GATCTGCGCG AATACGTCGA GGCCGCGAAG GGCGTGATCG GCAGCATTGC GATGACGATC
GCCGAAGACG CGGTGCAGGA CTACCTGAAG GATCACCCCG ACGACGGCAA GAAACTGACC
GATGCGCTCA ATTCGCTCGA CAACCAGAAG AAGCAGGCCG AGCGCGATCC TTGGGTCGCG
CAGGCGCTGA AGCTCGATCG CGAAGTGTCG TCAATCAAGG GTGACGTGAC GAACGCGGTG
GGCGCGGGAG TCGGGATGGG AGGCGCGGCC GGCCGGCCGA TTTTCGTGAA CGGGCTGATG
CGCGCGACGG CGGGCACGCA TTCGTTTCAT GTGCCGGGCT TGCACTTCCC GCTGGGCGAG
ACATTCGCGC CGCCCGACCC GTTGCCGTCC GACGACGCCG AATCGTACAT GGGCAGCCGG
ACCGTGCTCG CGAACAACGA TCCGATGTCG TTCCTCGCGC TGCCCGCGAT GAGCTGCTGG
GCCGAGGGGA TCGAGCCGCC GCCGCACAAC GGTGCGCATA CGACGCGCAC GCATGTGTCG
ATGCCGAGTT CGGTGATGCT GCCGATTCCC GTCGGCCGGC CGGTGCTCGT CGGCGGGCCG
CCCGTGATGA ACATGGCGGC GGCCGCCAAG GGGATGTTCA AGGCGTTTCA GGGATCGAAG
TGGGCGAAGC ACCTCGCGGA CAAACTGCAT CTGAAGCCGG GGTTCCTGCG ATGCGCGGTG
CTGAAGGCGG AGCCGGTGGA CGTGACGACG GGCGAAGTGG TCGTGCAGCA GAACGACTTC
ACCGTGTCAG GCCGCTTGCC GCTCGCGTGG GACCGCTATT ACGCGAGTCA GGATCGCCAT
CCGGGCGCGG TGGGCTTCGG CTGGCAGACG CCGGCCGATA TCCGGATCGA GCTGATGCGC
AACGAAGGCG GGATCGGCGC GGTCGCGCAT TTTCCTGATC ATGCGACCGC TTTCAACGCG
GTGCCGGCGG ACGACGGCTG GCCGGCGCGC ACGTATGACT GGCAGCATGG GCATGCGCTG
TATGGCGACG ATGGGCGGAT GGTGCTGCGC ACGCGCGAGG GCATCGAATA CGGATTCGTG
CTGCCTTCCC GTTGGAGCGA TGCCGTCGCG GCGCTCGATG GCGACGATTC GCGACTCACG
CTGCCGATCG ATCGCATGGC CGATCTCAAC GGCAATGCGT GGGTGTTCGA GCGCGACGTG
TATGGCGGGC TCGTGCGTCT CGTCGAATGG AAACGCGACG GGCGAACCGG GCGCGTGGTC
GAATGCGGTA CTAGTAGCGG GCTGCACGCC GGGCTGTTGA CATCGCTGAC GTTGATCGAT
GCGGAGGGAA ACGCGCATCC GCTCGTGAGC TACGAGCATG ATCGTGAGCG TAATCTGGCG
GCGGCGATCG ACGCGATGGC GCATCCGCAT CATTTCGAAT ATGCGGCCGG GCATCGGATG
GTGAGCCACA CGAGCGCGCG CGGCGTGTCG TTCTACTACA GCTACCAGCA GGGCGACGAC
GGCGTGTGGC GCGTCGATCA TGCGTGGGGC GATAACGGGC TGTTCGACTA TCGTTTCGTC
TACGATCGCG CGCGGATGGA AACGCGCGTC ACCAATTCGC TCGGGCATAC GTCGATCACG
CAGATGAACG AGCGCGGGGT GCCTGTTGCG GAAATCGATT CGCTAGGTGG CGTGACCGGG
TATCGGTATG ACACGCAGGG GCGTGCGAGT GCGGTGATTG ATACGGCGGG GCGGACGACG
GCGTGGGAAT ACGACGCGTA CGGCAGTTTG GTTGCGCAGA CGTTGCCGGA TGGCAGTGTA
GTGCGCGCGG AGTATGACGC CGACGGCCGG CCCGTTTGTA TCACGATGCC TACCGCGCGG
CAACTGCGCT ACGAGTGGGA CGAGCACGGT CATTTGCGCT CACGCACTAC GTCTTCCCGG
GCTGTTTCGA GATACGTGTA CGACGCATAC GGGCAACTCG TTTCGTACAG CGGATCGCGT
GGTGCGATCA CGCGATTCGA GTACGATCGC GACGGAAATC TGGCGGCTGT TACAGATGCC
TTGGGGAATC GTACGCGATA TGTGCGCGAC GCGCGTGGAA GAGTCGTTTG GATGGCCGAC
CCGCGCGGGC AGGTAGGGCT TTACGAATAT GACGGCAACG GCAATTTGAC GAGAGCGGTC
TTGCCTGGCG GGAAAGAGAT TCGTTGCGAG TTCGATGACG ACGGAAACCT GGTTCATTAT
CGCGACGCCG CCGGCCAATC GACAACGTTG GCGTATTCGC CGATCGGGTT GGTGGAACGC
AGAATCGCGC CGAGCGGCGG CATTGTCGAT TGCCGTTACG ATACGGAGGG GCAACTGGTC
GGCGTCGTCA ACGAACGTGG CGAGCAGTAT GCGCTTAAAC GCGACCCGCT AGGCCGGATC
GTGTCCGAGT CGGACTATTG GGGGCAGACA TGGCACTATC GATACGGTGG CTCAGGCGAA
TTGCTGTGTA GCACCGATCC GCTCGGGCGG GTGGTCGAGT ATCAGTACGA TCGATGCGGC
CGGATCGTGG AGAAGCTTGC GCGGGCCTCT GCGGATGACG CATTTGTCCA GGTCCATCGC
TTCGCCTATG ACCAGTGTGG CAATCTGATA CTCGCCGAGA ACGACGACAG CCGGGTCGAG
TTTTGCCATG ATGCGGATGG ACGAGTGATT GAAGAAAAAC AAGGCGACGA CTTTACGATC
AACAACGTCT TCGATGCGGC CGGCAATCGC ATCGAGCGTC GGACCAGGCT TCTATCGGAA
GGCGCGCTCA TCGAACATGT CGTTCACTAT GAGTATGATG TCTTGAACGC CGTGGTTTCG
ATTCGGATAG ACGGCTCGGC GGCGGTCGTC ATAGAACGCG ATGATCTTGG CCAGGCCGTC
TCAGAGGGAC TGGGCCCTGC GCTGAGACGA GCGTTTTCGT ACGAGGCGGG CGGGCAGCTT
GCCGTTCAGA CGCTGTTTGC ATCAACCGGC ACGATGTTCG CAAGCGAGTA TGCATACGAC
CCGAACGGCG AGGTGATCGA AAAGCGCGAT GAGCGCGGGC GCATGGAGCG CTTCGAATAT
GATTCCGTGG GACGCGTGGT TTCGCACCTC GGCGCTCGCG GCATCGTGCG CCGGCTCGCC
TATGATTTGG CTGGGGATCT GTTGCGAACA CGTGTTTCCG GATATGACGC GGCGGCCCTT
GCCGACGCGG AAAAAGCCGC GAACTGGATT CGCGACGGCG AATATGAAGG TTGTTACTGT
GCGTTCGACC GTGCGGGCAA TATGGTTCAC CGTCGAGATG CCGAGCAGAA TCTGGCTTTG
TGCTGGGATC CGGCGGGCCA ACTGCGCGAG ACGATAGCAT TGCGTCCGGC AACGGCTGAT
GTGGCGTCCG CTCGGAAGCG TACTCGCACA CAGTACGAAT ACGATGCATT GCGGCGGCGA
ACACGCAAGC TGGTGCAAAT CGAATCGGAA GGCGAGCCGG ATTCGGTGTT CTCGTACGTC
AGTTGCTTCT TCTGGGATGG CGATGCGCTC GTGGGAGAGC GTACGACAGG CGGCTGGGTG
GGGCGCGCGG GCCGCGCAGA GGAAGACGAG CGGGACGCAA CATCGCTCGC GCCGCCGTCT
CCAGCCGGCT CGAAGAACGG CAACTGGCTG ACGTTACAAC CCGATCACGT ATGTGAGTGG
TTCTATTACC CGCGAACATT TCACCCGCTC GGAGTCGTGC ACTGCTATGC CGGCGAGAAG
CGCGAGCTCG CGAGGGCGAC GGTTGGCTCG CCTTCGAAGG CGGATGCAAC TTACTTTTAC
CAAAGCGATC CGAACGGCGC GCCGGTGCGA ATGCTCGATG TGGAGGGAAA CGTGGCTTGG
GAGGCGAGCT ACGACGCCAA TGGAGGCATC GAGCAATTTG GCATTCAGGC GATGCCGCAG
CCGCTTCGAT TGCAAGGGCA GTACTTCGAT GCCGAGACGG GCATGAGTTA CAACCGACAT
CGTTACTACG ATGCACGGAT CGGCCAGTTC GTCAGCGAAG ACCCTATCCG TCTGAGCGGT
GGAGAGAACT TGTACCGCTA TTGCGTCAAC AGCATATCGT GGGCGGATCC TCTCGGACTC
GATAGAGTTC CTTTGTTCGA TCCGAACAAT CGGTTGTCTT TCAATGCGAT CTGGGCATAT
ACAGGCGGTC TGCCCACTCC GTCCGACGTC ACGGTCGTTC GGCCGGCGAC AAGTTGGGGA
AGGAATTCGG GGGTTCTAAA ATGGCAGGAT CATTATTATG TTTTTGCAAG CGCTGGTAAG
AAATCGCATT CCGAGGACGC GATCATTGAC TTTATAAAAC AGCACGATAT CAAGCCGAGC
GAGATTCAAG GTTTGTATAG CATACTCAGC CCGTGCGCCG AAGAGAAGAA ATCCTGTCTT
GAAAAGACCA AAGGCGAAGG GCTGGAATTT GATTGGAGTT TATTCCACCA GACCGACAAC
GTAGGTTCGG AAGTCAGGAA GGAGATCGGT GCGCTGATTA AGAATATGAA GGGGGTGTGA
 
Protein sequence
MALPAVKHMD PVVGIDVHSV IVTPGLPPVF LPHPHVGFML DLREYVEAAK GVIGSIAMTI 
AEDAVQDYLK DHPDDGKKLT DALNSLDNQK KQAERDPWVA QALKLDREVS SIKGDVTNAV
GAGVGMGGAA GRPIFVNGLM RATAGTHSFH VPGLHFPLGE TFAPPDPLPS DDAESYMGSR
TVLANNDPMS FLALPAMSCW AEGIEPPPHN GAHTTRTHVS MPSSVMLPIP VGRPVLVGGP
PVMNMAAAAK GMFKAFQGSK WAKHLADKLH LKPGFLRCAV LKAEPVDVTT GEVVVQQNDF
TVSGRLPLAW DRYYASQDRH PGAVGFGWQT PADIRIELMR NEGGIGAVAH FPDHATAFNA
VPADDGWPAR TYDWQHGHAL YGDDGRMVLR TREGIEYGFV LPSRWSDAVA ALDGDDSRLT
LPIDRMADLN GNAWVFERDV YGGLVRLVEW KRDGRTGRVV ECGTSSGLHA GLLTSLTLID
AEGNAHPLVS YEHDRERNLA AAIDAMAHPH HFEYAAGHRM VSHTSARGVS FYYSYQQGDD
GVWRVDHAWG DNGLFDYRFV YDRARMETRV TNSLGHTSIT QMNERGVPVA EIDSLGGVTG
YRYDTQGRAS AVIDTAGRTT AWEYDAYGSL VAQTLPDGSV VRAEYDADGR PVCITMPTAR
QLRYEWDEHG HLRSRTTSSR AVSRYVYDAY GQLVSYSGSR GAITRFEYDR DGNLAAVTDA
LGNRTRYVRD ARGRVVWMAD PRGQVGLYEY DGNGNLTRAV LPGGKEIRCE FDDDGNLVHY
RDAAGQSTTL AYSPIGLVER RIAPSGGIVD CRYDTEGQLV GVVNERGEQY ALKRDPLGRI
VSESDYWGQT WHYRYGGSGE LLCSTDPLGR VVEYQYDRCG RIVEKLARAS ADDAFVQVHR
FAYDQCGNLI LAENDDSRVE FCHDADGRVI EEKQGDDFTI NNVFDAAGNR IERRTRLLSE
GALIEHVVHY EYDVLNAVVS IRIDGSAAVV IERDDLGQAV SEGLGPALRR AFSYEAGGQL
AVQTLFASTG TMFASEYAYD PNGEVIEKRD ERGRMERFEY DSVGRVVSHL GARGIVRRLA
YDLAGDLLRT RVSGYDAAAL ADAEKAANWI RDGEYEGCYC AFDRAGNMVH RRDAEQNLAL
CWDPAGQLRE TIALRPATAD VASARKRTRT QYEYDALRRR TRKLVQIESE GEPDSVFSYV
SCFFWDGDAL VGERTTGGWV GRAGRAEEDE RDATSLAPPS PAGSKNGNWL TLQPDHVCEW
FYYPRTFHPL GVVHCYAGEK RELARATVGS PSKADATYFY QSDPNGAPVR MLDVEGNVAW
EASYDANGGI EQFGIQAMPQ PLRLQGQYFD AETGMSYNRH RYYDARIGQF VSEDPIRLSG
GENLYRYCVN SISWADPLGL DRVPLFDPNN RLSFNAIWAY TGGLPTPSDV TVVRPATSWG
RNSGVLKWQD HYYVFASAGK KSHSEDAIID FIKQHDIKPS EIQGLYSILS PCAEEKKSCL
EKTKGEGLEF DWSLFHQTDN VGSEVRKEIG ALIKNMKGV