Gene Information |
Locus tag | BURPS668_A2951 |
Symbol | |
ID | 4888730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2796668 |
End bp | 2801287 |
Gene Length | 4620 bp |
Protein Length | 1539 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640132887 |
Product | RhsD protein |
Protein accession | YP_001063942 |
Protein GI | 126445357 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
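
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2796668, 2801287)
print(length)                     # 4620
print(protein_length_aa(length))  # 1539
```

Both results match the Gene Length and Protein Length rows of the record.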
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCTGC CGGCCGTCAA GCACATGGAC CCCGTCGTCG GGATCGACGT GCATTCCGTG ATCGTCACGC CGGGGCTGCC GCCGGTGTTC CTGCCGCATC CGCATGTCGG CTTCATGCTC GATCTGCGCG AATACGTCGA GGCCGCGAAG GGCGTGATCG GCAGCATTGC GATGACGATC GCCGAAGACG CGGTGCAGGA CTACCTGAAG GATCACCCCG ACGACGGCAA GAAACTGACC GATGCGCTCA ATTCGCTCGA CAACCAGAAG AAGCAGGCCG AGCGCGATCC TTGGGTCGCG CAGGCGCTGA AGCTCGATCG CGAAGTGTCG TCAATCAAGG GTGACGTGAC GAACGCGGTG GGCGCGGGAG TCGGGATGGG AGGCGCGGCC GGCCGGCCGA TTTTCGTGAA CGGGCTGATG CGCGCGACGG CGGGCACGCA TTCGTTTCAT GTGCCGGGCT TGCACTTCCC GCTGGGCGAG ACATTCGCGC CGCCCGACCC GTTGCCGTCC GACGACGCCG AATCGTACAT GGGCAGCCGG ACCGTGCTCG CGAACAACGA TCCGATGTCG TTCCTCGCGC TGCCCGCGAT GAGCTGCTGG GCCGAGGGGA TCGAGCCGCC GCCGCACAAC GGTGCGCATA CGACGCGCAC GCATGTGTCG ATGCCGAGTT CGGTGATGCT GCCGATTCCC GTCGGCCGGC CGGTGCTCGT CGGCGGGCCG CCCGTGATGA ACATGGCGGC GGCCGCCAAG GGGATGTTCA AGGCGTTTCA GGGATCGAAG TGGGCGAAGC ACCTCGCGGA CAAACTGCAT CTGAAGCCGG GGTTCCTGCG ATGCGCGGTG CTGAAGGCGG AGCCGGTGGA CGTGACGACG GGCGAAGTGG TCGTGCAGCA GAACGACTTC ACCGTGTCAG GCCGCTTGCC GCTCGCGTGG GACCGCTATT ACGCGAGTCA GGATCGCCAT CCGGGCGCGG TGGGCTTCGG CTGGCAGACG CCGGCCGATA TCCGGATCGA GCTGATGCGC AACGAAGGCG GGATCGGCGC GGTCGCGCAT TTTCCTGATC ATGCGACCGC TTTCAACGCG GTGCCGGCGG ACGACGGCTG GCCGGCGCGC ACGTATGACT GGCAGCATGG GCATGCGCTG TATGGCGACG ATGGGCGGAT GGTGCTGCGC ACGCGCGAGG GCATCGAATA CGGATTCGTG CTGCCTTCCC GTTGGAGCGA TGCCGTCGCG GCGCTCGATG GCGACGATTC GCGACTCACG CTGCCGATCG ATCGCATGGC CGATCTCAAC GGCAATGCGT GGGTGTTCGA GCGCGACGTG TATGGCGGGC TCGTGCGTCT CGTCGAATGG AAACGCGACG GGCGAACCGG GCGCGTGGTC GAATGCGGTA CTAGTAGCGG GCTGCACGCC GGGCTGTTGA CATCGCTGAC GTTGATCGAT GCGGAGGGAA ACGCGCATCC GCTCGTGAGC TACGAGCATG ATCGTGAGCG TAATCTGGCG GCGGCGATCG ACGCGATGGC GCATCCGCAT CATTTCGAAT ATGCGGCCGG GCATCGGATG GTGAGCCACA CGAGCGCGCG CGGCGTGTCG TTCTACTACA GCTACCAGCA GGGCGACGAC GGCGTGTGGC GCGTCGATCA TGCGTGGGGC GATAACGGGC TGTTCGACTA TCGTTTCGTC TACGATCGCG CGCGGATGGA AACGCGCGTC ACCAATTCGC TCGGGCATAC GTCGATCACG CAGATGAACG AGCGCGGGGT GCCTGTTGCG GAAATCGATT CGCTAGGTGG CGTGACCGGG 
TATCGGTATG ACACGCAGGG GCGTGCGAGT GCGGTGATTG ATACGGCGGG GCGGACGACG GCGTGGGAAT ACGACGCGTA CGGCAGTTTG GTTGCGCAGA CGTTGCCGGA TGGCAGTGTA GTGCGCGCGG AGTATGACGC CGACGGCCGG CCCGTTTGTA TCACGATGCC TACCGCGCGG CAACTGCGCT ACGAGTGGGA CGAGCACGGT CATTTGCGCT CACGCACTAC GTCTTCCCGG GCTGTTTCGA GATACGTGTA CGACGCATAC GGGCAACTCG TTTCGTACAG CGGATCGCGT GGTGCGATCA CGCGATTCGA GTACGATCGC GACGGAAATC TGGCGGCTGT TACAGATGCC TTGGGGAATC GTACGCGATA TGTGCGCGAC GCGCGTGGAA GAGTCGTTTG GATGGCCGAC CCGCGCGGGC AGGTAGGGCT TTACGAATAT GACGGCAACG GCAATTTGAC GAGAGCGGTC TTGCCTGGCG GGAAAGAGAT TCGTTGCGAG TTCGATGACG ACGGAAACCT GGTTCATTAT CGCGACGCCG CCGGCCAATC GACAACGTTG GCGTATTCGC CGATCGGGTT GGTGGAACGC AGAATCGCGC CGAGCGGCGG CATTGTCGAT TGCCGTTACG ATACGGAGGG GCAACTGGTC GGCGTCGTCA ACGAACGTGG CGAGCAGTAT GCGCTTAAAC GCGACCCGCT AGGCCGGATC GTGTCCGAGT CGGACTATTG GGGGCAGACA TGGCACTATC GATACGGTGG CTCAGGCGAA TTGCTGTGTA GCACCGATCC GCTCGGGCGG GTGGTCGAGT ATCAGTACGA TCGATGCGGC CGGATCGTGG AGAAGCTTGC GCGGGCCTCT GCGGATGACG CATTTGTCCA GGTCCATCGC TTCGCCTATG ACCAGTGTGG CAATCTGATA CTCGCCGAGA ACGACGACAG CCGGGTCGAG TTTTGCCATG ATGCGGATGG ACGAGTGATT GAAGAAAAAC AAGGCGACGA CTTTACGATC AACAACGTCT TCGATGCGGC CGGCAATCGC ATCGAGCGTC GGACCAGGCT TCTATCGGAA GGCGCGCTCA TCGAACATGT CGTTCACTAT GAGTATGATG TCTTGAACGC CGTGGTTTCG ATTCGGATAG ACGGCTCGGC GGCGGTCGTC ATAGAACGCG ATGATCTTGG CCAGGCCGTC TCAGAGGGAC TGGGCCCTGC GCTGAGACGA GCGTTTTCGT ACGAGGCGGG CGGGCAGCTT GCCGTTCAGA CGCTGTTTGC ATCAACCGGC ACGATGTTCG CAAGCGAGTA TGCATACGAC CCGAACGGCG AGGTGATCGA AAAGCGCGAT GAGCGCGGGC GCATGGAGCG CTTCGAATAT GATTCCGTGG GACGCGTGGT TTCGCACCTC GGCGCTCGCG GCATCGTGCG CCGGCTCGCC TATGATTTGG CTGGGGATCT GTTGCGAACA CGTGTTTCCG GATATGACGC GGCGGCCCTT GCCGACGCGG AAAAAGCCGC GAACTGGATT CGCGACGGCG AATATGAAGG TTGTTACTGT GCGTTCGACC GTGCGGGCAA TATGGTTCAC CGTCGAGATG CCGAGCAGAA TCTGGCTTTG TGCTGGGATC CGGCGGGCCA ACTGCGCGAG ACGATAGCAT TGCGTCCGGC AACGGCTGAT GTGGCGTCCG CTCGGAAGCG TACTCGCACA CAGTACGAAT ACGATGCATT GCGGCGGCGA ACACGCAAGC TGGTGCAAAT CGAATCGGAA GGCGAGCCGG ATTCGGTGTT CTCGTACGTC AGTTGCTTCT 
TCTGGGATGG CGATGCGCTC GTGGGAGAGC GTACGACAGG CGGCTGGGTG GGGCGCGCGG GCCGCGCAGA GGAAGACGAG CGGGACGCAA CATCGCTCGC GCCGCCGTCT CCAGCCGGCT CGAAGAACGG CAACTGGCTG ACGTTACAAC CCGATCACGT ATGTGAGTGG TTCTATTACC CGCGAACATT TCACCCGCTC GGAGTCGTGC ACTGCTATGC CGGCGAGAAG CGCGAGCTCG CGAGGGCGAC GGTTGGCTCG CCTTCGAAGG CGGATGCAAC TTACTTTTAC CAAAGCGATC CGAACGGCGC GCCGGTGCGA ATGCTCGATG TGGAGGGAAA CGTGGCTTGG GAGGCGAGCT ACGACGCCAA TGGAGGCATC GAGCAATTTG GCATTCAGGC GATGCCGCAG CCGCTTCGAT TGCAAGGGCA GTACTTCGAT GCCGAGACGG GCATGAGTTA CAACCGACAT CGTTACTACG ATGCACGGAT CGGCCAGTTC GTCAGCGAAG ACCCTATCCG TCTGAGCGGT GGAGAGAACT TGTACCGCTA TTGCGTCAAC AGCATATCGT GGGCGGATCC TCTCGGACTC GATAGAGTTC CTTTGTTCGA TCCGAACAAT CGGTTGTCTT TCAATGCGAT CTGGGCATAT ACAGGCGGTC TGCCCACTCC GTCCGACGTC ACGGTCGTTC GGCCGGCGAC AAGTTGGGGA AGGAATTCGG GGGTTCTAAA ATGGCAGGAT CATTATTATG TTTTTGCAAG CGCTGGTAAG AAATCGCATT CCGAGGACGC GATCATTGAC TTTATAAAAC AGCACGATAT CAAGCCGAGC GAGATTCAAG GTTTGTATAG CATACTCAGC CCGTGCGCCG AAGAGAAGAA ATCCTGTCTT GAAAAGACCA AAGGCGAAGG GCTGGAATTT GATTGGAGTT TATTCCACCA GACCGACAAC GTAGGTTCGG AAGTCAGGAA GGAGATCGGT GCGCTGATTA AGAATATGAA GGGGGTGTGA
|
Protein sequence | MALPAVKHMD PVVGIDVHSV IVTPGLPPVF LPHPHVGFML DLREYVEAAK GVIGSIAMTI AEDAVQDYLK DHPDDGKKLT DALNSLDNQK KQAERDPWVA QALKLDREVS SIKGDVTNAV GAGVGMGGAA GRPIFVNGLM RATAGTHSFH VPGLHFPLGE TFAPPDPLPS DDAESYMGSR TVLANNDPMS FLALPAMSCW AEGIEPPPHN GAHTTRTHVS MPSSVMLPIP VGRPVLVGGP PVMNMAAAAK GMFKAFQGSK WAKHLADKLH LKPGFLRCAV LKAEPVDVTT GEVVVQQNDF TVSGRLPLAW DRYYASQDRH PGAVGFGWQT PADIRIELMR NEGGIGAVAH FPDHATAFNA VPADDGWPAR TYDWQHGHAL YGDDGRMVLR TREGIEYGFV LPSRWSDAVA ALDGDDSRLT LPIDRMADLN GNAWVFERDV YGGLVRLVEW KRDGRTGRVV ECGTSSGLHA GLLTSLTLID AEGNAHPLVS YEHDRERNLA AAIDAMAHPH HFEYAAGHRM VSHTSARGVS FYYSYQQGDD GVWRVDHAWG DNGLFDYRFV YDRARMETRV TNSLGHTSIT QMNERGVPVA EIDSLGGVTG YRYDTQGRAS AVIDTAGRTT AWEYDAYGSL VAQTLPDGSV VRAEYDADGR PVCITMPTAR QLRYEWDEHG HLRSRTTSSR AVSRYVYDAY GQLVSYSGSR GAITRFEYDR DGNLAAVTDA LGNRTRYVRD ARGRVVWMAD PRGQVGLYEY DGNGNLTRAV LPGGKEIRCE FDDDGNLVHY RDAAGQSTTL AYSPIGLVER RIAPSGGIVD CRYDTEGQLV GVVNERGEQY ALKRDPLGRI VSESDYWGQT WHYRYGGSGE LLCSTDPLGR VVEYQYDRCG RIVEKLARAS ADDAFVQVHR FAYDQCGNLI LAENDDSRVE FCHDADGRVI EEKQGDDFTI NNVFDAAGNR IERRTRLLSE GALIEHVVHY EYDVLNAVVS IRIDGSAAVV IERDDLGQAV SEGLGPALRR AFSYEAGGQL AVQTLFASTG TMFASEYAYD PNGEVIEKRD ERGRMERFEY DSVGRVVSHL GARGIVRRLA YDLAGDLLRT RVSGYDAAAL ADAEKAANWI RDGEYEGCYC AFDRAGNMVH RRDAEQNLAL CWDPAGQLRE TIALRPATAD VASARKRTRT QYEYDALRRR TRKLVQIESE GEPDSVFSYV SCFFWDGDAL VGERTTGGWV GRAGRAEEDE RDATSLAPPS PAGSKNGNWL TLQPDHVCEW FYYPRTFHPL GVVHCYAGEK RELARATVGS PSKADATYFY QSDPNGAPVR MLDVEGNVAW EASYDANGGI EQFGIQAMPQ PLRLQGQYFD AETGMSYNRH RYYDARIGQF VSEDPIRLSG GENLYRYCVN SISWADPLGL DRVPLFDPNN RLSFNAIWAY TGGLPTPSDV TVVRPATSWG RNSGVLKWQD HYYVFASAGK KSHSEDAIID FIKQHDIKPS EIQGLYSILS PCAEEKKSCL EKTKGEGLEF DWSLFHQTDN VGSEVRKEIG ALIKNMKGV
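
The protein sequence can be related to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch, assuming the gene sequence is listed in coding orientation (its GTG start and TGA stop suggest this, even though the gene lies on the minus strand) and that an initiating GTG is read as methionine. The helper name translate_cds and the reduced START_CODONS set are illustrative assumptions, not part of the record.

```python
from itertools import product

# Amino acid assignments in NCBI order (T, C, A, G; third base varies fastest).
# Tables 1 and 11 share these assignments and differ in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only a subset of the table 11 initiation codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence listed above.
head = "GTGGCGCTGC CGGCCGTCAA GCACATGGAC"
tail = "GCGCTGATTA AGAATATGAA GGGGGTGTGA"
print(translate_cds(head))                      # MALPAVKHMD
print(translate_cds(tail, is_cds_start=False))  # ALIKNMKGV
```

The two outputs match the first ten and last nine residues of the protein sequence in the record. Note that the GTG near the end of the gene is translated as valine, because only the first codon is treated as an initiator.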