Gene Information |
Locus tag | BTH_II0267 |
Symbol | |
ID | 3845192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 329863 |
End bp | 334482 |
Gene Length | 4620 bp |
Protein Length | 1539 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637837573 |
Product | rhsD protein |
Protein accession | YP_438469 |
Protein GI | 83716862 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
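The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 329863
END_BP = 334482
REPORTED_GENE_LENGTH = 4620     # bp
REPORTED_PROTEIN_LENGTH = 1539  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 334482 - 329863 + 1 = 4620 bp, which is 1540 codons, or 1539 residues once the stop codon is excluded, in agreement with the table.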
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTGC CGGCCGTCAA GCACATGGAC CCCGTCGTCG GGATCGACGT GCATTCGGTA ATCGTCACGC CGGGGCTGCC GCCGGTGTTT CTGCCGCATC CGCATGTCGG CTTCATGCTC GATCTGCGCG AATACGTCGA GGCCGCGAAG GGCGTGATCG GCAGCATTGC GATGACGATC GCCGAAGAGG CGGTGCAGGA CTACCTGAAG GATCACCCCG ACGACAGCAA GAAACTGACC GATGCGCTCA ATTCGCTCGA CAACCAAAAG AAGCAGGCCG AGCGCGATCC GATGGTCGCG GAGATGCTGA GGCTCAATCG CGACGTGTCG TCGATCAGGG GCGACGTGGC GAACGCGGTG GGCGCGGGCG TCGGGATGGG GGGCGCAGCC GGCCGGCCGA TTTTCGTGAA CGGGCTGATG CGCGCGACGG CGGGTACGCA TTCGTTTCAC GTGCCGGGCT TGCACTTCCC GCTGGGCGAA ACGTTCGCGC CGCCCGATCC GCTGCCGTCC GACGACGCCG AATCGTACAT GGGCAGCCGC ACCGTACTGG CGAACAACGA TCCGATGTCG TTCCTCGCGC TGCCCGCGAT GAGCTGCTGG GCCGAGGGGA TCGAGCCGCC GCCGCACAAC GGCGCGCATA CGGCGCGCAC GCATGTGTCG ATGCCGAGTT CGGTGATGCT GCCGATTCCC GCCGGGCGAC CGGTGCTCGT CGGTGGGCCG CCCGTGATGA ACATGGCGGC GGCCGCGAAG GGGATGTTCA AGGCGTTTCA GGGATCGAAG TGGGCGAAGA AGCTCGCGGA CAAGCTGCAT CTGAAGTCTG GGTTCCTGCG ATGCGCGGTG CTGAAGGCGG AGCCGGTGGA CGTGACGACG GGCGAAGTGG TCGTGCAGCA GAACGACTTC ACCGTCTCCG GCCGGTTGCC GCTCGTGTGG GATCGATACT ACGCGAGTCA CGATCGCCAT GCAGGCGCGG TGGGCTTCGG CTGGCAGACG CCCGCCGATA TCCGGCTCGA GCTGATGCGC AACGAAGATG GCATCGGCGC GGCTGCTTGT TTCCCCGATC ATGCGACCGC CTTCGACATG GTGCCGGCAG ACGATGGTTG GCCGGCGCGC ACGTATGACT GGCAGCATGG GCACGCGCTG TATCGCGACG ATGGGCGGAT GGTGCTGCGC ACGCGCGAGG GCATCGAATA CGGATTCGTG CTGCCTTCCC GTTGGCGAAA TGCCGTCGCG GCGCTCGATG GCGACGATTC GCGACTCACG CTGCCGATCG GCCGCATGGC CGATCTCAAC GGCAACGCGT GGGTGTTCGA GCGTGACGTG TATGGCGGGC TCGTGCGTCT CGTCGAATGG AAGCGCGACG GGCGGACCGA GCGCGTGGTC GAGTGCGGCA CGGGCAGCGG GCTGCACGCC GGGCTGTTGA CATCGCTGAC GCTGATCGAC GCAGGCGGAA ACGCGCATCC GCTCGTGAGC TACGAGCATG ACCGCGAGCG CAATCTGGCG GCGGCGATCG ACGCGATGGC GCATCCGCAT CATTTCGAAT ATGCGGCCGG GCATCGGATG GTGAGCCACA CGAGTGCGCG CGGCGTGTCG TTCCGCTACA GCTACCAGCA GGGCGACGAC GGCGTGTGGC GCGTCGATCA TGCGTGGGGC GATAACGGGC TGTTCGACTA TCGTTTCGTC TACGATCGCG CGCGGATGGA AACGCGCGTC ACCAATTCGC TCGGGCATAC GACGATCACG CAGATGAACG AGCGCGGGAT GCCTGTTGCG GAAATCGATT CGCTAGGCGG CGTGACCGGG 
TATCGGTATG ACACGCAGGG GCGTGCGAGT GCGGTGATTG ATACGGCGGG GCGGACGACG GCGTGGGAAT ACGACGCGTA CGGCAGTTTG GTTGCGCAGA CGTTGCCGGA TGGCAGTGTA GTGCGCGCGG AGTATGACGC CGACGGCCGG CCCGTTTGTA TCACGATGCC TACCGCGCGG CAACTGCGCT ACGAGTGGGA CGAGCACGGT CATTTGCGCT CACGCACTAC GTCTTCCCGG GCTGTTTCGA GATACGTGTA CGACGCATAC GGGCAACTCG TTTCGTACAG CGGATCGCGT GGTGCGATCA CGCGATTCGA GTACGATCGC GACGGAAATC TGGCGACCGT TACAGATGCC TTGGGGAATC GTACGCGATA TGTGCGCGAC GCGCGTGGAA GAGTCGTTTG GATGGCCGAC CCGCGCGGGC AGGTAGGGCT TTACGAATAT GACGGCAACG GCAATTTGAC GAGAGCGGTC TTGCCTGGTG GGAAAGAGAT TCGTTGCGAG TTCGATGACG ACGGAAACCT GGTTCATTAT CGCGACGCCG CCGGCCAATC GACAACGTTG GCGTATTCGC CGATCGGGTT GGTGGAACGC AGAATCGCGC CGAGCGGCGG CATTGTCGAT TGCCGTTACG ATACGGAGGG GCAACTGGTC GGCGTCGTCA ACGAACGTGG CGAGCAGTAT GCGCTTAAAC GCGACCCGCT AGGCCGGATC GTGTCCGAGT CGGACTATTG GGGGCAGACA TGGCACTATC GATACGGTGG CTCAGGCGAA TTGCTGTGTA GCACCGATCC GCTCGGGCGG GTGGTCGAGT ATCAGTACGA TCGATGCGGC CGGATCGTGG AGAAGCTTGC GCGGGCCTCT GCGGATGACG CATTTGTCCA GGTCCATCGC TTCACCTATG ACCAGTGTGG CAATCTGATA CTCGCCGAGA ACGACGATAG CCGGGTCGAG TTTTGCCATG ATGCGGATGG ACGAGTGATT GAAGAAAAAC AAGGCGACGA CTTTACGATC AACAACGTCT TCGATGCGGC CGGCAATCGC ATCGAGCGTC GGACCAGGCT TCTATCGGAA GGCGCGCTCA TCGAACATGT CGTTCACTAT GAGTATGACG TCTTGAACGC CGTGGTTTCG ATTCGGATAG ACGGCTCGGC GGCGGTCGTC ATAGAACGCG ATGATCTTGG CCAGGCCGTC TCAGAGCGAC TGGGCCCTGC GCTGAGACGA GCGTTTTCAT ACGAGGCGGG CGGGCAGCTT GCCGTTCAGA CGCTGTTTGC ATCAACCGGC ACGATGTTCG CAAGCGAGTA TGCATACGAC CCGAACGGCG AGGTGATCGA AAAGCGCGAT GAGCGCGGGC GCATGGAGCG CTTCGAATAT GATTCCGTGG GACGCGTGGT TTCGCACCTC GGCGCTCGCG GCATCGTGCG CCGGCTCGCC TATGATTTGG CTGGGGATCT GTTGCGAACA CGTGTTTCCG GATATGACGC GGCGGCCCTT ACCGACGCGG AAAAAGCCGC GAACTGGATT CGCGACGGCG AATATGAAGG TTGCTACTGT GCGTTCGACC GTGCGGGCAA TATGGTTCAC CGTCGAGATG CCGAGCAGAA TCTGGCCTTG CGCTGGGATC CGGCGGGCCA ACTGCGCGAG ACGATAGCAT TGCGTCCGGC AACGGCTGAT GTGGCGTCCG CTCGGAAGCG TACTCGCACA CAGTACGAAT ACGATGCATT GCGGCGGCGA ACACGCAAGC TGGTGCAAAT CGAATCGGAA GGCGAGCCGG ATTCGGTGTT CTCGTACGTC AGTTGCTTCT 
TCTGGGATGG CGATGCGCTC GTGGGAGAGC GTACGACAGG CGGCTGGGTG GGGCGCGCGG GCCGCGCAGA GGAAGACGAG CGGGACGCAA CATCGCTCGC GCCGCCGTCT CCAGCCGGCT CGAAGAACGG CAACTGGCTG ACGTTACAAC CCGATCACGT ATGTGAGTGG TTCTATTACC CGCGAACATT TCACCCGCTC GGAGTCGTGC ACTGCTATGC CGGCGAGAAG CGCGAGCTCG CGAGGGCGAC GGTTGGCTCG CCTTCGAAGG CGGATGCAAC TTACTTTTAC CAAAGCGATC CGAACGGCGC GCCGGTGCGA ATGCTCGATG TGGAGGGAAA CGTGGCTTGG GAGGCGAGCT ACGACGCCAA TGGAGGCATC GAGCAATTTG GCATTCAGGC GATGCCGCAG CCGCTTCGAT TGCAAGGGCA GTACTTCGAT GCCGAGACGG GCATGAGTTA CAACCGACAT CGTTACTACG ATGCACGGAT CGGCCAGTTC GTCAGCGAAG ACCCTATCCG TCTGAGCGGT GGAGAGAACT TGTACCGCTA TTGCGTCAAC AGCATATCGT GGGCGGATCC TCTCGGACTC GATAGAGTTC CTTTGTTCGA TCCGAACAAT CGGTTGTCTT TCAATGCGAT CTGGGCATAT ACAGGCAATC TGCCCACTCC GTCCGACGTC ACGGTCGTTC GGCCGGCGAC AAGTTGGGGA AGGAATTCGG GGGTTCTAAA ATGGCAGGAT CATTATTATG TTTTTGCAAG CGCTGGTAAG AAATCGCATT CCGAGGACGC GATCATTGAC TTTATAAAAC AGCACGATAT CAAGCCGAGC GAGATTCAAG GTTTGTATAG CATACTCAGC CCGTGCGCCG AAGAGAAGAA ATCCTGTCTT GAAAAGACCA AAGGCGAAGG GCTGGAATTT GATTGGAGTT TATTTCACCA GACCGACAAC GTAGGTTCGG AAGTCAGGAA GGAGATCGGT GCGCTGATTA AGAATATGAA GGGGGTGTGA
|
Protein sequence | MALPAVKHMD PVVGIDVHSV IVTPGLPPVF LPHPHVGFML DLREYVEAAK GVIGSIAMTI AEEAVQDYLK DHPDDSKKLT DALNSLDNQK KQAERDPMVA EMLRLNRDVS SIRGDVANAV GAGVGMGGAA GRPIFVNGLM RATAGTHSFH VPGLHFPLGE TFAPPDPLPS DDAESYMGSR TVLANNDPMS FLALPAMSCW AEGIEPPPHN GAHTARTHVS MPSSVMLPIP AGRPVLVGGP PVMNMAAAAK GMFKAFQGSK WAKKLADKLH LKSGFLRCAV LKAEPVDVTT GEVVVQQNDF TVSGRLPLVW DRYYASHDRH AGAVGFGWQT PADIRLELMR NEDGIGAAAC FPDHATAFDM VPADDGWPAR TYDWQHGHAL YRDDGRMVLR TREGIEYGFV LPSRWRNAVA ALDGDDSRLT LPIGRMADLN GNAWVFERDV YGGLVRLVEW KRDGRTERVV ECGTGSGLHA GLLTSLTLID AGGNAHPLVS YEHDRERNLA AAIDAMAHPH HFEYAAGHRM VSHTSARGVS FRYSYQQGDD GVWRVDHAWG DNGLFDYRFV YDRARMETRV TNSLGHTTIT QMNERGMPVA EIDSLGGVTG YRYDTQGRAS AVIDTAGRTT AWEYDAYGSL VAQTLPDGSV VRAEYDADGR PVCITMPTAR QLRYEWDEHG HLRSRTTSSR AVSRYVYDAY GQLVSYSGSR GAITRFEYDR DGNLATVTDA LGNRTRYVRD ARGRVVWMAD PRGQVGLYEY DGNGNLTRAV LPGGKEIRCE FDDDGNLVHY RDAAGQSTTL AYSPIGLVER RIAPSGGIVD CRYDTEGQLV GVVNERGEQY ALKRDPLGRI VSESDYWGQT WHYRYGGSGE LLCSTDPLGR VVEYQYDRCG RIVEKLARAS ADDAFVQVHR FTYDQCGNLI LAENDDSRVE FCHDADGRVI EEKQGDDFTI NNVFDAAGNR IERRTRLLSE GALIEHVVHY EYDVLNAVVS IRIDGSAAVV IERDDLGQAV SERLGPALRR AFSYEAGGQL AVQTLFASTG TMFASEYAYD PNGEVIEKRD ERGRMERFEY DSVGRVVSHL GARGIVRRLA YDLAGDLLRT RVSGYDAAAL TDAEKAANWI RDGEYEGCYC AFDRAGNMVH RRDAEQNLAL RWDPAGQLRE TIALRPATAD VASARKRTRT QYEYDALRRR TRKLVQIESE GEPDSVFSYV SCFFWDGDAL VGERTTGGWV GRAGRAEEDE RDATSLAPPS PAGSKNGNWL TLQPDHVCEW FYYPRTFHPL GVVHCYAGEK RELARATVGS PSKADATYFY QSDPNGAPVR MLDVEGNVAW EASYDANGGI EQFGIQAMPQ PLRLQGQYFD AETGMSYNRH RYYDARIGQF VSEDPIRLSG GENLYRYCVN SISWADPLGL DRVPLFDPNN RLSFNAIWAY TGNLPTPSDV TVVRPATSWG RNSGVLKWQD HYYVFASAGK KSHSEDAIID FIKQHDIKPS EIQGLYSILS PCAEEKKSCL EKTKGEGLEF DWSLFHQTDN VGSEVRKEIG ALIKNMKGV
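The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute GC content, and translate the gene sequence is shown below. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and that is not modelled here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Unrecognised codons become 'X'; trailing bases that do not fill a
    complete codon are ignored.
    """
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "ATGGCGCTGC CGGCCGTCAA GCACATGGAC"
    print(translate(prefix))  # MALPAVKHMD, the first ten listed residues
```

Applied to the full gene sequence, the same functions could be used to compare the translation with the listed protein sequence and the computed GC fraction with the reported 61%.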
|
| |