Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1322 |
Symbol | |
ID | 4903505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1245491 |
End bp | 1250116 |
Gene Length | 4626 bp |
Protein Length | 1541 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640144428 |
Product | substrate-binding repeat-containing protein |
Protein accession | YP_001075357 |
Protein GI | 126455779 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG3209] Rhs family protein [COG4104] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAG TGCAGAGCAG CGCGCCGTCG GACAGCCAGC AGAAGCTCGC GGCGCTCGCG CAACGAGGCA GCAATGCGGA CGCCGTGCAG ACGGTGTCCA ACATGGGGCT CGCGATCAAC GCGGCGCAGG TCACGGCGGC CGGCACGAGC GCATGGGCGG CCGGCACGTT CCAGTGCTTC GCCGGGCGCG TGATCGCGCC GCTCGGCGGC GCGATGCTCG GCGGCGCGCT CGCCGAGGCG CTCGGCGCGG ATCGCCCGGT CACATGGGTG CTGGACAAGA TGGGGCTGCC GGCCGTCGCG AAGCCCGGCA AGGCGCCCGC TCGCGTCGGG CACAAGATCG TCCATGAGAA CGCCTTCATC GGCGCGCTGA CCGGCCTGCT GGCGGGGATC GCGGTCGGCG TGGCGATCGC CGCCGCGGCC GCCGCGATCG TGGCGACGGG CGGCGCGGCC GCGGTCGCCA TCGCGGCGGC CGGCCCGTTC GTGGTCGGGT TCGTCTCGGG CGCGGTGGGC GGTTTCGTCG GCGCGGCGGT GGCCAAGGGC ATCGGCCATA CCGGCTCGGT GACGGGCGCG ATCGCGCACG GCTCGCCCAA CGTCTCGTTC GAAGGCGCGC CGGTTGCCCG CGTGACCGAT CCGGTGACCT GCAGCAAGGA TCCCGGCATG CCGCCGCCGC AGATCGCACA GGGCAGCCTG ACCGTATCGG TCAACGGCCT GCCGCTCGCG CGGATCGGCC ACAAGATCAC ATGTTCGGCG GTCATTCAGG AAGGCTGCAC GACGATCAGC GCGGACGAGA CGACCGGCAC GTTCGGCAAG ATCGACGCGA ACGTGTCGCT TCTCGAGCAA CTGGTGCTGA CGGCCACCGA CGTCATCATG ATGCGTTCGG CCACCAAGGA AGGCGGCTTG CTGGACGGCG TGCTGCGCGA GCTGCTCGGC GAGCCGATCG ACATGGCGAC GGGCGATTAC GCCGACTACC GGACGGATTT CACGTGGCCG CACGTGCTGC CGCTCACGCT TTCCCGGGCG TACGCGGGGC GGCAGCCGGT CGAGGGGCTG CTCGGCGACA GGTGGATCAG CAACTGGTCG CAGCGGCTGC GCTATCGGTG GCCCGCGGAC GGCCCGGCCA CCGTCACGTT TTTCGACGCG GACGGCCAGC AGCTCGTGTA CCCGGTTCCG CACGAGCCGT TCAACGCGAT CAACTTCTGG GCGCCGCACT ACGCGTTGCA CGGCAGCCGC GCGCGGGCTG TCGTTTTCGA CGAGCGCTCG CAGCAATCGC TGATTTTCGA GCCCGCGCAT GCCGAGGACG ACGTCGCGCG CCTCACCCGC ATCGAGGATC GCAACGGCAA CACGATCGAC TTCGAGTACA ACGCGCTCGG CCGGCTATGC ACGGTGCGGC ACAGCGGCGG GATGACGCTG TGGGTCACCT GCGATTCCCG CGGGTTGTTG CAGCGCGTGT CGGAGCAGCC GGGCGGAGAG GGCGAACTGG TTCGCTATCG TTTCGACGGC AAGCGCCTGA CCGACGTTCA CAGCCGCTTC CAGGGCGAAT TTCATTTCGG CTATACCGAC GAGGGCTGGC TGAATCACTG GCGCGACAGC GGCGCGACCC AGGTGGCGCT GCGTTATGAC GAGCGCGGAA GGGTGATCGC CACGCGCACG AATACCGCGC TGTACGACGA TCGCTTCGAG TATGACGACG AGGCGCGGCT GACGACGTAC ATCGACGCGC TCGGGCATCG GCATCAACGC TGGTTCGATG CGCAGAACCG GTTGATTCGG TCGCGGGATC CGCTGGGGCG GGTAATGTGC GCGAGCTACG ACGAAAACGG ATGGCTTGCC TCCCGCACGG ATCCGCTCGG CCGTGTCAGC ACGTATCGCC ACGATTGCCG CGGCCGCCCG CTTCAGGTCA CCGACGCTTT CGGGCGGGTG AGCCGCTATG GGTGGAACGG CGCCGGCCAA CTCGTCGAAC AGCAGGATCA CAACGGCAAG GTGCAGTGGC ACTACAGCGC CGAGGGCAAT CTGGTCGCGT TGCGGGCGCG ATCCGGCGAG ACGCGCTTTC GCTATGACGC GCGCGGGCTG CTCGTCGGTC GAACCGATCC GGACGGTGCC GTGCATGCGT GGCGGTATGA CGGCGCCGGG CGCCCCGAGC GCTGGACCGA TCCGCTGGGG CGGCATACCT ATCTCGAGCA TGACCGTTAC GGCAGATTGA TCGCGCGCAT CGATGCCGCG GGCCATCGCA CGACGTACGG CTATGAGCGA GGGCCGTCCA ACCCGCGCGA GCTGCTCGCG AGCATCACCT ATCCGGACGG CGCGATTGCG CGCTTCCAGT ACGATTCGGA AGGCTTGCTG ACGGAAGCGG TCAATCCGTT GGGCCAGCGC ACTCGCTATG CGTGGGGCGC GTTCGATCTG CTCGCGAGCG TGACGGACCC CGGACAGGCG ACGACGCGCT ATCACCGCGA CGGCGCGGCT CGCCTGATCG GCGTGACCAA TGCGGCGGGG CAGCACTGGA AGTTCGAGCG GGACCCGGCG GGCCAACTGA TCGCGCAAAC CGACTGGAGT GGCCGCCGCA CGCGGTATGT CCTCAATCCG ATGGGACAGG TAACGGAAAA GCATCTGCCC GACGGCGTGG CGATACGTTA CGAATACGAC AGCCAGGACC GGCTCGTTGC ATTGACCGGA CCGCGTCGGC GTCACGTGTT TGCCTGGACG GCGAGCGGGT TGCTGACTCG GGCGGAGGTC TGGACGCGAA ACGACGAAGC GGGCGAATGG CGGCGCGACG ACCGGCTGGA CCTGGAGTAC GACGACGCCT GTCGGCTTGT CGAGGAAAGT CAGCATGGCC GGGCCGTAGG CTACGAATAC GATCCGATGG GGCGCACGCG GTCGATGGCT ACGCCGAGCG GGCGCACGCT TTGGCAGTAC GACGCGGCAG GCCAACTCGG CTCGATGGAA AGTAACGGCC ACCGGTTCCA TTTCGATTAC GACGGGCTCG GCCTGGAAAC GCTTCGGCGC TACACGCCGA CGCAGGCGCA CGTGGCGCGG CACCCGCAGT GGGTCGAGCC GTATTCCGAA GGCTATGCGC AACAGCAAGA TCACGACGGA CGGTGGCGAT TGACGAAGCA AGCCTGCGCA ACCTGGGCGG AGCTTCGCGA GCGCGGCGCG GCGCGTACGC GCCATTACGA GTGGGACGCC GCGGGGCGCT GCGTCGGGAT GCACGAGGCG CGCCGAGGGC TGCCGATCGC GCAGGACCGC TGGCGCTACG ATGCCCGCGG CCAAGTCGTG GATGCGCATT ACGAGCGCAC TGAAACGCGA AGCGGGCGCG AGCGCTACGA GTACGATGCG CTCGGGAACG TCTCGACACG GCAGATCGAC GCGGGCGAGG CGATGACGCA CGTCTATCAC GGCGACCAAT TGGTCAGCGC CGGGCCGAGT CGTTACGAAT ACGACGCGCG CGGGAGGATG ATCGCGCGAA CGGAGGGGCG CGACGGCTTC CGGCCGCGGA CGTGGCGTTA TCAGTGGGAC GACTTCGATC AGCTCGAGCA AGTCCTGACC CCGGAGGGCG AGCGCTGGCG GTATCGGTAC GATGCGTTCG GGCGGCGCAT CAGCAAGGCG TGCCTGTCGA CGCCGAAAGC CGGCCGGCCG GCGCGGATCG ACTATTTGTG GTGCGGCAGC CGGCTGATAG AGGCATGGCG CGCGTACGGC GAACGAGACG GCACGCAATA CGAAATCCAG CGCTGGCACT ATCGACCGGG CACCCATTCG GTTTTGGCGC AGGAGCGACT GAAATACGAC GACAAGCCGG ATCTGCAGAA CAGCGAGTGG TTTGCGCTCG CTTGCGATCC CAATGGCGCT CCGCATACGC TGTACAGTTC GGACGGCAGG ATCATGTGGA GTGCGCGGCG CGAACTCTGG GGCCGCGTGG CGGACGATCC GGACCGCGAT ACCGTCCGGC ATGCGGTGCG CGAGCAGTTG CGAACCAGCC TGCTGACAGG CGATGCATTC GATCCTCCGG ACTGCGAACT GCGATTTCCG GGGCAATGGG CGGACGAGGA AAGCGGCTTA CATTACAACT TCCACCGGTA TTACGATCCT GCGACCGGGC AGTATCTGAG TCCCGATCCG TTGGGTCTCG CCGGCGGGCT GAGAACGCAT GCGTATGTTC ACGATCCGCT GCAGTGGGTG GACCCGCTGG GGCTGCAGGG ATACCAAGGT TCCGGGAAGC CGGAATTCAT CGGTAACCGG AAGCTGCCTC AGAACGACTT GCCCTGGATC AAGTATCAAA AGCGCGTCAC GGGGCGCCCG TACGAGGAAA CGTGGCGCGT AGGCGATCAC AACGTCAATC TCGACGGGAA GCGAGCCGGC TATACGGTCG AGGCGAAATG GACCGGCAAG AACAGTGCCG CGTGGGAGTC TTCCCCCTAT AATCCCGAGC ACGAGTTCTA CAACGAGTCC AAGATTCTCG ATCAGGCGGG CCGGTTGCTC GAGTTCAATG ACGCTTCCGG CGGAAGCGGG GTGCGTTATG CGGTGTCGAA TGCGGAAGCC CAACAGCATT TCACGCAGCT ATTCGAGCAG CATTTCCCTA CCGAAATGCA GGACGGTACG TTGAGTGTCT GGCACGTCCC AGGAAACGGA ATGTGA
|
Protein sequence | MGEVQSSAPS DSQQKLAALA QRGSNADAVQ TVSNMGLAIN AAQVTAAGTS AWAAGTFQCF AGRVIAPLGG AMLGGALAEA LGADRPVTWV LDKMGLPAVA KPGKAPARVG HKIVHENAFI GALTGLLAGI AVGVAIAAAA AAIVATGGAA AVAIAAAGPF VVGFVSGAVG GFVGAAVAKG IGHTGSVTGA IAHGSPNVSF EGAPVARVTD PVTCSKDPGM PPPQIAQGSL TVSVNGLPLA RIGHKITCSA VIQEGCTTIS ADETTGTFGK IDANVSLLEQ LVLTATDVIM MRSATKEGGL LDGVLRELLG EPIDMATGDY ADYRTDFTWP HVLPLTLSRA YAGRQPVEGL LGDRWISNWS QRLRYRWPAD GPATVTFFDA DGQQLVYPVP HEPFNAINFW APHYALHGSR ARAVVFDERS QQSLIFEPAH AEDDVARLTR IEDRNGNTID FEYNALGRLC TVRHSGGMTL WVTCDSRGLL QRVSEQPGGE GELVRYRFDG KRLTDVHSRF QGEFHFGYTD EGWLNHWRDS GATQVALRYD ERGRVIATRT NTALYDDRFE YDDEARLTTY IDALGHRHQR WFDAQNRLIR SRDPLGRVMC ASYDENGWLA SRTDPLGRVS TYRHDCRGRP LQVTDAFGRV SRYGWNGAGQ LVEQQDHNGK VQWHYSAEGN LVALRARSGE TRFRYDARGL LVGRTDPDGA VHAWRYDGAG RPERWTDPLG RHTYLEHDRY GRLIARIDAA GHRTTYGYER GPSNPRELLA SITYPDGAIA RFQYDSEGLL TEAVNPLGQR TRYAWGAFDL LASVTDPGQA TTRYHRDGAA RLIGVTNAAG QHWKFERDPA GQLIAQTDWS GRRTRYVLNP MGQVTEKHLP DGVAIRYEYD SQDRLVALTG PRRRHVFAWT ASGLLTRAEV WTRNDEAGEW RRDDRLDLEY DDACRLVEES QHGRAVGYEY DPMGRTRSMA TPSGRTLWQY DAAGQLGSME SNGHRFHFDY DGLGLETLRR YTPTQAHVAR HPQWVEPYSE GYAQQQDHDG RWRLTKQACA TWAELRERGA ARTRHYEWDA AGRCVGMHEA RRGLPIAQDR WRYDARGQVV DAHYERTETR SGRERYEYDA LGNVSTRQID AGEAMTHVYH GDQLVSAGPS RYEYDARGRM IARTEGRDGF RPRTWRYQWD DFDQLEQVLT PEGERWRYRY DAFGRRISKA CLSTPKAGRP ARIDYLWCGS RLIEAWRAYG ERDGTQYEIQ RWHYRPGTHS VLAQERLKYD DKPDLQNSEW FALACDPNGA PHTLYSSDGR IMWSARRELW GRVADDPDRD TVRHAVREQL RTSLLTGDAF DPPDCELRFP GQWADEESGL HYNFHRYYDP ATGQYLSPDP LGLAGGLRTH AYVHDPLQWV DPLGLQGYQG SGKPEFIGNR KLPQNDLPWI KYQKRVTGRP YEETWRVGDH NVNLDGKRAG YTVEAKWTGK NSAAWESSPY NPEHEFYNES KILDQAGRLL EFNDASGGSG VRYAVSNAEA QQHFTQLFEQ HFPTEMQDGT LSVWHVPGNG M
|
| |