Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1568 |
Symbol | |
ID | 4903235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 1523917 |
End bp | 1527264 |
Gene Length | 3348 bp |
Protein Length | 1115 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640134798 |
Product | glycosy hydrolase family protein |
Protein accession | YP_001065839 |
Protein GI | 126454507 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCATGCGT GGCCGCCGCT GTTCGAGCGC GTCGCCGCAT GGGGCTTCGA TCATGTGCTG ATCGGCGGCC TCTGGGCGGC GAGCCGCCGC GGCTATCCGC GCCACGTCGC GGACCCCGAT CGCCCCGCCG AATCGTTCGC GACGTCGCTC GACGCGACGA GCGCGCTCGC CCGCCTGTCG GACAGCGCGC GCGAGCACGG GCTGCGCATC GCCGTCGAAG TCGTCGTCGA CCGCGTCGCG CGCGAGCACC CGCTGCACGA CGCGCACCGC GACTGGTACG TCGTCGACGA ACGCGACGAC GCGCTGATCG ATCCGCGCAC CGCGGCGCTC GCGCACGACG TCGCGCATGC GAACGTCGGC AGCGCGGCCG CGCTCGACGC GCTCGCCGAC TGGTGGCGCG CCCGGCTCGG CGCGCTGGCC GACGCGGGCG CGGCGGCCTT TCTCGTCGAC GCGCCGCAGC GGATGCCGGC GCACTGGTGG GCCGCGCTGC TCCGCGCGCT GCGGCAGGCC CGCGCGGATC TGCCGGTGAT CGCCGGCGTG CCGGGCCGGG AGCGCGAGGC GCTCGCGCAG CTCGCCTGCG CCGGCTTCGA CGCGGCGTTC TCGTCGCTGC GCTGGTGGGA CCTGCGCGCG CCGTGGTTCG TCGAAGAACA CCGGCTGCTG CGGCGCGTCG GCGCGCCGAT CGCGTTCCCG GACGCGTTCG ACGGCCCGCG GCTCGCGCAC GATTGGCGGC AGGCCGCGCC CGAGACGATC GAGCGCGCGC ATCGCCGCGC GCTGTGGACC GCCGCCGCGC TCGGCACCGG CTGGCTCGTG CCGATGGGCT TCGAGCGCGG CGTGGCCGTC GAACTGATGG CGCGCGAGCC GCGAGCCGAC GCGTACCGCG CCGCGCTCGA CAGCGCGCCG TTCGACTTGT CGGGCGCGAT CGCCGAAGCG AACGCGCTGC GCCGCGCGGC GCCCGCGCTG CGCGGCAACG GCGAGATCGC GCAGCTGACG GGTGCCGATG CGCCGGCGAC CGTGCTGCTG CGCGGCGCGC GCACCGCGCT CGAATACGAC GACGAGGCCG CGCTGATCGC GGTCAATCCG GACCTCGCGC ACCCCGCGGC GATCGTGCCG TGCGCGGCGC TCGCCGGCGT GCCGGGCGGC TTCACGCGCT TCGCGCCGTT CGCCGACGGC CGCCGCCCGC GTATGGGCGC GCTCGAACCG TTCGCGCTCG CCGCCGGTGC CTGCACGTTG CTGCGCGCGC AGCGCGCGCG CCCGGTCACG ACGGCCCCCG CCGAGGATCG GCGCGGCAAT CGGCCCGGCA CCCGTGCGTC GGTCACGGCC GCGCTCGCCG GCGAGCGCAT CGCGATCGAG CGCATCGAGC CCGTCGTCGA CGATGGCCGC TTCGCCGTCA AGCGCGTGAT CGGCGAGCGG CTCGCCGTGC GCGCGGCGAT TTTCGCCGAC GGTCACGCAC GCCTCGCGGC GGCCGTCCAA TGGCGCGCCG CGGACGAGAA CGGCTGGCAC GAGGCCCGAT GCGCCGCCGA AGGCAACGAT GCGTGGCGAG CGGACATTCC GCTCGAACGG CTCGGCCGGC ATCTGTTCCG CGTGATCGCG TGGCGCGACG ATTGGGCGAC GCTCGTCGAC GAGATCGGCA AGAAGCACGC GGCGGGTCAG GCGGTGGCGC TCGAGCTGGA AGAAGCGCGG CGACTCGCCG CCGACGTGCT CGCGCGCGCG CCGGAGGCGA ACCCCGCCGC GCTCGCCGTG CTGCGCGAAT TCGCGGCGGC CCTCGACGCC GCGCCGCCCG ACCAGCGGCT CGCGCTGATC GGCGCGCCGC ACGTCGCCGA CGCGTTCGCG GCGCTGCGCG AGCGAGCGTT CGCCACGCGC GACGCGCCCG TCTTCCCGGT CGACGTCGAG CGGCGCGCGG CCCGCTTCGC CAGCTGGTAC GAGATGTTTC CCCGCTCGGC GAGCGACGAT GTCCGCCGCC ACGGCACGTT CGACGACGTC GTCGCGCATC TGCCGCGCAT CCGCGACATG GGCTTCGACG TGCTGTACTT CCCGCCGATC CATCCGATCG GCACGACCGC GCGCAAGGGC CGCAACAACA GCCTGCAGGC CGCGCCCGAC GACGTCGGCA GCCCGTATGC GATCGGCTCG CCGGCGGGCG GCCACACCGC CGTCCATCCG CAGCTCGGCT CGCTCGATGC GTTCCGCCGG CTCGTCGCTG CGGCGCGCGC GCACGATCTC GAGATCGCGC TCGACTTCGC GGTTCAATGC TCGCCGGACC ATCCGTGGCT CACCGAGCAT CCCGGCTGGT TCGCATGGCG GCCGGACGGC TCGCTGCGCT ACGCGGAAAA TCCGCCGAAG CGCTATCAGG ACATCGTGAA TCCCGACTTC TACGCGCGCG ACGCGATGCC CGCGCTGTGG ATCGCGCTGC GCGACGTCGT GCTGTTCTGG ATCGACGCGG GCGTGCGCAT CTTCCGCGTC GACAATCCGC ACACGAAGCC GCTGCCGTTC TGGGCATGGA TGATCGCCGA CGTGCGCGCG CGACACCCGG ACACGGTGTT CCTGTCCGAG GCGTTCACGC GGCCGAGCAT GATGTACCGG CTCGCGAAGC TCGGCTTCTC GCAGTCGTAT ACCTACTTCA CGTGGCGCGA GTCGAAGCGC GAGTTCATCG ATTATCTGAC CGAGCTCGCC GACGGGCCGG CGCGCGAATA CTTCCGGCCG AACTTCTTCG TCAACACGCC GGACATCAAT CCGCGCCACC TGCAGCAGGC GCCGCGCACG CAGTTCGTGA TCCGCGCGGC GCTCGCCGCG ACGCTCTCGG GCCTCTGGGG AATGTATTCG GGCTTCGAGC TGTGCGAGTC CGACGCGCTG CCCGACAGCG AGGAATATCG CGACGCGGAG AAATACGAGC TGCGCGCGCG CGACTGGCGG CGGCCCGGCC ACATCGGCGA CGAAATCGCG CGGCTCAACC GCGCGCGGCG CGACAACCCG GCGCTGCAGA CGCATCTCGG CATCCGCTTC GCGCACGCGC CGAACGACGC GGTGCTGGTG TTCTCGAAGG CGACGCCCGC GCACGACAAC GTCGTCGTCG TCGCGATCAG CCTCGATCCG TGGCATCCGC AGGCCACCGA TTTCACGCTC GACGCGGCGC TGTACCGCGG CTGGGGCATC GCCGACGGCG AGCGGCTCGT CGCCGTCGAT CAGACGGCCG ACCACGTCGA AACCTGGCAC GGGCGCCGGC ATTACGTCGC GCTCGACCCG CACGTGCGCC CGTTCGCGAT CTGGCGCGTC GCGCCCGCGG CGGGCGTCGC GCGCGGCGCT CGCGACGACG CGCGCGACGT CCCCGCACAG GAGGTGCACG AACGATGA
|
Protein sequence | MHAWPPLFER VAAWGFDHVL IGGLWAASRR GYPRHVADPD RPAESFATSL DATSALARLS DSAREHGLRI AVEVVVDRVA REHPLHDAHR DWYVVDERDD ALIDPRTAAL AHDVAHANVG SAAALDALAD WWRARLGALA DAGAAAFLVD APQRMPAHWW AALLRALRQA RADLPVIAGV PGREREALAQ LACAGFDAAF SSLRWWDLRA PWFVEEHRLL RRVGAPIAFP DAFDGPRLAH DWRQAAPETI ERAHRRALWT AAALGTGWLV PMGFERGVAV ELMAREPRAD AYRAALDSAP FDLSGAIAEA NALRRAAPAL RGNGEIAQLT GADAPATVLL RGARTALEYD DEAALIAVNP DLAHPAAIVP CAALAGVPGG FTRFAPFADG RRPRMGALEP FALAAGACTL LRAQRARPVT TAPAEDRRGN RPGTRASVTA ALAGERIAIE RIEPVVDDGR FAVKRVIGER LAVRAAIFAD GHARLAAAVQ WRAADENGWH EARCAAEGND AWRADIPLER LGRHLFRVIA WRDDWATLVD EIGKKHAAGQ AVALELEEAR RLAADVLARA PEANPAALAV LREFAAALDA APPDQRLALI GAPHVADAFA ALRERAFATR DAPVFPVDVE RRAARFASWY EMFPRSASDD VRRHGTFDDV VAHLPRIRDM GFDVLYFPPI HPIGTTARKG RNNSLQAAPD DVGSPYAIGS PAGGHTAVHP QLGSLDAFRR LVAAARAHDL EIALDFAVQC SPDHPWLTEH PGWFAWRPDG SLRYAENPPK RYQDIVNPDF YARDAMPALW IALRDVVLFW IDAGVRIFRV DNPHTKPLPF WAWMIADVRA RHPDTVFLSE AFTRPSMMYR LAKLGFSQSY TYFTWRESKR EFIDYLTELA DGPAREYFRP NFFVNTPDIN PRHLQQAPRT QFVIRAALAA TLSGLWGMYS GFELCESDAL PDSEEYRDAE KYELRARDWR RPGHIGDEIA RLNRARRDNP ALQTHLGIRF AHAPNDAVLV FSKATPAHDN VVVVAISLDP WHPQATDFTL DAALYRGWGI ADGERLVAVD QTADHVETWH GRRHYVALDP HVRPFAIWRV APAAGVARGA RDDARDVPAQ EVHER
|
| |