Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0638 |
Symbol | |
ID | 4882323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 610694 |
End bp | 613213 |
Gene Length | 2520 bp |
Protein Length | 839 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640126566 |
Product | glycosy hydrolase family protein |
Protein accession | YP_001057690 |
Protein GI | 126442197 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCCG CGCCGGATCG CGTGGCGCGC GGCGCCGCCC AATGGACGTT GATCGCGACG CCCGCTGGCG CGATCGCGCG ACCGAGCGAG CTCGGCGAAG CCGGCTGGTG CGCGGCAAGC GTGCCCGGCA CCGTCGCGCA GGCGCTCGCC GCCGCGCGCC GCTTCGATCC CGCGCATCCG TACCCGCTCG GCGACAGCGA CTACTGGTAC CGCACGACGC TGCACGGCGC GGGGCCGCGC ATCGTCCGGC TCAACGGCCT CGCGACGATC GCCGAGGTCT GGCTCGACGA CACGCTGCTG CTCTGCTCGG ACAACATGTA CGTCGCGCAC GATCTGCCCG TGACGCTCGG CGGCGCGCAC CGTCTCGCGC TCTGCTTTCG CTCGCTCGAC CGGCACCTCG CCGAGCACCC GCCGCGCGGC CGCGCGCGCT GGCGCACGCG CCTCGTCGAT ACGCCCGCGC TGCGCGGCGT GCGCGCGACG TTCCTCGGCC GAATGCCGGG CTGGTTCCCG GCGATCGAGC CGGTCGGCCC GTGGCGTCCG ATCGACATCG TGAATCCGGC CGGGGCGCCG ACGATCGTGC GCGACACGCT GCGCGCGACG CTCGACGGGC GCGACGGCGT GCTCGACGCG ACGCTCGAGT TCGCCGCGCC GCTTCCGAGA ACGGCGCGCG CGCAGCTCGT CTGCGGCGAG CATGCGGCGC CGCTCGAAGC GACCGGTCCG CGCACCGCGC GCGCAACGCT CAGGATCGCG AACGTCACGC CGTGGTGGCC GCATACGCAC GGCGAACCCG CGTTGTACGA CGTCGGCGTG GCAATCGGCG GCGCAACGAT TGCGCTCGCC AAAACGGGCT TTCGCACGCT CGCCGTCGAG CGCGGCGCGG ACGGCCGCGG CTTCGCGCTG TCGGTCAACG GCACGCCGCT TTTCGCGCGC GGCGCATGCT GGACGAGCGC CGATCCCGTC GGGCTGCACG CCGATGCGCC CGCCTATCGC CGCGCGCTCG CGCTCGCGCG CGACGCCGGC TGCAACATGA TCCGCGTCGG CGGCACGATG ATCTACGAGG CCGACGCGTT CTACGCGCTC TGCGACGAGT TGGGGCTGCT CGTGTGGCAA GATTTCATGC TCGCGAACCT CGACTATCCG TCGAACGATC CGCGCTTTGC CGAATCGCTC AAGCGCGAGG CCGAGCAGTT CCTCGGCCGG CACATGGCGC GGCCGTCGAT CGCGGTGCTG TGCGGCGGCA GCGAGATCGC GCAGCAGGCC GCGATGGTCG GCCTCGCGCC CGACGAGCGC CGCGTGCCCG CCACCGAGCA ATGGCTCGCC GAACTGTGCG CCGCGCATCG CCCCGATGCC GCGTACGTCA GCGATTCGCC GCACGGCGGC GTGCTGCCGT TCGCGCCGCG CGAGGGCGTT ACGCACTACT ACGGCGTCGG CGCGTACCTG CGCCCTCCCG AGGATGCACG CCGCGCCGGC GTGCGCTTCG CGAGCGAGTG CCTCGCGTTC GCGAACGTGC CGTGCGACGC GACGCTCGCC TCGATCGGCT CGCCCGCCGC GCACGAGCCG GCCTGGAAAC GCGCGGTGCC GCGCGATCCC GGCGCGCCGT GGGATTTCGA CGACGTGCGC GATCACTACC TGCGCACGCT GTACGGCGTC GAGCCCGCGC GCTTGCGCAG CATCGATCCC GCCCGCTATC TGACGCTGTC GCGCGCCGTC GTCGCCGATC TCGTCGGGGA GACGCTCGCC GAGTGGCGCC GCGTCGGCTC CTCGTGCGCG GGCGCGCTCG TCTGGCAGTT CCAGGACGTG ATGCCGGGCG CGGGCTGGGG CCTCGTCGAC GCGCACGGCC GGCCGAAATC CGCATGGCAT GCGTTGCGGC GCGTATCGCA GCCGCGGCAG ATCCTGCTGA CCGACGAAGG GCTCAACGGC CTCGACGTGC ACGTGCTCAA CGATGCGCCC GCGCCGCTCG AAGCCCGCAT CGAGCTCGTC GCGCTACGCG ACGGCAAGAC GCCGGTCGCG CGCGCGGCCC GCACGGTCCA CGTCGCCGCC CACGCGGGCC AATGCGTGAA TTCGGCCGAC CTGCTCGGCC GATTCTTCGA TTTCACCTAT GCGTACCGCT TCGGCCCGCG CGAGCACGAC GTCGTGATCG CATCGCTGTA CGCGAGCGAC GGCGCGCTGC TGTCGCAGGC GTTCCACTTT CCCGAACGCA CCGCGCCGAC CGTGTTCGAG CGCGGCGACA TCGGTCTCGA GGCGAGCGCC GCGTATCGAG ACGGCCGCTG GTGCGTGCAG GTGCAGACGC GCACGTTCGC GCGCTACGTG CATGTGTGCG CGCCGGGCCT GCTGCCCGAC ATCGACTGGT TCCATCTCGC GCCGGGTGCC GCCGCGCGGA TCGAGTTCGC CGCCGACCCT CATTCTCCCG CTCCCGACCA CCGCCCGCCC GCAGCCGACG CGGCCCACTG CGCCCCGCCC GCGATCGAGG TGCGCGCACT CAATTCCAAC AAGACCATTC GCCCGAGGAT AGAAAATTGA
|
Protein sequence | MKSAPDRVAR GAAQWTLIAT PAGAIARPSE LGEAGWCAAS VPGTVAQALA AARRFDPAHP YPLGDSDYWY RTTLHGAGPR IVRLNGLATI AEVWLDDTLL LCSDNMYVAH DLPVTLGGAH RLALCFRSLD RHLAEHPPRG RARWRTRLVD TPALRGVRAT FLGRMPGWFP AIEPVGPWRP IDIVNPAGAP TIVRDTLRAT LDGRDGVLDA TLEFAAPLPR TARAQLVCGE HAAPLEATGP RTARATLRIA NVTPWWPHTH GEPALYDVGV AIGGATIALA KTGFRTLAVE RGADGRGFAL SVNGTPLFAR GACWTSADPV GLHADAPAYR RALALARDAG CNMIRVGGTM IYEADAFYAL CDELGLLVWQ DFMLANLDYP SNDPRFAESL KREAEQFLGR HMARPSIAVL CGGSEIAQQA AMVGLAPDER RVPATEQWLA ELCAAHRPDA AYVSDSPHGG VLPFAPREGV THYYGVGAYL RPPEDARRAG VRFASECLAF ANVPCDATLA SIGSPAAHEP AWKRAVPRDP GAPWDFDDVR DHYLRTLYGV EPARLRSIDP ARYLTLSRAV VADLVGETLA EWRRVGSSCA GALVWQFQDV MPGAGWGLVD AHGRPKSAWH ALRRVSQPRQ ILLTDEGLNG LDVHVLNDAP APLEARIELV ALRDGKTPVA RAARTVHVAA HAGQCVNSAD LLGRFFDFTY AYRFGPREHD VVIASLYASD GALLSQAFHF PERTAPTVFE RGDIGLEASA AYRDGRWCVQ VQTRTFARYV HVCAPGLLPD IDWFHLAPGA AARIEFAADP HSPAPDHRPP AADAAHCAPP AIEVRALNSN KTIRPRIEN
|
| |