Gene Information |
Locus tag | BURPS1106A_0652 |
Symbol | |
ID | 4902159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 625693 |
End bp | 628212 |
Gene Length | 2520 bp |
Protein Length | 839 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640133882 |
Product | glycosyl hydrolase family protein |
Protein accession | YP_001064934 |
Protein GI | 126452640 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
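The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 625693
END_BP = 628212
GENE_LENGTH_BP = 2520
PROTEIN_LENGTH_AA = 839


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons should hold if the assumptions above match the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 625693 to 628212 covers 2520 bp, which corresponds to 840 codons, or 839 residues once the stop codon is excluded.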
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCCG CGCCGGATCG CGTGGCGCGC GGCGCCGCCC AATGGACGTT GATCGCGACG CCCGCTGGCG CGATCGCGCG ACCGAGCGAG CTCGGCGAAG CCGGCTGGTG CGCGGCAAGC GTGCCCGGCA CCGTCGCGCA GGCGCTCGCC GCCGCGCGCC GCTTCGATCC CGCGCATCCG TACCCGCTCG GCGACAGCGA CTACTGGTAC CGCACGACGC TGCACGGCGC GGGGCCGCGC ATCGTCCGGC TCAACGGCCT CGCGACGATC GCCGAGGTCT GGCTCGACGA CACGCTGCTG CTCTGCTCGG ACAACATGTA CGTCGCGCAC GATCTGCCCG TGACGCTCGG CGGCGCGCAC CGTCTCGCGC TCTGCTTTCG CTCGCTCGAC CGGCACCTCG CCGAGCACCC GCCGCGCGGC CGCGCGCGCT GGCGCACGCG CCTCGTCGAT ACGCCCGCGC TGCGCGGCGT GCGCGCGACG TTCCTCGGCC GGATGCCGGG CTGGTTCCCG GCGATCGAGC CGGTCGGCCC GTGGCGTCCG ATCGACATCG TGAATCCGGC CGGGGCGCCG ACGATCGTGC ACGACACGCT GCGCGCGACG CTCGACGGGC GCGACGGCGT GCTCGACGCG ACGCTCGAGT TCGCCGCGCC GCTTCCGAGA ACGGCGCGCG CGCAGCTCGT CTGCGGCGAG CATGCGGCGC CGCTCGAAGC GACCGGTCCG CGCACCGCGC GCGCGACGCT CAGGATTGCG AACGTCACGC CGTGGTGGCC GCATACGCAC GGCGAACCCG CGTTGTACGA CGTCGGCGTG GCAATCGGCG GCGCGACGAT TGCGCTCGCC AAAACGGGCT TTCGCACGCT CGCCGTCGAG CGCGGCGCGG ACGGCCGCGG CTTCGCGCTA TCGGTCAACG GCACGCCGCT TTTCGCGCGC GGCGCATGCT GGACGAGCGC CGACCCCGTC GGCCTGCACG CCGATGCGCC CGCCTATCGC CGCGCGCTCG TGCTCGCGCG CGACGCCGGC TGCAACATGA TCCGCGTCGG CGGCACGATG ATCTACGAGG CCGACGCGTT CTACGCGCTC TGCGACGAGT TGGGGCTGCT CGTGTGGCAA GACTTCATGC TCGCGAACTT CGACTATCCG TCGAACGATC CGCGCTTTGC CGAATCGCTC AAGCGCGAGG CCGAGCAGTT CCTCGGCCGG CACATGGCGC GGCCGTCGAT CGCGGTGCTG TGCGGCGGCA GCGAGATCGC GCAGCAGGCC GCGATGGTCG GCCTCGCGCC CGACGAGCGC CGCGTGCCCG CCACCGAGCA ATGGCTCGCC GAACTGTGCG CCGCGCATCG CCCCGATGCC GCGTACGTCA GCGATTCGCC GCACGGCGGC GTGCTGCCGT TCGCGCCGCG CGAGGGCGTT ACGCACTACT ACGGCGTCGG CGCGTACCTG CGCCCTCCCG AGGATGCACG CCGCGCCGGC GTGCGCTTCG CGAGCGAGTG CCTCGCGTTC GCGAACGTGC CGTGCGACGC GACGCTCGCC TCGATCGGCT CGCCCGCCGC GCACGAGCCG GCCTGGAAAC GCGCGGTGCC GCGCGATCCC GGCGCGCCGT GGGATTTCGA CGACGTGCGC GATCACTACC TGCGCGCGCT GTACGGCGTC GAGCCCGCGC GCTTGCGCAG CATCGATCCC GCCCGCTATC TGACGCTGTC GCGCGCCGTC GTCGCCGATC TCGTCGGGGA GACGCTCGCC GAGTGGCGGC GCGTCGGCTC CTCGTGCGCG GGCGCGCTCG TCTGGCAGTT CCAGGACGTG 
ATGCCGGGCG CGGGCTGGGG CCTCGTCGAC GCGCACGGCC GGCCGAAATC CGCATGGCAT GCGTTGCGGC GCGTATCGCA GCCGCGGCAG ATCCTGCTGA CCGACGAAGG GCTCAACGGC CTCGACGTGC ACGTGCTCAA CGATGCGCCC GCGCCGCTCG AAGCCCGCAT CGAGCTCGTC GCGCTGCGCG ACGGCAAGAC GCCGATCGCG CGCGCGGCCC GCACGGTGCA CGTCGCCGCC CACGCGGGCC AATGCGTGAA TTCGGCCGAC CTGCTCGGCC GATTCTTCGA TTTCACCTAT GCGTACCGCT TCGGCCCGCG CGAGCACGAC GTCGTGATCG CATCGCTGTA CGCGAGCGAC GGCGCGCTGC TGTCGCAGGC GTTCCACTTT CCCGAACGCA CCGCGCCGAC CGTGTTCGAG CGCGGCGACA TCGGTCTCGA GGCGAGCGCC GCGTATCGAG ACGGCCGCTG GTGCGTGCAA GTGCAGACGC GCACGTTCGC GCGCTACGTG CATGTGTGCG CGCCGGGCCT GCTGCCCGAC ATCGACTGGT TCCATCTCGC GCCGGGTGCC GCCGCGCGGA TCGAGTTCGC CGCCGACCCT CATTCTCCCG CTCCCGACCA CCGCCCGCCC GAAGCCGACG CGGCCCACTG CGCCCCGCCC GCGATCGAGG TGCACGCACT CAATTCCAAC AAGACCATTC GCCCGAGGAT AGAAAATTGA
|
Protein sequence | MKSAPDRVAR GAAQWTLIAT PAGAIARPSE LGEAGWCAAS VPGTVAQALA AARRFDPAHP YPLGDSDYWY RTTLHGAGPR IVRLNGLATI AEVWLDDTLL LCSDNMYVAH DLPVTLGGAH RLALCFRSLD RHLAEHPPRG RARWRTRLVD TPALRGVRAT FLGRMPGWFP AIEPVGPWRP IDIVNPAGAP TIVHDTLRAT LDGRDGVLDA TLEFAAPLPR TARAQLVCGE HAAPLEATGP RTARATLRIA NVTPWWPHTH GEPALYDVGV AIGGATIALA KTGFRTLAVE RGADGRGFAL SVNGTPLFAR GACWTSADPV GLHADAPAYR RALVLARDAG CNMIRVGGTM IYEADAFYAL CDELGLLVWQ DFMLANFDYP SNDPRFAESL KREAEQFLGR HMARPSIAVL CGGSEIAQQA AMVGLAPDER RVPATEQWLA ELCAAHRPDA AYVSDSPHGG VLPFAPREGV THYYGVGAYL RPPEDARRAG VRFASECLAF ANVPCDATLA SIGSPAAHEP AWKRAVPRDP GAPWDFDDVR DHYLRALYGV EPARLRSIDP ARYLTLSRAV VADLVGETLA EWRRVGSSCA GALVWQFQDV MPGAGWGLVD AHGRPKSAWH ALRRVSQPRQ ILLTDEGLNG LDVHVLNDAP APLEARIELV ALRDGKTPIA RAARTVHVAA HAGQCVNSAD LLGRFFDFTY AYRFGPREHD VVIASLYASD GALLSQAFHF PERTAPTVFE RGDIGLEASA AYRDGRWCVQ VQTRTFARYV HVCAPGLLPD IDWFHLAPGA AARIEFAADP HSPAPDHRPP EADAAHCAPP AIEVHALNSN KTIRPRIEN
|
| |
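The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is illustrative only: it assumes the gene sequence is given in the coding orientation (it begins with ATG even though the strand is listed as "-"), strips the spaces that separate the 10-base groups, and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid map in TCAG order; table 11 shares these
# assignments with the standard code (it differs in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from this record.
prefix = "ATGAAATCCG CGCCGGATCG CGTGGCGCGC"
print(translate(prefix))
print(gc_content(prefix))
```

Passing the full gene sequence instead of `prefix` would allow a comparison against the reported 73% GC content and the 839 aa protein sequence.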