Gene Information |
Locus tag | BURPS1106A_2835 |
Symbol | |
ID | 4901047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2790070 |
End bp | 2791098 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640136061 |
Product | beta-hexosaminidase |
Protein accession | YP_001067082 |
Protein GI | 126452419 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
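
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. Both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2790070, 2791098)
print(length)                     # 1029
print(protein_length_aa(length))  # 342
```

Both results agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.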
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.261076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGT CCCCCGGTCC GGTGATGCTC GACGTCGCCG GCACGACGCT CACGCGCGAC GACGCGCGCC GCCTCGCGCA TCCGCACACG GGCGGCGTGA TCCTGTTCGC GCGCCACTTC GAGAGCCGCG CGCAACTCGT CGCGCTGACC GAGGCGATCC GGGCGATCCG CGACGGCATC CTGATCGCGG TCGATCACGA GGGCGGCCGC GTGCAGCGCT TTCGCACCGA CGGCTTCACC GTGCTGCCGG CGATGCGCCG GCTCGGCGAG CTGTGGGACA AGGACGTGCT GCACGCGACG AAGGCGGCGA CCGCGCTCGG CTATGTGCTC GCTTCCGAGC TGCGCGCGTG CGGCATCGAC ATGAGCTTCA CGCCCGTGCT CGATCTCGAT TACGGCCGCT CGAAGGTGAT CGGCGATCGC GCGTTCCATC GCGATCCGCG CGTCGTCGCG TTGCTCGCGA AGAGCGTCAA CCACGGGCTC GCGCTCGCCG GGATGGCGAA CTGCGGCAAG CATTTTCCCG GCCACGGCTT CGCGCAGGCC GATTCGCACG TCGCGCTGCC GACCGACGAT CGTCCGCTCG ACGAGATCCT CGCGAACGAC GCGGCGCCGT ACGACTGGCT CGGGCTGTCG TTGTCGGCCG TTATCCCGGC GCACGTGATC TACACGCAGG TCGATTCGAA GCCGGCCGGC TTCTCGCGCG TGTGGTTGCA GGACGTGCTG CGCGGCCGGC TGCGCTTTGC GGGCGCCGTG TTCAGCGACG ATCTGTCGAT GGAGGCCGCG CGCGAGGGCG GCACGCTCGC GCAGTCGGCG CAGGCCGCGC TCGAGGCGGG CTGCGACATG GTGCTCGTGT GCAACCAGCC GGATGCGGCG GAGCGGGTGC TCGACGAGCT GCGCACGACG GCGTCGCGCG AATCGTCGCG GCGGATCAAG CAGATGCGGC CGCGCGGCAA GGCGCTCGAG TGGCGCAAGC TGATGCGCGA GCCGCGCTAT CTGAATGCGC AGGGCCTGTT GCGCAGCACG TTCGCCTGA
|
Protein sequence | MKLSPGPVML DVAGTTLTRD DARRLAHPHT GGVILFARHF ESRAQLVALT EAIRAIRDGI LIAVDHEGGR VQRFRTDGFT VLPAMRRLGE LWDKDVLHAT KAATALGYVL ASELRACGID MSFTPVLDLD YGRSKVIGDR AFHRDPRVVA LLAKSVNHGL ALAGMANCGK HFPGHGFAQA DSHVALPTDD RPLDEILAND AAPYDWLGLS LSAVIPAHVI YTQVDSKPAG FSRVWLQDVL RGRLRFAGAV FSDDLSMEAA REGGTLAQSA QAALEAGCDM VLVCNQPDAA ERVLDELRTT ASRESSRRIK QMRPRGKALE WRKLMREPRY LNAQGLLRST FA
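
The two sequence fields can be checked against each other by stripping the spaces between the 10-character blocks and translating the gene sequence. The sketch below is a minimal, dependency-free illustration and not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the codon assignments of translation table 11, which match the standard code for amino acids. It does not handle alternative start codons or ambiguous bases. All names are illustrative.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon; '*' marks a stop codon.

    Raises ValueError for lengths that are not a multiple of 3
    and for any character other than A, C, G or T.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        b1, b2, b3 = seq[i:i + 3]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        residues.append(AMINO_ACIDS[index])
    return "".join(residues)


# First three blocks and last block of the gene sequence above.
print(translate("ATGAAACTGT CCCCCGGTCC GGTGATGCTC"))  # MKLSPGPVML
print(translate("TTCGCCTGA"))                          # FA*
```

The translated fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.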