Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1472 |
Symbol | |
ID | 4904843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1428857 |
End bp | 1430386 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640144578 |
Product | putative thiaminase I |
Protein accession | YP_001075506 |
Protein GI | 126457723 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.620486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTGA CGCCCGCGCC GCGCGCCGGC ATGCGCGCGC ATCGACAAAC ACCAAGACAA GCGCATCGTT TCGCACGCGT CATTCGACAA GGAACCATCA TGCGTCGCCT CTTGTGCTGC TTCACGATCG TTCTGGGCTT CCTGTTCGCC GCGCCGTCGC ACGCGGGCGA CGCGGCCGCC GTTGGGCAAC TGACGGTGGC GCTGTATCCG TGGGTGCCGC GCGTCGACCA GTTCAAGCGC GCGATCGAAA CCGAATGGAA GAAGGCGCAG CCGGGCGTCG CGCTGCGGTT CGTGTCCGCG CACGCGTGGG ACGGCGGCTA CCGGAACGAT CCGCCCGCGA GCGCCGACGT CTACGTGTAC GACGCGATCT TCCTCGACTA TTTCCGCAGC CAGAACTGGC TCGAGCCGCT CGCGGCGGAC GAGATCCAGC ACATCGACGA TTTCCTGCCG TACGCGATCC AGGGCGTGAA GGCGGGCGAC CGGTACTACA GCATCCCGCA GCTCGGCTGC GCGAACGTGC TGTTCTACCG GAAGGACGAC GCGGCGCTCG CGGCCGCGAC GACGCTCACG CAGGTGCGCG GCGCGCTCGA GCAATGCACG TTCACGAGCG AGATCCCGCC GGACAGGCGC GGGCTGATGG TCGACATGTC CGGGCGCACG ACGAACGCCG CGCTCTATCT GGACGCCGCG CACAGCCGCA CGGGCGCATA CCCGCTGCCG CTGCCGTGGA ACGCGAACGA CCTGAACGGC GAAGCGCTCG GCAGCCTGCG CGCGCTGATG GCGATGTCGA GCTGGCCGAA CGCGACAGCC GAGCTGCCGG GCCAGTACGA TCGCTCGGTA TGGTTCAGCG ACGGCGAAGG GCGCGCGGTG ATCGGCTATT CGGAATCGAT GTCGGCGATG AGCGAGGCGG CGCGGCGCGA TCTCGACTTC AAGTTCCTGC CGCTGTCGGA CACGCCGCAG CCGCCGCTCT TCTACGCGGA CGTGATCGGC GTGAACACGA CGACCCACGC GCGCGGCACG CGCGCGCTCG CGGTGCAACT CGCGAACGTG ATCGCCGCAT CGTCGACGAT GGTGCAAAGC GTCGGGCCGG ACGGCAGCGG CGTGCCGCAA TATCTGTTCT CCGCGCGGCG CAGCGTGCTG CACACGCTCG CGCAGCGCTA TCCGCTCTAT CGGAAGATGG TCGCGCTGCT GGATGCGCGC GAGCCGGTGA TGTTCAAGAT CGATGCGCAG TCGCGCAACT GGCTCGCCTC GATGAGCGGG CCGATCGCGC AGCGCGCGCG CGCCGATTAC CCGTGCGGCT GCGATATCGA CACCGCGCTG CCGATCGCCG ACTATCGCGG CGCGCAGGCC GTGTGCCCGA CCGTCTGCGC GGCGCAGGGC GGCTGGAACG GCCAGTGGAC CAATCAGTCT CCCGCGGCGC CCGCCGGGCA GTCGGCGTGC GGCTGCAACG CGTGCCCGAC GTCAGCCGCG GCGAAGCTGC CGCGCGCGCT CGCCACCCGC GCCGCGCCCG GCGATCGCGC GAAGCCGTGA
|
Protein sequence | MELTPAPRAG MRAHRQTPRQ AHRFARVIRQ GTIMRRLLCC FTIVLGFLFA APSHAGDAAA VGQLTVALYP WVPRVDQFKR AIETEWKKAQ PGVALRFVSA HAWDGGYRND PPASADVYVY DAIFLDYFRS QNWLEPLAAD EIQHIDDFLP YAIQGVKAGD RYYSIPQLGC ANVLFYRKDD AALAAATTLT QVRGALEQCT FTSEIPPDRR GLMVDMSGRT TNAALYLDAA HSRTGAYPLP LPWNANDLNG EALGSLRALM AMSSWPNATA ELPGQYDRSV WFSDGEGRAV IGYSESMSAM SEAARRDLDF KFLPLSDTPQ PPLFYADVIG VNTTTHARGT RALAVQLANV IAASSTMVQS VGPDGSGVPQ YLFSARRSVL HTLAQRYPLY RKMVALLDAR EPVMFKIDAQ SRNWLASMSG PIAQRARADY PCGCDIDTAL PIADYRGAQA VCPTVCAAQG GWNGQWTNQS PAAPAGQSAC GCNACPTSAA AKLPRALATR AAPGDRAKP
|
| |