Gene BURPS1106A_A1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1472 
Symbol 
ID4904843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1428857 
End bp1430386 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content70% 
IMG OID640144578 
Productputative thiaminase I 
Protein accessionYP_001075506 
Protein GI126457723 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.620486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTGA CGCCCGCGCC GCGCGCCGGC ATGCGCGCGC ATCGACAAAC ACCAAGACAA 
GCGCATCGTT TCGCACGCGT CATTCGACAA GGAACCATCA TGCGTCGCCT CTTGTGCTGC
TTCACGATCG TTCTGGGCTT CCTGTTCGCC GCGCCGTCGC ACGCGGGCGA CGCGGCCGCC
GTTGGGCAAC TGACGGTGGC GCTGTATCCG TGGGTGCCGC GCGTCGACCA GTTCAAGCGC
GCGATCGAAA CCGAATGGAA GAAGGCGCAG CCGGGCGTCG CGCTGCGGTT CGTGTCCGCG
CACGCGTGGG ACGGCGGCTA CCGGAACGAT CCGCCCGCGA GCGCCGACGT CTACGTGTAC
GACGCGATCT TCCTCGACTA TTTCCGCAGC CAGAACTGGC TCGAGCCGCT CGCGGCGGAC
GAGATCCAGC ACATCGACGA TTTCCTGCCG TACGCGATCC AGGGCGTGAA GGCGGGCGAC
CGGTACTACA GCATCCCGCA GCTCGGCTGC GCGAACGTGC TGTTCTACCG GAAGGACGAC
GCGGCGCTCG CGGCCGCGAC GACGCTCACG CAGGTGCGCG GCGCGCTCGA GCAATGCACG
TTCACGAGCG AGATCCCGCC GGACAGGCGC GGGCTGATGG TCGACATGTC CGGGCGCACG
ACGAACGCCG CGCTCTATCT GGACGCCGCG CACAGCCGCA CGGGCGCATA CCCGCTGCCG
CTGCCGTGGA ACGCGAACGA CCTGAACGGC GAAGCGCTCG GCAGCCTGCG CGCGCTGATG
GCGATGTCGA GCTGGCCGAA CGCGACAGCC GAGCTGCCGG GCCAGTACGA TCGCTCGGTA
TGGTTCAGCG ACGGCGAAGG GCGCGCGGTG ATCGGCTATT CGGAATCGAT GTCGGCGATG
AGCGAGGCGG CGCGGCGCGA TCTCGACTTC AAGTTCCTGC CGCTGTCGGA CACGCCGCAG
CCGCCGCTCT TCTACGCGGA CGTGATCGGC GTGAACACGA CGACCCACGC GCGCGGCACG
CGCGCGCTCG CGGTGCAACT CGCGAACGTG ATCGCCGCAT CGTCGACGAT GGTGCAAAGC
GTCGGGCCGG ACGGCAGCGG CGTGCCGCAA TATCTGTTCT CCGCGCGGCG CAGCGTGCTG
CACACGCTCG CGCAGCGCTA TCCGCTCTAT CGGAAGATGG TCGCGCTGCT GGATGCGCGC
GAGCCGGTGA TGTTCAAGAT CGATGCGCAG TCGCGCAACT GGCTCGCCTC GATGAGCGGG
CCGATCGCGC AGCGCGCGCG CGCCGATTAC CCGTGCGGCT GCGATATCGA CACCGCGCTG
CCGATCGCCG ACTATCGCGG CGCGCAGGCC GTGTGCCCGA CCGTCTGCGC GGCGCAGGGC
GGCTGGAACG GCCAGTGGAC CAATCAGTCT CCCGCGGCGC CCGCCGGGCA GTCGGCGTGC
GGCTGCAACG CGTGCCCGAC GTCAGCCGCG GCGAAGCTGC CGCGCGCGCT CGCCACCCGC
GCCGCGCCCG GCGATCGCGC GAAGCCGTGA
 
Protein sequence
MELTPAPRAG MRAHRQTPRQ AHRFARVIRQ GTIMRRLLCC FTIVLGFLFA APSHAGDAAA 
VGQLTVALYP WVPRVDQFKR AIETEWKKAQ PGVALRFVSA HAWDGGYRND PPASADVYVY
DAIFLDYFRS QNWLEPLAAD EIQHIDDFLP YAIQGVKAGD RYYSIPQLGC ANVLFYRKDD
AALAAATTLT QVRGALEQCT FTSEIPPDRR GLMVDMSGRT TNAALYLDAA HSRTGAYPLP
LPWNANDLNG EALGSLRALM AMSSWPNATA ELPGQYDRSV WFSDGEGRAV IGYSESMSAM
SEAARRDLDF KFLPLSDTPQ PPLFYADVIG VNTTTHARGT RALAVQLANV IAASSTMVQS
VGPDGSGVPQ YLFSARRSVL HTLAQRYPLY RKMVALLDAR EPVMFKIDAQ SRNWLASMSG
PIAQRARADY PCGCDIDTAL PIADYRGAQA VCPTVCAAQG GWNGQWTNQS PAAPAGQSAC
GCNACPTSAA AKLPRALATR AAPGDRAKP