Gene BURPS1710b_A0059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0059 
Symbol 
ID3693635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp85752 
End bp87509 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content71% 
IMG OID637730312 
Productthiaminase I precursor 
Protein accessionYP_335217 
Protein GI76818592 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.234303 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCGC ATCGACAAAC ACCAAGACAA GCGCATCGTT TCGCACGCGT CATTCGACAA 
GGAACCATCA TGCGTCGCCT CTTGTGCTGC TTCACGATCG TTCTGGGCTT CCTGTTCGCC
GCGCCGTCGC ACGCGGGCGA CGCGGCCGCC GTTGGGCAAC TGACGGTGGC GCTGTATCCG
TGGGTGCCGC GCGTCGACCA GTTCAAGCGC GCGATCGAAA CCGAATGGAA GAAGGCGCAG
CCGGGCGTCG CGCTGCGGTT CATATCCGCG GACGCGTGGG ACGGCGGCTA CCGGAACGAT
CCGCCCGCGA GCGCCGACGT CTACGTGTAC GACGCGATCT TCCTCGACTA TTTCCGCAGC
CAGAACTGGC TCGAGCCGCT CGCGGCGGAC GAGATCCAGC ACATCGACGA TTTCCTGCCG
TACGCGATCC AGGGCGTGAA GGCGGGCGAC CGGTACTACA GCATCCCGCA GCTCGGCTGC
GCGAACGTGC TGTTCTACCG GAAGGACGAC GCGGCGCTCG CGGCCGCGAC GACGCTCACG
CAGGTGCGCG GCGCGCTCGA GCAATGCACG TTCACGAGCG AGATCCCGCC GGACAGGCGC
GGGCTGATGG TCGACATGTC CGGGCGCACG ACGAACGCCG CGCTCTATCT GGACGCCGCG
CACAGCCGCA CGGGCGCATA CCCGCTGCCG CTGCCGTGGA ACGCGAACGA CCTGAACGGC
GAAGCGCTCG GCAGCCTGCG CGCGCTGATG GCGATGTCGA GCTGGCCGAA CGCGACAGCC
GAGCTGCCGG GCCAGTACGA TCGCTCGGTA TGGTTCAGCG ACGGCGAAGG GCGCGCGGTG
ATCGGCTATT CGGAATCGAT GTCGGCGATG AGCGAGGCGG CGCGGCGCGA TCTCGACTTC
AAGTTCCTGC CGCTGTCGGA CACGCCGCAG CCGCCGCTCT TCTACGCGGA CGTGATCGGC
GTGAACACGA CGACCCACGC GCGCGGCACG CGCGCGCTCG CGGTGCAACT CGCGAACGTG
ATCGCCGCAT CGTCGACGAT GGTGCAAAGC GTCGGGCCGG ACGGCAGCGG CGTGCCGCAA
TATCTGTTCT CCGCGCGGCG CAGCGTGCTG CACACGCTCG CGCAGCGCTA TCCGCTCTAT
CGGAAGATGG TCGCGCTGCT GGATGCGCGC GAGCCGGTGA TGTTCAAGAT CGATGCGCAG
TCGCGCAACT GGCTCGCCTC GATGAGCGGG CCGATCGCGC AGCGCGCGCG CGCCCGATTA
CCCGTGCGGC TGCGATATCG ACACCGCGCT GCCGATCGCC GACTATCGCG GCGCGCAGGC
CGTGTGCCCG ACCGTCTGCG CGGCGCAGGG CGGCTGGAAC GGCCAGTGGA CCAATCAGTC
TCCCGCGGCG CCCGCCGGGC AGTCGGCGTG CGGCTGCAAC GCGTGCCCGA CGTCAGCCGC
GGCGAAACTG CCGCGCGCGC TCGCCACCCG CGCCGCGCCC GGCGATCGCG CGAAGCCGTG
ACACGGGCGG CGCCCGGCCT CGCCGCGGGC GGGCCGGGAG CCCGGCCCGC GAGGCCCGCG
CCGCGCCCGA ATCGCCTCGC GCGCGGTCGA ACGCGGTTGC GCAACGGCCG CGCCGCAACC
GGCCGGCCTC TCGAACGCGC GGCATGCGGC CCGTGCGATC CGCGTATGAG GCCGAGCCGT
CCGATAGCGG GCGGATACGC GCCGCCCCGA CCGTCGACGG CCGGCTCGGG CCCGCCCGTT
CGCGCGAGCG GGCGCTAG
 
Protein sequence
MRAHRQTPRQ AHRFARVIRQ GTIMRRLLCC FTIVLGFLFA APSHAGDAAA VGQLTVALYP 
WVPRVDQFKR AIETEWKKAQ PGVALRFISA DAWDGGYRND PPASADVYVY DAIFLDYFRS
QNWLEPLAAD EIQHIDDFLP YAIQGVKAGD RYYSIPQLGC ANVLFYRKDD AALAAATTLT
QVRGALEQCT FTSEIPPDRR GLMVDMSGRT TNAALYLDAA HSRTGAYPLP LPWNANDLNG
EALGSLRALM AMSSWPNATA ELPGQYDRSV WFSDGEGRAV IGYSESMSAM SEAARRDLDF
KFLPLSDTPQ PPLFYADVIG VNTTTHARGT RALAVQLANV IAASSTMVQS VGPDGSGVPQ
YLFSARRSVL HTLAQRYPLY RKMVALLDAR EPVMFKIDAQ SRNWLASMSG PIAQRARARL
PVRLRYRHRA ADRRLSRRAG RVPDRLRGAG RLERPVDQSV SRGARRAVGV RLQRVPDVSR
GETAARARHP RRARRSREAV TRAAPGLAAG GPGARPARPA PRPNRLARGR TRLRNGRAAT
GRPLERAACG PCDPRMRPSR PIAGGYAPPR PSTAGSGPPV RASGR