Gene BURPS1106A_0589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0589 
Symbol 
ID4902199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp556984 
End bp558804 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content69% 
IMG OID640133819 
ProductTPR repeat-containing protein 
Protein accessionYP_001064871 
Protein GI126452823 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0397506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTGC CCTTGAAGCT GTCCCAGAAG CGCCTTGCCG CTGCGCGCGG CCCGCGCGCC 
GTTCCGGTGC GCCGCGCGAT CGGCGCCGCG CTCGTCGCGG CGTGGGCGCT CGCCGCGCTC
CCCGCTCACG CGCAGGACGA CGCAGGCGAC GACGCCCCCC AGGCCGCGTT CGCGTCGGCG
CTGCCGGAAG AGCAGAAGGA TCTGCCGAAC GTCGCGCTGA CGAGCCAGAT CGTCTACCAG
GTGCTCGCGG CCGAGGTCGC GCTGCAACGC AGTCTTCCCG CGCCAGCCTA TCAGACCTAC
CTCGCGCTCG CGCGCGACAC GCGCGATCCG CGGATGGCGC AGCGGGCGAC CGAGATCGCG
CTCGCCGCGC AGAGCCCGGC GGACGCGCTG ACGGCCGCCA ATCTGTGGCG CGAATATTCG
CCGGGTTCGC AGCGCGCCGC GCAGGTCGAC GCCGCGCTCC TCGTGCTCGG CGGCAAGCCG
GCCGAAGCGC AGCCGATGCT CTCGCAAGAG CTCGCCCGCG CAACCGGCGA GAATCGCGGC
CAGGCGATCA TCGCGCTGCA GGCGCTGCTC GCGCGCGGGC CGAACCGCGT CGGCGGCCTG
ACGGTGCTCC AGGATCTGCT GAAGAACGAC ATGGGCCGGC CCGAGGCGCG GCTCGCGATC
GCGCGCCAGC AACTCGCCAC CGACGACAAG GACGGCGCGA CGCAATCGCT GAAGGAAGCG
CTGCGCATCA AGCCCGATTA TCTGCCGGCG GCGCTGATGC TGTCGCAGAT GGGCCCGGGC
GAGCGCGCGG CCGGAATCGC GTCGTTCGAG AAGTTCGTCC AGCAGAATCC GAAATCGCGC
GACGGCCGCC TCGCGCTCGC GCAACTGTAT CTCGCCGACG ATCGCCTCGA CGACGCGCAA
AAGCAGTTCG ACGCGATGCG CCGCAACGAT TCGAGCGACC CGACGCCGCT GATGGCGATT
GCGCTCATCA AGATCCAGCA GAAGCACCTC GACGACGCGA CGACGTACCT GAAGCAATAC
GTGAAGGTCG CGCAGAAGAA GCCGGGCGCG GACGTCGGCC AGGCGTACGT GTATCTCGCG
CAGATCGCGC TCGACCAGAA CAACGAGGCG CTCGCCGCGC AATGGCTCGA CAAGGTCGAC
GAAGCGAGCC AGCAGTACGT ACCCGCGCAG GTCACGCGCG CGCAGTTGCT GCAGAAGCAG
GGCAAGGCCG ACGAAGCGCG CAAGCTGCTC GCGAACCTGC AGGCGTCCGA CCCGCGCGAC
GCCGCGGTGA TCGCGCGCAC CGACGCGTCG ATCCTCTTCA CGTCGAAGCG CTACAAGGAA
GCCGCCGACC GGCTCGCGCA AGCCGTGGAG GATTTCCCGG ACGATCCCGA TCTGCGCTAC
GACTACGCGA TGGCGAGCGA GAAGATCGGC CAGTACACGA CGATGGAACA GCAGTTGCGC
CTGCTGATGC GTGCGCAGCC CGACAATCCG CAAGCCTACA ACGCGCTCGG CTATTCGCTC
GCGGACCGCA ACCTGCGCCT GCAGGAAGCG AGCAAGCTGA TCGAGAAGGC GAACTCGCTC
GCGCCGAACG ACGCGTTCAT CATGGACAGC CTCGGCTGGG TCAAGTATCG CCTCGGCGAC
ACGGCGGGCG CGACGGCGAT CCTGAAACGC GCTTACGACC TGCAGCCGAA CGCGGAAATC
GGCGCGCACC TGGGCGAAGT GCTGTGGAGA AGCGGCTCGC GCGACGAAGC GCGCGCCGCA
TGGCGCGCGG CGCAGAAGCT CGAACCCGAC AACGATACGC TCGTGCAGAC GCTCAAGCGC
CTTCAGGTGA ACGGACTTTG A
 
Protein sequence
MTLPLKLSQK RLAAARGPRA VPVRRAIGAA LVAAWALAAL PAHAQDDAGD DAPQAAFASA 
LPEEQKDLPN VALTSQIVYQ VLAAEVALQR SLPAPAYQTY LALARDTRDP RMAQRATEIA
LAAQSPADAL TAANLWREYS PGSQRAAQVD AALLVLGGKP AEAQPMLSQE LARATGENRG
QAIIALQALL ARGPNRVGGL TVLQDLLKND MGRPEARLAI ARQQLATDDK DGATQSLKEA
LRIKPDYLPA ALMLSQMGPG ERAAGIASFE KFVQQNPKSR DGRLALAQLY LADDRLDDAQ
KQFDAMRRND SSDPTPLMAI ALIKIQQKHL DDATTYLKQY VKVAQKKPGA DVGQAYVYLA
QIALDQNNEA LAAQWLDKVD EASQQYVPAQ VTRAQLLQKQ GKADEARKLL ANLQASDPRD
AAVIARTDAS ILFTSKRYKE AADRLAQAVE DFPDDPDLRY DYAMASEKIG QYTTMEQQLR
LLMRAQPDNP QAYNALGYSL ADRNLRLQEA SKLIEKANSL APNDAFIMDS LGWVKYRLGD
TAGATAILKR AYDLQPNAEI GAHLGEVLWR SGSRDEARAA WRAAQKLEPD NDTLVQTLKR
LQVNGL