Gene BURPS1106A_2088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2088 
Symbol 
ID4900222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2082173 
End bp2084674 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content67% 
IMG OID640135318 
Productfimbrial usher protein 
Protein accessionYP_001066353 
Protein GI126454737 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGCCG CCGCGCTCAC GGCGCTTTCG GCGACCGCTC GCGGCCAACA GGCGCTGGAG 
TTCGATCCCG CCTTTCTCGA GCTGGGCGGC GGCCAGGGCG GAGCCGATCT TTCCGTGTAC
GCGACATCGA ACCGCGTGCT GCCCGGCGTC TATCCGGTAT CGGTCTTCGT CAACGGCGAG
GCGATCGAGC GGCGCGACAT CACGTTCGTG TCCGAAAGCG CGCGCGACGG GCGGGAAGAC
GCGATCCCCT GCCTGAGCGC CCGGATGTTC GACGAATGGG GCGTCGACAT CGCCGCGTTT
GCGAAGCTCG CGCAAGCCGG CGAAGACGCA TGCGTCGACA TCGCCGACAG CGTGCCCCAC
GCGCGAACCG AGTTCGACAG TCATCAACTG CGGTTGAACG TAACGGTGCC CCAGGCCGCG
TTGAAGCGGC GCGCGCGCGG CGCGGTGGAC CCGGCGCGCT GGGATCAGGG TATCGACGCC
GCGCTGCTCG ACTATCAACT GAGCGCTGCC CAGTACGCCG GCGGCAATTT CGCGTCCGCC
CGTTCGCGCA CGACGTTGTA TGCGGGGTTG CGCGGCGCCG TCAATCTGGG CGCATGGCGG
CTGTCGCACA CGTCGTCGTT TCTGCGCGGG CTCGACGGCC GGAATCGCTT TCAGATCGTC
AATACGTTCG TCCAGCGCGA CATCGCCGGC TGGAACAGCC GCCTCACCGC CGGGGAGGGC
ACCACGCCCG CGAACATTTT CGACGGATTT CAGTTTCTCG GCGTGCAACT CAATACCGAC
GAGACCATGC TGCCCGACAG TCTGCAGGGC TACGCGCCCA CCGTGCACGG CGTCGCGCAG
ACCAATGCGC AGGTGACGAT CAGGCAGAAT GGTTTCGTCA TCTACAGCAC CTACGTGCCG
CCGGGGCCGT TTACGATCGA CGATCTTTAT CCGACGTCGT CGTCGGGCAA TCTGGAAGTG
ACGATTACCG AGGCCGACGG GCACGTCACG ACATTCACGC AACCGTACTC CGCGGTGCCG
ATGTTGCTGC GCGACGGTTC GTGGCGGTAT AACGTCACGG CGGGCCAATA CCGTGACGGC
ATTTCAGGCT CGCATCCGAG CTTTGCGATG GCGACGCTCG CGCGCGGGCT GGCGGGCGAA
TTCTCGTTGT ACGGCGGTTT CATCGGGGCC GGCATGTATC AATCGGTGCT CGTCGGAATC
GGCAAGAACC TGGGCAGCAT CGGCGCGGTA TCGCTCGACG TGTCGCACGC GCGCAGCGCG
GTCGACCTGG CCGACAGCAG CACGGTGTCG GGGCATGCCT TTCGCGTGCT TTACGCGAAG
GCCGTGGGCA GTTGGGGCAC CGATTTCCGG TTGCTCGCAT ACCGCTACTC CACCGCCGGC
TATCGAAGCT TCGCCGATGC GGTGCAACTG CGCGACGGCA GCGAGCCCGC GGCGCTGGGC
GCAAAGCGCC AGCGCCTCGA GGGCACGGTG AACCAGCGTC TCGGCCGCCT CGGCTCGATG
TACGCGACCG TGGCCGTGCA GACCTACTGG GGCAGCGCGG CGCGCAGCAC CGTGTATCAG
CTCGGGCACA GCGGGAATTG GGGGCGCGCC AGCTACGGAC TCTATGCGGC CTATAGCAAA
GGAAGCGGCG TGCCGTCGAG CTGGAACGTC TCGTTGTCGC TATCGATGCC GCTGGAAGTG
CTTTTCGGGG GCGCGCGCGT GCGTGCGCCG GCGGGCGGCA GTGCGAATGT CTCGTACTTC
GTCAGCCGGA ACAACGAGAA CCACGTCAAT CAACAGATGA CGGCCAGCGG CAGCAGCAGC
GAGCAGCGCC TGAACTACAG CGTGGGCGTC GCGCATTCCA GCGAGTCGGA CGTGAGCGGC
TCGGTGTCGG CCAGCTACCT CGCGCCGTTC GGCCGCTACG ACGCGTCGAT CGGCAGCGGC
CGCGGGTACA CGCAGGCCGC ATTCACTGCC GCCGGCGGCA TGCTGTGGCA TGGCACCGGA
GTGTTGTTCA CGCAACCGCT CGGCGAGACC GTGGCGGTCG TGGACGTGCC GAACGTGCAG
GGCGTGCGTT TCGAAATGCA CCCGGGCGTG AGCACGGATC GGGCGGGCGA AGCGGTGATT
CCGCGGCTGA ACCCCTATCG GGTCAACCGC ATCGTCGTCG ACCAGCGCCG GATGCCGCAG
GACGTGGAGA TCCGCAACCC GGTGAGCGAA GTCGTGCCCA CCCGGGCGGC GGTCGTTCAA
ACGCACTTCG ATTCCGTCGT CGGGCTTCGC GCGCTGTTCA CGTTGATGCG CGCGGACGGC
TCGTTTCCGC CGCAGGGCGC GACGGCCGAG AACGACGAGG GACAGGTGCT CGGCGTCGTC
GGGATGGACG GCGAGACGTT CGTGGCGGGC TTGCCCGCCG CCGAAGGGCA TTTCGTCGTT
CGCTGGGGGG CGGCGCGACA GAATCGCTGC CGCGTGAATT ACGCGCTGCC CGGAAAGGCG
GCGATCGGCG CGTACTTGGC CGTGGAGGCG ATATGCGATT GA
 
Protein sequence
MLAAALTALS ATARGQQALE FDPAFLELGG GQGGADLSVY ATSNRVLPGV YPVSVFVNGE 
AIERRDITFV SESARDGRED AIPCLSARMF DEWGVDIAAF AKLAQAGEDA CVDIADSVPH
ARTEFDSHQL RLNVTVPQAA LKRRARGAVD PARWDQGIDA ALLDYQLSAA QYAGGNFASA
RSRTTLYAGL RGAVNLGAWR LSHTSSFLRG LDGRNRFQIV NTFVQRDIAG WNSRLTAGEG
TTPANIFDGF QFLGVQLNTD ETMLPDSLQG YAPTVHGVAQ TNAQVTIRQN GFVIYSTYVP
PGPFTIDDLY PTSSSGNLEV TITEADGHVT TFTQPYSAVP MLLRDGSWRY NVTAGQYRDG
ISGSHPSFAM ATLARGLAGE FSLYGGFIGA GMYQSVLVGI GKNLGSIGAV SLDVSHARSA
VDLADSSTVS GHAFRVLYAK AVGSWGTDFR LLAYRYSTAG YRSFADAVQL RDGSEPAALG
AKRQRLEGTV NQRLGRLGSM YATVAVQTYW GSAARSTVYQ LGHSGNWGRA SYGLYAAYSK
GSGVPSSWNV SLSLSMPLEV LFGGARVRAP AGGSANVSYF VSRNNENHVN QQMTASGSSS
EQRLNYSVGV AHSSESDVSG SVSASYLAPF GRYDASIGSG RGYTQAAFTA AGGMLWHGTG
VLFTQPLGET VAVVDVPNVQ GVRFEMHPGV STDRAGEAVI PRLNPYRVNR IVVDQRRMPQ
DVEIRNPVSE VVPTRAAVVQ THFDSVVGLR ALFTLMRADG SFPPQGATAE NDEGQVLGVV
GMDGETFVAG LPAAEGHFVV RWGAARQNRC RVNYALPGKA AIGAYLAVEA ICD