Gene BURPS1106A_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1888 
Symbol 
ID4900956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1844111 
End bp1845682 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID640135118 
Producttype IV pilus assembly protein 
Protein accessionYP_001066153 
Protein GI126452852 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.414931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGGCC GGCGTCCGGC CGCTTCATTC GCGCCCGGCG ACGCCGGCGG CGCGCACGAG 
TTCGCGCCCG ACGCCGCGCC GGCGGGCGCG GGTGCGGCGC CGGCCGGCGG CCACGCATCG
GCGCCCGACG CGGGCGGCGC GGCACGCGCG CGCGAGCCGG CCGGCGCATC GGGCGCGAGC
GCGCCCGGCG GGGCGCCGCA GTCCGGCGTC GCGCCAAGCG GCCATCGGCC CGCATCGCGT
GACGACCGCA TCGACCGCAT CGATCGCGAT GACGGCCACG ACCGCGCGCC CGCGCAAGAC
CACCACGAGG CGCTGATCCG CTCGGAGACG TTCAAGACGA TCCGCGCGGT CGTGTTCTCG
TCGATGAACA TGTCGGCCGC GCTGATGATG TCGCGCGCCG AAGTGCGCGA AGGCATCGAG
CAGGCGGCCG CCGACGTGAT CGCGCGCGAG CGGCTGACGG TGACGCTCGC CGAGCAGACG
CTCATCGTCG ACGAGATCCT CAACGACATG TTCGGCGTCG GGCCGATCGA GCCGTTGCTC
GCCGACGAAC GCGTGACCGA CATCCTCGTC AACGGCCCCG ATCAGGTGTA CGTCGAGCGC
GCCGGCAAGC TCGAGCTCAC GCCGCTGAAG TTCCGCGACA ACGCGCACGT GATCAACGTC
GCGCAGCGGA TCGCGGCGGC GGTCGGGCGG CGCGTCGACG AGAGCAGCCC GATGGTCGAC
GCGCGGCTCG CGGACGGCAG CCGCGTGAAC GTCGTGCTGC CGCCGATCGC GATCCGCGGC
GCGTCGATCT CGATCCGCAA GTTCGCCAAG CGCAACATCA CGCTCGCGCG GATGGCGCAG
CAGGGCAACC TGTCGCAGGC GATGGTCGAG GTGCTGAAGA TCGCGAGCGC GTGCCGGCTG
AACATCGTGA TCTCGGGCGG CACGGGCTCC GGCAAGACGA CGCTGCTGAA CGCGCTGTCG
CACTTCATCG ATTCGCACGA GCGCATCGTG ACGATCGAGG ACGCCGCGGA GCTGCAATTG
CAGCAGCCGC ACGTCGTGAG CCTCGAGACG CGCCCGGAGA ACAGCGAGGG GCTGGGCGGC
GTGTCGCAGC GCGATCTCGT GCGCAACGCG CTGCGCATGC GCCCCGATCG CATCATCCTC
GGCGAGACGC GCGGCCCGGA GGCGTTCGAC GTGCTGCAGG CGATGAACAC CGGGCACGAC
GGCTCGATGA CGACGATCCA CGCGAACACG CCGCGCGATG CGATCACGCG CCTCGAGAGC
ATGGTGATGA TGGCCAACGG CAACCTGCCG CTCGTGTCGA TCCGCCGGCA GATCGCGAGC
GCGGTGCACA TGATCCTGCA GATCGAGCGC ATGCGCGACG GCGTGCGGCG CGTCACGCGC
GTGACCGAGA TCGCCGGCAT GGAGGGCGAT GTCGTGATCA CGCAGGATCT GTTCGCGTTC
CGCTACGACG CGAGCGCGTT CCAGGAGCAG GTGCACGGAA TGTTCGAATC GTCGTCGCTG
CGCCCGGCGT TCGCGCAGCG CGCCGCGTAT TACGGCCTCG AGGGCGCGCT GCTCGGCGCG
TTGCAGCCGT GA
 
Protein sequence
MFGRRPAASF APGDAGGAHE FAPDAAPAGA GAAPAGGHAS APDAGGAARA REPAGASGAS 
APGGAPQSGV APSGHRPASR DDRIDRIDRD DGHDRAPAQD HHEALIRSET FKTIRAVVFS
SMNMSAALMM SRAEVREGIE QAAADVIARE RLTVTLAEQT LIVDEILNDM FGVGPIEPLL
ADERVTDILV NGPDQVYVER AGKLELTPLK FRDNAHVINV AQRIAAAVGR RVDESSPMVD
ARLADGSRVN VVLPPIAIRG ASISIRKFAK RNITLARMAQ QGNLSQAMVE VLKIASACRL
NIVISGGTGS GKTTLLNALS HFIDSHERIV TIEDAAELQL QQPHVVSLET RPENSEGLGG
VSQRDLVRNA LRMRPDRIIL GETRGPEAFD VLQAMNTGHD GSMTTIHANT PRDAITRLES
MVMMANGNLP LVSIRRQIAS AVHMILQIER MRDGVRRVTR VTEIAGMEGD VVITQDLFAF
RYDASAFQEQ VHGMFESSSL RPAFAQRAAY YGLEGALLGA LQP