Gene BURPS668_1876 details

Gene Information

Locus tag: BURPS668_1876
Symbol:
ID: 4884604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1838407
End bp: 1839978
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 71%
IMG OID: 640127804
Product: type IV pilus assembly protein
Protein accession: YP_001058911
Protein GI: 126438606
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
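
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the gene length includes the stop codon. The function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1838407
end_bp = 1839978
gene_length_bp = 1572
protein_length_aa = 523


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding the final stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))    # 1572
print(coding_codons(gene_length_bp))    # 523
```

Both results agree with the listed Gene Length (1572 bp) and Protein Length (523 aa).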


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.483104
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGGCC GGCGTCCGGC CGCTTCATTC GCGCCCGGCG ACGCCGGCGG CGCGCACGAG 
TTCGCGCCCG ACGCCGCGCC GGCGGGCGCG GGCGCGGCGC CGGCCGGCGG CCACGCATCG
GCGCCCGACG CGGGCGGCGC GGCACGCGCG CGCGAGCCGG CCGGCGCATC GGGCGCGAGC
GCGCCCGGCG GGGCGCCGCA GTCCGGCGTC GCGCCAAGCG GCCATCGGCT CGCATCGCGT
GACGACCGCA TCGATCGCAT CGATCGCGAT GACGGCCACG ACCGCGCGCC CGCGCAAGAC
CACCACGAGG CGCTGATCCG CTCGGAGACG TTCAAGACGA TCCGCACGGT CGTGTTCTCG
TCGATGAACA TGTCGGCCGC GCTGATGATG TCGCGCGCCG AAGTGCGCGA AGGCATCGAG
CAGGCGGCCG CCGACGTGAT CGCGCGCGAG CGGCTGACGG TGACGCTCGC CGAGCAGGCG
CTCATCGTCG ACGAGATCCT CAACGACATG TTCGGCGTCG GGCCGATCGA GCCGTTGCTC
GCCGACGAAC GCGTGACCGA CATCCTCGTC AACGGCCCCG ATCAGGTGTA CGTCGAGCGC
GCGGGCAAGC TCGAGCTCAC GCCGCTGAAG TTCCGCGACA ACGCGCACGT GATCAACGTC
GCGCAGCGGA TCGCGGCGGC GGTCGGGCGG CGCGTCGACG AGAGCAGCCC GATGGTCGAC
GCGCGGCTCG CGGACGGCAG CCGCGTGAAC GTCGTGCTGC CGCCGATCGC GATCCGCGGC
GCGTCGATCT CGATCCGCAA GTTCGCCAAG CGCAACATCA CGCTCGCGCG GATGGCGCAG
CAGGGCAACC TGTCGCAGGC GATGGTCGAG GTGCTGAAGA TCGCGAGCGC GTGCCGGCTG
AACATCGTGA TCTCGGGCGG CACGGGCTCC GGCAAGACGA CGCTGCTGAA CGCGCTGTCG
CACTTCATCG ATTCGCACGA GCGCATCGTG ACGATCGAGG ACGCCGCGGA GCTGCAATTG
CAGCAGCCGC ACGTCGTGAG CCTCGAGACG CGCCCGGAGA ACAGCGAGGG GCTGGGCGGC
GTGTCGCAGC GCGATCTCGT GCGCAACGCG CTGCGCATGC GCCCCGATCG CATCATCCTC
GGCGAGACGC GCGGCCCGGA GGCGTTCGAC GTGCTGCAGG CGATGAACAC CGGGCACGAC
GGCTCGATGA CGACGATCCA CGCGAACACG CCGCGCGATG CGATCACGCG CCTCGAGAGC
ATGGTGATGA TGGCCAACGG CAACCTGCCG CTCGTGTCGA TCCGCCGGCA GATCGCGAGC
GCGGTGCACA TGATCCTGCA GATCGAGCGC ATGCGCGACG GCGTGCGGCG CGTCACGCGC
GTGACCGAGA TCGCCGGCAT GGAGGGCGAT GTCGTGATCA CGCAGGATCT GTTCGCGTTC
CGCTACGACG CGAGCGCGTT CCAGGAGCAG GTGCACGGGA TGTTCGAATC GTCGTCGCTG
CGTCCGGCGT TCGCGCAGCG CGCCGCGTAT TACGGCCTCG AGGGCGCGCT GCTTGGCGCG
TTGCAGCCGT GA
 
Protein sequence
MFGRRPAASF APGDAGGAHE FAPDAAPAGA GAAPAGGHAS APDAGGAARA REPAGASGAS 
APGGAPQSGV APSGHRLASR DDRIDRIDRD DGHDRAPAQD HHEALIRSET FKTIRTVVFS
SMNMSAALMM SRAEVREGIE QAAADVIARE RLTVTLAEQA LIVDEILNDM FGVGPIEPLL
ADERVTDILV NGPDQVYVER AGKLELTPLK FRDNAHVINV AQRIAAAVGR RVDESSPMVD
ARLADGSRVN VVLPPIAIRG ASISIRKFAK RNITLARMAQ QGNLSQAMVE VLKIASACRL
NIVISGGTGS GKTTLLNALS HFIDSHERIV TIEDAAELQL QQPHVVSLET RPENSEGLGG
VSQRDLVRNA LRMRPDRIIL GETRGPEAFD VLQAMNTGHD GSMTTIHANT PRDAITRLES
MVMMANGNLP LVSIRRQIAS AVHMILQIER MRDGVRRVTR VTEIAGMEGD VVITQDLFAF
RYDASAFQEQ VHGMFESSSL RPAFAQRAAY YGLEGALLGA LQP
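
The sequences above are printed in space-separated blocks of ten, so the spacing has to be stripped before any analysis. The sketch below shows one way to clean such a block, compute its GC fraction, and translate it. It is an illustration, not the database's own pipeline. It uses the amino-acid assignments of NCBI translation table 11, which are identical to the standard code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final 12 bases of the gene sequence above.
print(translate("ATGTTCGGCC GGCGTCCGGC CGCTTCATTC"))  # MFGRRPAASF
print(translate("TTGCAGCCGT GA"))                     # LQP
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.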