Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1876 |
Symbol | |
ID | 4884604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1838407 |
End bp | 1839978 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640127804 |
Product | type IV pilus assembly protein |
Protein accession | YP_001058911 |
Protein GI | 126438606 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.483104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGGCC GGCGTCCGGC CGCTTCATTC GCGCCCGGCG ACGCCGGCGG CGCGCACGAG TTCGCGCCCG ACGCCGCGCC GGCGGGCGCG GGCGCGGCGC CGGCCGGCGG CCACGCATCG GCGCCCGACG CGGGCGGCGC GGCACGCGCG CGCGAGCCGG CCGGCGCATC GGGCGCGAGC GCGCCCGGCG GGGCGCCGCA GTCCGGCGTC GCGCCAAGCG GCCATCGGCT CGCATCGCGT GACGACCGCA TCGATCGCAT CGATCGCGAT GACGGCCACG ACCGCGCGCC CGCGCAAGAC CACCACGAGG CGCTGATCCG CTCGGAGACG TTCAAGACGA TCCGCACGGT CGTGTTCTCG TCGATGAACA TGTCGGCCGC GCTGATGATG TCGCGCGCCG AAGTGCGCGA AGGCATCGAG CAGGCGGCCG CCGACGTGAT CGCGCGCGAG CGGCTGACGG TGACGCTCGC CGAGCAGGCG CTCATCGTCG ACGAGATCCT CAACGACATG TTCGGCGTCG GGCCGATCGA GCCGTTGCTC GCCGACGAAC GCGTGACCGA CATCCTCGTC AACGGCCCCG ATCAGGTGTA CGTCGAGCGC GCGGGCAAGC TCGAGCTCAC GCCGCTGAAG TTCCGCGACA ACGCGCACGT GATCAACGTC GCGCAGCGGA TCGCGGCGGC GGTCGGGCGG CGCGTCGACG AGAGCAGCCC GATGGTCGAC GCGCGGCTCG CGGACGGCAG CCGCGTGAAC GTCGTGCTGC CGCCGATCGC GATCCGCGGC GCGTCGATCT CGATCCGCAA GTTCGCCAAG CGCAACATCA CGCTCGCGCG GATGGCGCAG CAGGGCAACC TGTCGCAGGC GATGGTCGAG GTGCTGAAGA TCGCGAGCGC GTGCCGGCTG AACATCGTGA TCTCGGGCGG CACGGGCTCC GGCAAGACGA CGCTGCTGAA CGCGCTGTCG CACTTCATCG ATTCGCACGA GCGCATCGTG ACGATCGAGG ACGCCGCGGA GCTGCAATTG CAGCAGCCGC ACGTCGTGAG CCTCGAGACG CGCCCGGAGA ACAGCGAGGG GCTGGGCGGC GTGTCGCAGC GCGATCTCGT GCGCAACGCG CTGCGCATGC GCCCCGATCG CATCATCCTC GGCGAGACGC GCGGCCCGGA GGCGTTCGAC GTGCTGCAGG CGATGAACAC CGGGCACGAC GGCTCGATGA CGACGATCCA CGCGAACACG CCGCGCGATG CGATCACGCG CCTCGAGAGC ATGGTGATGA TGGCCAACGG CAACCTGCCG CTCGTGTCGA TCCGCCGGCA GATCGCGAGC GCGGTGCACA TGATCCTGCA GATCGAGCGC ATGCGCGACG GCGTGCGGCG CGTCACGCGC GTGACCGAGA TCGCCGGCAT GGAGGGCGAT GTCGTGATCA CGCAGGATCT GTTCGCGTTC CGCTACGACG CGAGCGCGTT CCAGGAGCAG GTGCACGGGA TGTTCGAATC GTCGTCGCTG CGTCCGGCGT TCGCGCAGCG CGCCGCGTAT TACGGCCTCG AGGGCGCGCT GCTTGGCGCG TTGCAGCCGT GA
|
Protein sequence | MFGRRPAASF APGDAGGAHE FAPDAAPAGA GAAPAGGHAS APDAGGAARA REPAGASGAS APGGAPQSGV APSGHRLASR DDRIDRIDRD DGHDRAPAQD HHEALIRSET FKTIRTVVFS SMNMSAALMM SRAEVREGIE QAAADVIARE RLTVTLAEQA LIVDEILNDM FGVGPIEPLL ADERVTDILV NGPDQVYVER AGKLELTPLK FRDNAHVINV AQRIAAAVGR RVDESSPMVD ARLADGSRVN VVLPPIAIRG ASISIRKFAK RNITLARMAQ QGNLSQAMVE VLKIASACRL NIVISGGTGS GKTTLLNALS HFIDSHERIV TIEDAAELQL QQPHVVSLET RPENSEGLGG VSQRDLVRNA LRMRPDRIIL GETRGPEAFD VLQAMNTGHD GSMTTIHANT PRDAITRLES MVMMANGNLP LVSIRRQIAS AVHMILQIER MRDGVRRVTR VTEIAGMEGD VVITQDLFAF RYDASAFQEQ VHGMFESSSL RPAFAQRAAY YGLEGALLGA LQP
|
| |