Gene Information |
Locus tag | BURPS1106A_1888 |
Symbol | |
ID | 4900956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1844111 |
End bp | 1845682 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640135118 |
Product | type IV pilus assembly protein |
Protein accession | YP_001066153 |
Protein GI | 126452852 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
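The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1844111, 1845682)
print(length, protein_length_aa(length))  # 1572 523
```

Both results match the Gene Length and Protein Length fields listed in the record.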
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.414931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGGCC GGCGTCCGGC CGCTTCATTC GCGCCCGGCG ACGCCGGCGG CGCGCACGAG TTCGCGCCCG ACGCCGCGCC GGCGGGCGCG GGTGCGGCGC CGGCCGGCGG CCACGCATCG GCGCCCGACG CGGGCGGCGC GGCACGCGCG CGCGAGCCGG CCGGCGCATC GGGCGCGAGC GCGCCCGGCG GGGCGCCGCA GTCCGGCGTC GCGCCAAGCG GCCATCGGCC CGCATCGCGT GACGACCGCA TCGACCGCAT CGATCGCGAT GACGGCCACG ACCGCGCGCC CGCGCAAGAC CACCACGAGG CGCTGATCCG CTCGGAGACG TTCAAGACGA TCCGCGCGGT CGTGTTCTCG TCGATGAACA TGTCGGCCGC GCTGATGATG TCGCGCGCCG AAGTGCGCGA AGGCATCGAG CAGGCGGCCG CCGACGTGAT CGCGCGCGAG CGGCTGACGG TGACGCTCGC CGAGCAGACG CTCATCGTCG ACGAGATCCT CAACGACATG TTCGGCGTCG GGCCGATCGA GCCGTTGCTC GCCGACGAAC GCGTGACCGA CATCCTCGTC AACGGCCCCG ATCAGGTGTA CGTCGAGCGC GCCGGCAAGC TCGAGCTCAC GCCGCTGAAG TTCCGCGACA ACGCGCACGT GATCAACGTC GCGCAGCGGA TCGCGGCGGC GGTCGGGCGG CGCGTCGACG AGAGCAGCCC GATGGTCGAC GCGCGGCTCG CGGACGGCAG CCGCGTGAAC GTCGTGCTGC CGCCGATCGC GATCCGCGGC GCGTCGATCT CGATCCGCAA GTTCGCCAAG CGCAACATCA CGCTCGCGCG GATGGCGCAG CAGGGCAACC TGTCGCAGGC GATGGTCGAG GTGCTGAAGA TCGCGAGCGC GTGCCGGCTG AACATCGTGA TCTCGGGCGG CACGGGCTCC GGCAAGACGA CGCTGCTGAA CGCGCTGTCG CACTTCATCG ATTCGCACGA GCGCATCGTG ACGATCGAGG ACGCCGCGGA GCTGCAATTG CAGCAGCCGC ACGTCGTGAG CCTCGAGACG CGCCCGGAGA ACAGCGAGGG GCTGGGCGGC GTGTCGCAGC GCGATCTCGT GCGCAACGCG CTGCGCATGC GCCCCGATCG CATCATCCTC GGCGAGACGC GCGGCCCGGA GGCGTTCGAC GTGCTGCAGG CGATGAACAC CGGGCACGAC GGCTCGATGA CGACGATCCA CGCGAACACG CCGCGCGATG CGATCACGCG CCTCGAGAGC ATGGTGATGA TGGCCAACGG CAACCTGCCG CTCGTGTCGA TCCGCCGGCA GATCGCGAGC GCGGTGCACA TGATCCTGCA GATCGAGCGC ATGCGCGACG GCGTGCGGCG CGTCACGCGC GTGACCGAGA TCGCCGGCAT GGAGGGCGAT GTCGTGATCA CGCAGGATCT GTTCGCGTTC CGCTACGACG CGAGCGCGTT CCAGGAGCAG GTGCACGGAA TGTTCGAATC GTCGTCGCTG CGCCCGGCGT TCGCGCAGCG CGCCGCGTAT TACGGCCTCG AGGGCGCGCT GCTCGGCGCG TTGCAGCCGT GA
|
Protein sequence | MFGRRPAASF APGDAGGAHE FAPDAAPAGA GAAPAGGHAS APDAGGAARA REPAGASGAS APGGAPQSGV APSGHRPASR DDRIDRIDRD DGHDRAPAQD HHEALIRSET FKTIRAVVFS SMNMSAALMM SRAEVREGIE QAAADVIARE RLTVTLAEQT LIVDEILNDM FGVGPIEPLL ADERVTDILV NGPDQVYVER AGKLELTPLK FRDNAHVINV AQRIAAAVGR RVDESSPMVD ARLADGSRVN VVLPPIAIRG ASISIRKFAK RNITLARMAQ QGNLSQAMVE VLKIASACRL NIVISGGTGS GKTTLLNALS HFIDSHERIV TIEDAAELQL QQPHVVSLET RPENSEGLGG VSQRDLVRNA LRMRPDRIIL GETRGPEAFD VLQAMNTGHD GSMTTIHANT PRDAITRLES MVMMANGNLP LVSIRRQIAS AVHMILQIER MRDGVRRVTR VTEIAGMEGD VVITQDLFAF RYDASAFQEQ VHGMFESSSL RPAFAQRAAY YGLEGALLGA LQP
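The relationship between the two sequence fields can be illustrated with a minimal sketch that strips the 10-base spacing, computes GC content, and translates the coding strand. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helper names are hypothetical, and only the first bases of the gene sequence are used here rather than the full record.

```python
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Codon assignments shared by the standard code and table 11.
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return round(100 * sum(base in "GC" for base in s) / len(s), 1)


def translate(seq):
    """Translate the coding strand codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases of the gene sequence, spacing kept as in the record.
print(translate("ATGTTCGGCC GGCGTCCG"))  # MFGRRP
```

The translated prefix agrees with the first six residues of the protein sequence field. Passing the complete gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.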