Gene Information |
Locus tag | BURPS668_1873 |
Symbol | |
ID | 4885586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1835266 |
End bp | 1836714 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640127801 |
Product | type II/III secretion system protein |
Protein accession | YP_001058908 |
Protein GI | 126441311 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4964] Flp pilus assembly protein, secretin CpaC |
TIGRFAM ID | |
|
|
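The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record itself does not state either convention. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1835266
end_bp = 1836714

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1449 bp
print(protein_length, "aa")  # 482 aa
```

Under these assumptions the result matches the listed Gene Length (1449 bp) and Protein Length (482 aa).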
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAGTG TGATCGACGC GATCGGCAAG CGGCGGCAAG CCGCATCGGC ATCGCCGCGC GCGCGGGCGT TCGCGGCGGC GCGGCGCGCG GCCGCCGCGG CGTTGTGGTG GCTCGCGCTC GCGCTGGGCA TGGCGGTGGC GCCGAGATTC GCCCGCGCGG CGGACCTCGG CCCGGTGCTG GCCGTGCCCG CCGGCGGCGG CGAGATGGTG AAGCTGCCGG CGCCGGCGGT CGCGGTGTTC GTCGCGGACC CGGACGTCGC CGACGTGCAC GTGCCGACGC CGCAGGCGGT GTTCGTGCTC GGCAAGAAAG CCGGCACGAC GACGCTCTTC GCGCTCGGCG CGAACAACCG GACGATCCTG CGCGAGACGG TCGTCGTCGA TGTCGATACG CCGTCGCTGC AGCGCATTCT CGATGCGCGC TTTCCGCAAC TGCGCCTGAC GCTCGCGGGC GCGCCGGGCT CGCTGATGGT GAGCGGCCGC GTGCCGAGCG CGGCGGACGC GGACGCCGTC GTGCAGACGC TCAAGCCGTA CCTGCGCCAG CAGGAAGCGC TCGTCAACCG GCTCACGCTC GCGCGGCCGA TCCAGGTGCA CCTGCGCGTG CGCATCACCG AGGTCGACCG CAACATCACG CAGCAGCTCG GCATCAACTG GAGCGCGCTC GGCGCGAGCG GCAACTTCGT CGGCGGCCTG TTCAACGGGC GCACGCTGTT CGACACGGCG TCGAAGGCGT TCGATCTGTC GCCGTCGGGC GCGTTCTCGG TGGTGGGCGG CTTTCACACG TCGCGCTACT CGATCGACGG CGTGCTCGAC GCGCTCGATC AGGAAGGCCT CATCACGATG CTCGCCGAGC CGAACCTGAC CGCGATCTCC GGCCAGACCG CGAGCTTCCT CGCGGGCGGC GAGTTTCCGA TTCCGGTCGC GCAGGACACG ACGGGCGCGA TCACGATCCA GTTCAAGCCA TATGGCGTGT CGCTCGACTT CACGCCGACC GTGCTCGCCG ACAACCGGAT CAGCCTCAAG GTGCGCCCGG AGGTGAGCGA GATCGATCCG ACCAACAGCG TGACGACGGG CAGCATCAAG GTGCCGGCGC TGACGGTGCG CCGCGTCGAC ACGACGGTCG AGCTGTCGAG CGGGCAGAGC TTCGCGATCG GCGGGCTCCT GCAGAGCAAG AGCAGCGACG TGCTCGCCGA GCTGCCGGGC CTCGCGCGGC TGCCCGTGCT CGGCAAGCTG TTCTCGTCGC GCAACTACCT GAACGACAAG ACCGAGGTCG TCGTGATCGT CACGCCCTAC ATCGTGCAGC CGGCGAATCC GGGCGAGCTG CGCGACGCGC TCGACGACGT CACGCGCCCG AGCAGCGACA TCGAGTTCGT GCTGCAGCGC TCGCTCGGCA TCGATCCGCT CGGCGGCGAT GCGCCGCGGC TCGCGGGCCC GGCGGGGTTC GTCTACTGA
|
Protein sequence | MWSVIDAIGK RRQAASASPR ARAFAAARRA AAAALWWLAL ALGMAVAPRF ARAADLGPVL AVPAGGGEMV KLPAPAVAVF VADPDVADVH VPTPQAVFVL GKKAGTTTLF ALGANNRTIL RETVVVDVDT PSLQRILDAR FPQLRLTLAG APGSLMVSGR VPSAADADAV VQTLKPYLRQ QEALVNRLTL ARPIQVHLRV RITEVDRNIT QQLGINWSAL GASGNFVGGL FNGRTLFDTA SKAFDLSPSG AFSVVGGFHT SRYSIDGVLD ALDQEGLITM LAEPNLTAIS GQTASFLAGG EFPIPVAQDT TGAITIQFKP YGVSLDFTPT VLADNRISLK VRPEVSEIDP TNSVTTGSIK VPALTVRRVD TTVELSSGQS FAIGGLLQSK SSDVLAELPG LARLPVLGKL FSSRNYLNDK TEVVVIVTPY IVQPANPGEL RDALDDVTRP SSDIEFVLQR SLGIDPLGGD APRLAGPAGF VY
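As a rough check on how the two sequences relate, the sketch below translates the first and last codons of the gene sequence. It is a minimal illustration, not the pipeline used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper name `translate` is hypothetical.

```python
# Build a codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    # Ignore any trailing bases that do not form a full codon.
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final nine bases of the gene sequence above.
head = translate("ATGTGGAGTG TGATCGACGC GATCGGCAAG")
tail = translate("GTCTACTGA")

print(head)  # MWSVIDAIGK
print(tail)  # VY*
```

The first ten residues match the start of the listed protein sequence, and the final codon TGA is a stop, which is consistent with the protein ending in VY.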