Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0147 |
Symbol | |
ID | 4906368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 136351 |
End bp | 137379 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640143254 |
Product | hypothetical protein |
Protein accession | YP_001074190 |
Protein GI | 126455537 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.1246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AATTCTCAAG ATTCCTTGCG GCCCTCGTCT TGCTCGCAGC GACCGTCGAT GCGCTTGCCG CCGGCTGCAA CATGCTGACC GCGGAGAATA TCGCGTGGCT GACCGAGCAA GGCAAGATCG CGCGCGCTCA CAGTTCCTTT CCCATATCCT TCTCTTCCGG AATGGTCGAT GTCGACCCGA ACCTCGAGAT CGGAGGCTTG ATAGCAGAGG CAAAAAGCAT CCCAAGCGAA GAACTGCACT TCATTTGGTG CAGCGCGCCC TCCGGGAATG TTCACTTTGC GCTCAATTCG TCCCCGCTGC CGTCGGAACT CGGAAATTCG ATCTATGAAA CCGGCGTGCC CGGCGTCGGA TTCCGAATCA CTCAGGTTCG CCAGAGCGGC TCGATAGGGG CGATTCCTCG CGATACGCCG TGGATCGAAG AAAAGCCCGG CCAGGACAGC TCTCTGAATT TCGGCGCGGG AACCGTGTTC CGTATCGAGC TGATCAAGAC CAGCGAAGCG TTGCCGAGCG AATCGACCAT ATCCCTTGGC AACCTCAGTC GCGTATACGG TGACGACAAC AAAACGGTCG TCGATTTCAA TGCCGGCAGC GTCAAGTTGC GCGTGCTGCC GATCTGCCAT GTCGATCAAC AGGAAAAAAA TGTAGATTTC GGCCAATTCG GCCCGAAAGA CGTTTCCTTC GATTCCGGCC CCACCAAGGA CGTCAAGTTT GACGTCCAGT GTTCGGGCCC GACGCCCCCC GTCTCCATCA CGGCGACTTT GGCCGCGACG CCGGATAGTC ACGATCAAAG CCTGATCGCG AACGCCGGCG ATGCCATGAA CCTTGCCATC CGATTGCGCG ATGCAAGTAC GCAACAGGTT CTAAGGCCCA ACGATCCCAC CAGCGAAATC AAGGTCGAGC CCGGCGGCGC AATGGAGCAC GGATTCGCAC TGGAGGCGAC TGTCCTGCGT GTCGGCACGG CGCCGCCCAC GGCGGGCACG ATCGACGCAA CCTCAATCAT CACGCTAACC ATTTTGTGA
|
Protein sequence | MKKKFSRFLA ALVLLAATVD ALAAGCNMLT AENIAWLTEQ GKIARAHSSF PISFSSGMVD VDPNLEIGGL IAEAKSIPSE ELHFIWCSAP SGNVHFALNS SPLPSELGNS IYETGVPGVG FRITQVRQSG SIGAIPRDTP WIEEKPGQDS SLNFGAGTVF RIELIKTSEA LPSESTISLG NLSRVYGDDN KTVVDFNAGS VKLRVLPICH VDQQEKNVDF GQFGPKDVSF DSGPTKDVKF DVQCSGPTPP VSITATLAAT PDSHDQSLIA NAGDAMNLAI RLRDASTQQV LRPNDPTSEI KVEPGGAMEH GFALEATVLR VGTAPPTAGT IDATSIITLT IL
|
| |