Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2348 |
Symbol | |
ID | 4883315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2323013 |
End bp | 2324254 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640128276 |
Product | twin-arginine translocation pathway signal sequence domain-containing protein |
Protein accession | YP_001059380 |
Protein GI | 126440937 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC GCGATTTTCT GGCCCTGGCG AGCCTTGCCG GCGCGGCGGG CGTGCCGTAT GCGTTCGCTG CCGCGCCGGG CGAGACGAGC GCAACGGGGG CGATGGGAGC GGTGGGGGCG GCGCGCGCCG CACGCTACTC GAACCTGCTG ATTCTCGTCG AGCTCAAGGG CGGCAACGAC GGGCTCAACA CGGTGATTCC GTACGCGAAT CCGCTGTACC GCACGCTGCG CCCGGCGATC GGCGTCAAGC GCGAGCAGGT CGTGCAGCTC GACGAGCGCG CCGCGCTGCA TCCGGCGCTC GAGCCGCTCA TGCCGATCTG GCGCGACGGA CGGCTCGCGA TCGTCGAAGG CGTCGGCTAT CCGCAGCCGA ATCTGTCGCA CTTTCGCTCG ATCGAGATCT GGGATACCGC GTCGCGCGCG AACGAGTATC TGCGCGAAGG GTGGCTCACG CGCGCGTTCG CGCAGGCGAG CGTGCCGCCC GGCTTCGCCG CGGACGGCAT CGTGCTCGGC AGCGCGGAAA TGGGGCCGCT CGCGAACGGC GCGCGTGCGA TCGCCCTCGT GAATCCCGCG CAGTTCGCTC GCGCGGCGCG ACTCGCGCAG CCCGTGTCGC TGCGTGAGCG CAACCCCGCG CTCGCGCACG TGATCGACAT CGAAAACGAC ATCGTCAAGG CCGCCGATCG GCTGCGTCCG CATGCGGGCA CGCCCGCGCT CGCGACCGCG TTTCCGGGCG GGCCGTTCGG CGCATCGGTG AAGACCGCGA TGCAGGTGCT CGCCGCGTGC GATACGCCGC AGCGTACGCC GGCGCCGGGG CAGGGCGTCG CGGTGCTGCG CCTCACGTTG AACGGCTTCG ACACGCACCA GAACCAGCCC GGCCAGCAGG CGGGCTTGCT CGGCCAACTG GCGCAAGGGC TGGTGGCGAT GCGCTCGGCG TTGATCGAGC TCGGGCGCTG GAACGATACG CTCGTGATGA CGTATGCGGA GTTCGGCCGG CGCGCGCGCG AGAATCAGAG CAACGGAACC GATCACGGCA CGGCCGCGCC GCATTTCGTG ATGGGCGGGC GCGTGCGGGG CGGGCTGTAC GGCGCGCCGC CCGCGCTCGA CGCGCTCGAC GGCAACGGCA ACCTGCCTGT CGCCGTCGAT TTCCGTCAGC TTTATGCGAC CGTGCTCGGC CCATGGTGGG GGCTCGACGC GGCGAGTGTG CTCAGGCAGC GTTTCGAGCC GCTGCCGTTG CTGCGCGCCT GA
|
Protein sequence | MKRRDFLALA SLAGAAGVPY AFAAAPGETS ATGAMGAVGA ARAARYSNLL ILVELKGGND GLNTVIPYAN PLYRTLRPAI GVKREQVVQL DERAALHPAL EPLMPIWRDG RLAIVEGVGY PQPNLSHFRS IEIWDTASRA NEYLREGWLT RAFAQASVPP GFAADGIVLG SAEMGPLANG ARAIALVNPA QFARAARLAQ PVSLRERNPA LAHVIDIEND IVKAADRLRP HAGTPALATA FPGGPFGASV KTAMQVLAAC DTPQRTPAPG QGVAVLRLTL NGFDTHQNQP GQQAGLLGQL AQGLVAMRSA LIELGRWNDT LVMTYAEFGR RARENQSNGT DHGTAAPHFV MGGRVRGGLY GAPPALDALD GNGNLPVAVD FRQLYATVLG PWWGLDAASV LRQRFEPLPL LRA
|
| |