Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0788 |
Symbol | |
ID | 4887403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 757515 |
End bp | 758453 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640130728 |
Product | ImpA-related N-terminal family protein |
Protein accession | YP_001061787 |
Protein GI | 126445235 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03363] type VI secretion-associated protein, ImpA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAACG ATGCACTCCC GGGCCATTCG CCGGATTTGC TGGATTTCGA CGAAGACTTT ATCAAGATCG ACGCAGCCAT CTGCGAATAC GATTCCGTCG GTTACGCGCC GCAACGCAAA GGAGAGAGTG CGTTCCAGTG GGCGTCCATC GAAACGGGAT GCCTCGCTTT GTTGAAGAAG GCGAAGGATG TCCGGGTCGG CATTTGGCAT CTGCGCGCAT GCATCGCGCG GCGCGGGCTG AGCGGGCTGG CAGACGGCGT TCGATCGCTG GCCGATCTGA TGAGTGCGCC CGTCGAGGAA TTGCATCCTC GCGCGCTGCC TGACGAATCG CCCGGCGAAA CCCTCCTGAT TCATCTCGGC TGGCTTGCGG GCCCGCAGTT TCTGCATCAG CTCGGCAGTT CCCGGTTCGA AGACCGGGAC GCAACCCTCA ACGATCTGAT CGGCGGGCGC GCCGCGGCGA TCGTGGAGGA TCGCGACTAT CGCCTTAGAG CGAATACTCT TGTACATGAC ATTCAGGATT CCTTATCGCG AATTCGGGAA TCGATCGCCG CGGCCGAGCA AGAGCTCAAC GTCTCGCGCG CGCTCGACCT GTTGAGCGTC GCGGCGTCTC GGCTCACGCA GGCGCAAGCC GGCGGCGCGG ACAGCGCAAG CGTCGAGAGC GACGCGCCGG TCGATGCGCC GGCCGGCGCG TCCGCGCCCG GCGCGCAGCA GCCGATGGCG GGGCCGGGCG GCGTGTTGAG GTCGAGACAA GAGGTCGGCG CGGCGCTCGA TCGGATCGTG GAGTACTTCC GCGTGCACGA GCCGAGCCAT CCGGCGCCGA TCTTCCTGTC GCGGATTCAA CGAATGCTGG GGGCGGGTTT CGAGGAAGTG ATGGCGGAGC TCTATCCCGA GGCGGCATCT CTCGTGGCCC AACTGAGCCG GCCGCAGAGT TCCAAGTAA
|
Protein sequence | MNNDALPGHS PDLLDFDEDF IKIDAAICEY DSVGYAPQRK GESAFQWASI ETGCLALLKK AKDVRVGIWH LRACIARRGL SGLADGVRSL ADLMSAPVEE LHPRALPDES PGETLLIHLG WLAGPQFLHQ LGSSRFEDRD ATLNDLIGGR AAAIVEDRDY RLRANTLVHD IQDSLSRIRE SIAAAEQELN VSRALDLLSV AASRLTQAQA GGADSASVES DAPVDAPAGA SAPGAQQPMA GPGGVLRSRQ EVGAALDRIV EYFRVHEPSH PAPIFLSRIQ RMLGAGFEEV MAELYPEAAS LVAQLSRPQS SK
|
| |