Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0148 |
Symbol | |
ID | 4906207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 137434 |
End bp | 138504 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640143255 |
Product | ImpA-related N-terminal family protein |
Protein accession | YP_001074191 |
Protein GI | 126456359 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03363] type VI secretion-associated protein, ImpA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00537792 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCA CCCCACCCGC CGCCGCGCTG CGCTATGCGG ATCTTCTCGA GCCCGTCTCG CCGGACGCAC CCTGCGGGCC GGACCTCGAA TACGATCCGG CGTTCGTGAT GCTGCAATCC GCGATCGCGC CGAGGAAAGA TGCGCAATAT GGCGAGTTCG TCGAAGCGCC TCAGCCCGCG AACTGGGCGG AGGCCGAACG CGATTGCCTC GCGCTGCTGC TGCGCACGAA AGACATCCGG CTCGTCGTGA TCCTGATGCG ATGCCGAATC CGCCAGAGCG GCGCGGAGGG CCTGCGCGAC GGCCTCACGC TGCTCAACGA GTTGCTCGCA CGCTACGGTG AGGCACTGCA CCCCGTACCG TTCTTCGAAG GCGAACGCGA TCCCGTGGTC TATGCGAATG CGATCGCCAC GCTGGCGGAC CCCGATGCGA CGCTCGCGGA TATTCGGGAA ATCCCGTTGC CCAAGGCGAG CGGCCTGCAA TTGCAGTTGC GCGACATCGA AAAGGCGCTC GCGGTGACGC GCGTGAAAGA CGCACTCGCG CCCGAATCGG CCAGCCGGCT GCTGAAGGAA TGGTGGAATC GGCGCGACAA GACGATCGCG GCGCTCGCGC AAGCCCAGCG CATCGTGGCC GATCTGATCG CGTCGACCCG CGAGTCGCTC GGCGACGACG CGCCCGACCT GTCCGGCATC GCGAAACTGC TGCATCCGTT TGCGCAAGCG CAACTGGAAT CGCCGTATTC GGCAAACGCC GCTCAACCGC AAGGCGACGC GAAGCCGGCG ACCGGCGACG CCGCGCATGC GCGCGCCGCC GATACGTCCG CCCAGGCAGG CGATACCGAC ACGCAAGCGC CCGCTGCGAT GCCGATCGCG CCTGCCCAAC CGCCGATGGA TCGCTGGGGC GCGCTGGCGG CCATTCAGGC GACGCGCCTT TGGTTCGAGC AGAACGAGCC GAGCAGCCCG GTGATCGTGC TGCTGCGCCA GTCGGAGCGG ATGGTCGGCA AGCGCTTTTC GGAAATCGCC AATGCGATTC CCGCCGAACT GCTCGCGCAA TGGGATGCGA TCGACGTCTA G
|
Protein sequence | MNATPPAAAL RYADLLEPVS PDAPCGPDLE YDPAFVMLQS AIAPRKDAQY GEFVEAPQPA NWAEAERDCL ALLLRTKDIR LVVILMRCRI RQSGAEGLRD GLTLLNELLA RYGEALHPVP FFEGERDPVV YANAIATLAD PDATLADIRE IPLPKASGLQ LQLRDIEKAL AVTRVKDALA PESASRLLKE WWNRRDKTIA ALAQAQRIVA DLIASTRESL GDDAPDLSGI AKLLHPFAQA QLESPYSANA AQPQGDAKPA TGDAAHARAA DTSAQAGDTD TQAPAAMPIA PAQPPMDRWG ALAAIQATRL WFEQNEPSSP VIVLLRQSER MVGKRFSEIA NAIPAELLAQ WDAIDV
|
| |