Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0170 |
Symbol | |
ID | 4888248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 157393 |
End bp | 158463 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640130111 |
Product | ImpA-related N-terminal family protein |
Protein accession | YP_001061176 |
Protein GI | 126442596 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03363] type VI secretion-associated protein, ImpA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCCA CCCCACCCGC CGCCGCGCTG CGCTATGCGG ATCTTCTCGA GCCCGTCTCG CCGGACGCAC CCTGCGGGCC GGACCTCGAA TACGATCCGG CGTTCGTGAT GCTGCAATCC GCGATCGCGC CGAGGAAAGA TGCGCAATAT GGCGAGTTCG TCGAAGCGCC TCAGCCCGCG AACTGGGCCG AGGCCGAACG CGATTGCCTC GCGCTGCTGC TGCGCACGAA AGACATCCGG CTCGTCGTGA TCCTGATGCG ATGCCGAATC CGCCAGAGCG GCGCGGAGGG CCTGCGCGAC GGCCTCGCGC TGCTCAACGA GTTGCTCGCG CGCTACGGTG AGGCGCTGCA CCCCGTACCG TTCTTCGAAG GCGAACGCGA TCCCGTGGTC TATGCGAATG CGATCGCCAC GCTGGCGGAC CCCGATGCGA CGCTCGCGGA TATTCGGGAA ATCCCGTTGC CCAAGGCGAG CGGCCTGCAA TTGCAGTTGC GCGACATCGA AAAGGCGCTC GCGGTGACGC GCGTGAAAGA CGCACTCGCG CCCGAATCGG CCAGCCGGCT GCTGAAGGAA TGGTGGAATC GGCGCGACAA GACGATCGCG GCGCTCGCGC AAGCCCAGCG CATCGTGGCC GATCTGATCG CGTCGACCCG CGAGTCGCTC GGCGACGACG CGCCCGACCT GTCCGGCATC GCGAAACTGC TGCATCCGTT TGCGCAAGCG CAACTGGAAT CGCCGTATTC GGCAAACGCC GCTCAACCGC AAGGCGACGC GAAGCCGGCG ACCGGCGACG CCGCGCATGC GCGCGCCGCC GATACGTCCG CCCAGGCAGG CGATACCGAC ACGCAAGCGC CCGCTGCGAT GCCGATCGCG CCTGCCCAAC CGCCGATGGA TCGCTGGGGC GCGCTGGCGG CCATTCAGGC GACGCGCCTT TGGTTCGAGC AAAACGAGCC GAGCAGCCCG GTAATCGTGC TGCTGCGCCA GTCGGAGCGG ATGGTCGGCA AGCGCTTTTC GGAAATCGCC AATGCGATTC CCGCCGAACT GCTCGCGCAA TGGGATGCGA TCGACGTCTA G
|
Protein sequence | MNATPPAAAL RYADLLEPVS PDAPCGPDLE YDPAFVMLQS AIAPRKDAQY GEFVEAPQPA NWAEAERDCL ALLLRTKDIR LVVILMRCRI RQSGAEGLRD GLALLNELLA RYGEALHPVP FFEGERDPVV YANAIATLAD PDATLADIRE IPLPKASGLQ LQLRDIEKAL AVTRVKDALA PESASRLLKE WWNRRDKTIA ALAQAQRIVA DLIASTRESL GDDAPDLSGI AKLLHPFAQA QLESPYSANA AQPQGDAKPA TGDAAHARAA DTSAQAGDTD TQAPAAMPIA PAQPPMDRWG ALAAIQATRL WFEQNEPSSP VIVLLRQSER MVGKRFSEIA NAIPAELLAQ WDAIDV
|
| |