Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0803 |
Symbol | |
ID | 4885792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 777233 |
End bp | 778594 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640130743 |
Product | hypothetical protein |
Protein accession | YP_001061802 |
Protein GI | 126444399 |
COG category | [S] Function unknown |
COG ID | [COG3522] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03353] type VI secretion protein, VC_A0114 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.533915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAGTC TGCCGGTAGG ACCGGTCGCG TGGAGCGACG GCATGCTGAT CGAGACGCAG CACTTCCAGC AGCTCGAGCG GCATCTCGCG CATCAGGCCT CGCTGCGGCT CGGTCAGACG TCGAATCACG GCTGGGGCTT CACGCTGCTC GATCTCGACC AGGACGGCCT GGGGCTCGGC CGGCTCGGGC TGCGCCACGC GCGCGGCGTG TTCCAGGACG GCACCGCGTT CTCGCTGCCG TCGGACGATC CGCTGCCGCC GCCGCTGGAA ACCGAGCTCG CGCAGGCGGG CGACATCGCG TGCCTCGCGC TGCAAGCCGC GCGCACGGGC GGCCCGGAGA TGGCGTTCGG CGACGTCGAG CTGGCGTCGC GCTATCGCGC GGTGTCGACC GAGGTGCCGG ATCTCGCGGT CGGGCTCGAC GCGCCCGGCA CGCCGCGGCG CCTGACGATC GAGACGGGCC AGCTCGTCAC GCGCCTGTGC TGGAAGTCGC AGCTGCGCTC GGACGAGGTC GCGCTGCCGA TCGCGCGCGT GGCGGGGCGC AACGCGAGCC GCACGGTGTC GCTGGATCCG CGCTTCATTC CGCCGCTGCT CGACACGCGC GCGCACCTGG TGCTGCGCTC GCTGATCGAC GAGCTGCAGA GCACGCTGCG CGTGCGGCTC GCGAGCACGT CCGCGCAGCG CGTGCTGTCG ACGGGCGGGG GCGTGGCCGA TCTGATCGAG CTGCTGCTGC GCCAGGCGAT CGCCGAGTAC CGGATGCGCT TGGCGAACCT CGACGCGTTC GATCCGCTGC CGCCGGCGAT GCTGTATCAC GAACTGGTCG GCCTGCTCGG GCGGCTGAGC GTGCTGCCGG GCGTCGACGA GGAACTGGCC GACCGCGAGC TCGGCTACGA CCACGACGAT CTGCAGACGA GCTTCGAGCC GCTCGCGATG ATGCTGCGCC AGGCGCTCGC GCGCGTGATC GAGACACCGG TGCTGCCGCT GCGCTTCGAG GATCGCGGCG ATCAGGTGCA CATCTGCATC GTCGACAAGC AGTGGAACCT GAAGAAACTG ATTTTTGCGT TTTCGGCCGC GATGCCGGCG GAGAAGCTGC GGCAACTGTT GCCGCAGCAG ACGAAGCTGG GCGCCGTCGA GCAGATCCAG AAGCTCGTGG ACCTGCAACT GCCGGGCGCG CGGCTGAACG CGCTGCCCAA TCCCCCGCGC CAGATTCCCT ACTACGCCCA AAGCACGTAC TTCGAAGTGG AATCGACCGA TCCGTTCTGG AAGCAGACCC TCGCCGGCTC GGCGATGGCG CTGCGCATCG TCGGCGATTT CCCCGATCTT CGCTTCGAAG CCTGGGGGCT GAGAGACGGC AAGGTGGCGT GA
|
Protein sequence | MSSLPVGPVA WSDGMLIETQ HFQQLERHLA HQASLRLGQT SNHGWGFTLL DLDQDGLGLG RLGLRHARGV FQDGTAFSLP SDDPLPPPLE TELAQAGDIA CLALQAARTG GPEMAFGDVE LASRYRAVST EVPDLAVGLD APGTPRRLTI ETGQLVTRLC WKSQLRSDEV ALPIARVAGR NASRTVSLDP RFIPPLLDTR AHLVLRSLID ELQSTLRVRL ASTSAQRVLS TGGGVADLIE LLLRQAIAEY RMRLANLDAF DPLPPAMLYH ELVGLLGRLS VLPGVDEELA DRELGYDHDD LQTSFEPLAM MLRQALARVI ETPVLPLRFE DRGDQVHICI VDKQWNLKKL IFAFSAAMPA EKLRQLLPQQ TKLGAVEQIQ KLVDLQLPGA RLNALPNPPR QIPYYAQSTY FEVESTDPFW KQTLAGSAMA LRIVGDFPDL RFEAWGLRDG KVA
|
| |