Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2124 |
Symbol | |
ID | 4887613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2058658 |
End bp | 2060157 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640132061 |
Product | hypothetical protein |
Protein accession | YP_001063118 |
Protein GI | 126444584 |
COG category | [S] Function unknown |
COG ID | [COG3517] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03355] type VI secretion protein, EvpB/VC_A0108 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGCG AACACCTGCA ATCCCCGAAG CACGACGACG CGCCGGACGC GACGCCGCCC GAGTCCCCCG CCTCGCTGCT CGACGAGCTG ATCGAGGCCG CGCGCGTGAA GCGCGACGAA GACGCATACC CGATCACGCG CCACGGCATC CAGGCGTTCG TCGCGCATCT GGCGAAGCCC AAGCGCCCGA TCGAGACCGT GAGCCAGGCG ACGATCGACG ACATGATCGC CGAGATCGAC CGCAAGCTGT GCCGGCAGAT CGACGCGATC CTCCATGACC CGGCATTCCA GCAACTCGAA TCGACGTGGC GCTCGCTGAA GTTTCTCGTC GATCGAACGG ATTTCCGCGA GAACGTGAAG GTTCAGATTC TCGACGTCGG CAAAACGGCG CTGTTCGACG ACTTCGAGGA TTCGCCCGAC ATCACGAAAT CCGGGCTGTA CCAGAAGGTC TATACGGCCG AGTACGGCCA ATTCGGCGGC CAGCCGATCG GCGCGATCGT CGCGAACTAC ACGTTCGGGC CCGGCGCGCA GGACGTCAAG CTGCTGCAGT ACGTCGCGAG CACGTCGGCG ATGGCGCATA CGCCGTTCAT CGCGGCGGCG GGCCCCGCGT TCTTCGGCAT CGATTCGTTC GGCAAGCTGC CGAACGTGAA GGATCTCGCC TCGCTGTTCG AGGGGCCGCA ATACGCGAAA TGGAATGCGT TTCGCGAAAG CGAGGACGCG CGCTACGTCG GCCTCACGCT GCCGCGCTTC TTGCTGCGGC TGCCTTACGG CGCGAACACG ACGCCCGTCA AGCGCTTCAA CTACGAGGAG CGCGTCGACG GCGGCGACGC GCATTTTCTG TGGGGCAACG CGGCGTTCGC GTTCGCGACG CGCCTCACCG CGAGCTTCGC CGACTATCGC TGGTGCGCGA ACGTGATCGG GCCGAAAGGC GGCGGAACGG TGACCGATCT GCCGCTCTAC GCGTACGAAT CGATGGGCGA GATCCAGAAC AAGATCCCGA CCGACGTGCT GATTTCCGAG CGCCGCGAGT TCGAGCTCGC CGAACAGGGC TTCATCGCGC TGACGATGCG CAAGAACAGC GACAACGCCG CCTTTTTCTC CGCGAACTCC ACGCAGAAGC CGAAGTTCTT CGGCATCAGC AAGGAGGGCA AGGAGGCCGA GCTCAACTAC CGGCTCAGCA CGCAACTGCC GTACATCTTC GTCGTCAACC GGCTCGCCCA TTACATCAAG GTGATCCAGC GGGAAAACAT CGGCTCGTGG AAGGAGCGCG GCGATCTCGA GCAGGAGCTC AACCAGTGGA TCCGCCAGTA CGTCGTCGAC ATGGACAACC CGTCGCAGAG CGTGCGCAGC CGCCGCCCGC TGCGGCAGGC GCAGATCGTC GTGTCGGACG TCGAGGGCGA ACCCGGCTGG TATCGCGTGG ACATGAAGGT GCGGCCGCAC TTCAAGTACA TGGGCGCGTT CTTCACGCTG TCGCTCGTCG GCAAGCTCGA AAAGCGCTAG
|
Protein sequence | MEGEHLQSPK HDDAPDATPP ESPASLLDEL IEAARVKRDE DAYPITRHGI QAFVAHLAKP KRPIETVSQA TIDDMIAEID RKLCRQIDAI LHDPAFQQLE STWRSLKFLV DRTDFRENVK VQILDVGKTA LFDDFEDSPD ITKSGLYQKV YTAEYGQFGG QPIGAIVANY TFGPGAQDVK LLQYVASTSA MAHTPFIAAA GPAFFGIDSF GKLPNVKDLA SLFEGPQYAK WNAFRESEDA RYVGLTLPRF LLRLPYGANT TPVKRFNYEE RVDGGDAHFL WGNAAFAFAT RLTASFADYR WCANVIGPKG GGTVTDLPLY AYESMGEIQN KIPTDVLISE RREFELAEQG FIALTMRKNS DNAAFFSANS TQKPKFFGIS KEGKEAELNY RLSTQLPYIF VVNRLAHYIK VIQRENIGSW KERGDLEQEL NQWIRQYVVD MDNPSQSVRS RRPLRQAQIV VSDVEGEPGW YRVDMKVRPH FKYMGAFFTL SLVGKLEKR
|
| |