Gene BURPS668_1759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1759 
Symbol 
ID4884802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1736332 
End bp1737987 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content66% 
IMG OID640127687 
Producthypothetical protein 
Protein accessionYP_001058798 
Protein GI126440716 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.301432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGC TCGCGCTCGT CGCGGCCGCC GGCGCCGCGC ACGCGCAAAG CCGCTCGGGC 
GGCAATCCGC TGGAAGCCCT CCCGCAGATC AACACGCCGC AAAAGCCGAG CGTCACCGTG
CAGGTCGCGC CGCAGGAAGT CCAGGTGCAG GCGCTGCTCG CGCGCCATCT GACGCCGAGC
TCGTTCCAGG TCGAAGGCGT CAAGTCGATT CCGTTCGAAG AGATCTCGCA ACGCTTCACG
CCGCTCGTCG GCAAGAACAT CACGATCGGC CAGTTGATCG AGACGGCGAA CGGCGTGACC
AAGCTGTACC AGGAGCGCGG CTACGCGCTG TCGTTCGCGT TCGTTCCCGC GCAGACGTTC
GAAGGCGGCG TCGTGCGCGT GACGGTCGTC GAAGGCTATG TCGCGAACGT GAAGATCACG
GGCCGCCCCG GCGCGATGGA GCCGAAGGTG CGCGCGATCG CCGCGCACAT CATGGCCGAC
CGCCCGCTGC GCCGCGCGAC GTTCGAGCGC TACGTCAACA CGTTCGGCCT GCTGCCCGGC
GTGACGGTGA AGGCGAACGT GCCGCCGCCG CAGAATACCG ACGGCGCGAC GACGCTCGAG
CTCAACGTCG ATCGCAAGCC GTTCAACGTG AGCGCGGGCC TGAACACGAA CAATCCGGGC
CTGCAGGGGC TGTTCACGGT GACGGAGAAC GGACTCACGT CGCTCGGCGA GCAGATGAGC
ATCTCCGCGC TGTTCCCGAA AGGGCCGAAC AATCAGACGT ACGTGTCGTT CAACGGCGCC
GTGCCGATCG GCAGCAACGG CCTCGTCACG CGTCTGGACG CGAGCCACTA TCGCGGCAAT
CCGTCCGTCG ATCAGACCGT GCTGCCGAAC GTGCAGCGCA CCGTGATCAA CGACAAGCTC
GGCCTGTCGG CGTCGTATCC GCTGATGCTG AGCAACCAGC GCAGCCTGCT CGGCACGGTG
TCGGGCTATG CGTCGCACAG CGAGGATCGC TACCAGAACC AGAGCACGGG CGCGACGATC
GGCATGCGCT CGCAGGTGCG CGTGCTGCAG ATGCAGTTCG ACTACACGAG CGTGCAGCCG
AAGCAGGTGC AGAAGCTGAG CTTCAACGTC GCCAAGGCGT TCGACATCCT GGGCGCGTCG
AAATCGGGCT TCACGAACCT GCCGGGCGTC ATCGCGACGA ACCCCGCGTC GACGACGTTC
GTGCGCACGG GCGCCACGTT CGTGCAGACG AACGAGTGGC CGTTCAAGAT CGGCTCGACC
GTGCAGCTCA CCGGCCAGTA CAGCCCCGAT TCGCTGCCGA GCACCGAGCA GATCTCGTTC
GGCGCGCAGC GTTTCGCGCT CGGCTATCAG CCGGGCGAGA CGTCGGGCGA TTCGGGCTGG
GGCGCGTCGC TCGAGCTCAA TCGCGCGTTC GCGCCGGGCT TCACGTACCT GAAGAACATC
ACGCCGTACA TCGTGTACGA CATGGCGCGC GTCTATCTGC ATTCGGGCAC GCCGGTGCCG
CGCCGCCTGT CGTCGGCCGG GTTCGGCGTG CGGTTGACCG ACAGCCGCTT CTACAATCTC
GACGTGTCGA TCGCGAAGCC CGTCGGCGAC GCGCCGATCG AAAGCGCATC GCGCAGCCCG
CGCGTGAACG CCTCGTTCTC GTATCAACTC TATTGA
 
Protein sequence
MLLLALVAAA GAAHAQSRSG GNPLEALPQI NTPQKPSVTV QVAPQEVQVQ ALLARHLTPS 
SFQVEGVKSI PFEEISQRFT PLVGKNITIG QLIETANGVT KLYQERGYAL SFAFVPAQTF
EGGVVRVTVV EGYVANVKIT GRPGAMEPKV RAIAAHIMAD RPLRRATFER YVNTFGLLPG
VTVKANVPPP QNTDGATTLE LNVDRKPFNV SAGLNTNNPG LQGLFTVTEN GLTSLGEQMS
ISALFPKGPN NQTYVSFNGA VPIGSNGLVT RLDASHYRGN PSVDQTVLPN VQRTVINDKL
GLSASYPLML SNQRSLLGTV SGYASHSEDR YQNQSTGATI GMRSQVRVLQ MQFDYTSVQP
KQVQKLSFNV AKAFDILGAS KSGFTNLPGV IATNPASTTF VRTGATFVQT NEWPFKIGST
VQLTGQYSPD SLPSTEQISF GAQRFALGYQ PGETSGDSGW GASLELNRAF APGFTYLKNI
TPYIVYDMAR VYLHSGTPVP RRLSSAGFGV RLTDSRFYNL DVSIAKPVGD APIESASRSP
RVNASFSYQL Y