Gene BURPS1106A_1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1780 
Symbol 
ID4899231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1742003 
End bp1743658 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content67% 
IMG OID640135011 
Producthypothetical protein 
Protein accessionYP_001066050 
Protein GI126454681 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGC TCGCGCTCGT CGCGGCCGCC GGCGCCGCGC ACGCGCAAAG CCGCTCGGGC 
GGCAATCCGC TGGAAGCCCT CCCGCAGATC AACACGCCGC AAAAGCCGAG CGTCACCGTG
CAGGTCGCGC CGCAGGAAGT CCAGGTGCAG GCGCTGCTCG CGCGCCATCT GACGCCGAGC
TCGTTCCAGG TCGAAGGCGT CAAGTCGATT CCGTTCGAAG AGATCTCGCA ACGCTTCACG
CCGCTCGTCG GCAAGGACAT CACGATCGGC CAGTTGATCG AGACGGCGAA CGGCGTGACC
AAGCTGTACC AGGAGCGCGG CTACGCGCTG TCGTTCGCGT TCGTTCCCGC GCAGACGTTC
GAAGGCGGCG TCGTGCGCGT GACGGTCGTC GAAGGCTATG TCGCGAACCT GAAGATCACG
GGCCGCCCCG GCGCGATGGA GCCGAAGGTG CGCGCGATCG CCGCGCACAT CATGGCCGAC
CGCCCGCTGC GCCGCGCGAC GTTCGAGCGC TACGTCAACA CGTTCGGCCT GCTGCCCGGC
GTGACGGTGA AGGCGAACGT GCCGCCGCCG CAGAATACCG ACGGCGCGAC GACGCTCGAG
CTCAACGTCG ATCGCAAGCC GTTCAACGTG AGCGCGGGCC TGAACACGAA CAATCCGGGC
CTGCAGGGCC TGTTCACGGT GACGGAGAAC GGACTCACGT CGCTCGGCGA GCAGATGAGC
ATCTCCGCGC TGTTCCCGAA AGGGCCGAAC AATCAGACGT ACGTGTCGTT CAACGGCGCC
GTGCCGATCG GCAGCAACGG CCTCGTCACG CGTCTGGACG CGAGCCACTA CCGCGGCAAT
CCGTCCGTCG ATCAGACCGT GCTGCCGAAC GTGCAGCGCA CCGTGATCAA CGACAAGCTC
GGCCTGTCGG CGTCGTATCC GCTGATGCTG AGCAACCAGC GCAGCCTGCT CGGCACGGTG
TCGGGCTATG CGTCGCACAG CGAGGATCGC TACCAGAACC AGAGCACGGG CGCGACGATC
GGCATGCGCT CGCAGGTGCG CGTGCTGCAG ATGCAGTTCG ACTACACGAG CGTGCAGCCG
AAGCAGGTGC AGAAGCTGAG CTTCAACGTC GCCAAGGCGT TCGACATCCT GGGCGCGTCG
AAATCGGGCT TCACGAACCT GCCGGGCGTC ATCGCGACGA ACCCCGCGTC GACGACGTTC
GTGCGCACGG GCGCCACGTT CGTGCAGACG AACGAGTGGC CGTTCAAGAT CGGCTCGACC
GTGCAGCTCA CCGGCCAGTA CAGCCCCGAT TCGCTGCCGA GCACCGAGCA GATCTCGTTC
GGCGCGCAGC GTTTCGCGCT CGGCTATCAG CCGGGCGAGA CGTCGGGCGA TTCGGGCTGG
GGCGCGTCGC TCGAGCTCAA TCGCGCGTTC GCGCCGGGCT TCACGTACCT GAAGAACATC
ACGCCGTACA TCGTGTACGA CATGGCGCGC GTCTATCTGC ATTCGGGCAC GCCGGTGCCG
CGCCGCCTGT CGTCGGCCGG GTTCGGCGTG CGGCTGACCG ACAGCCGCTT CTACAATCTC
GACGTGTCGA TCGCGAAGCC CGTCGGCGAC GCGCCAATCG AAAGCGCATC GCGCAGCCCG
CGCGTGAACG CCTCGTTCTC GTATCAACTC TATTGA
 
Protein sequence
MLLLALVAAA GAAHAQSRSG GNPLEALPQI NTPQKPSVTV QVAPQEVQVQ ALLARHLTPS 
SFQVEGVKSI PFEEISQRFT PLVGKDITIG QLIETANGVT KLYQERGYAL SFAFVPAQTF
EGGVVRVTVV EGYVANLKIT GRPGAMEPKV RAIAAHIMAD RPLRRATFER YVNTFGLLPG
VTVKANVPPP QNTDGATTLE LNVDRKPFNV SAGLNTNNPG LQGLFTVTEN GLTSLGEQMS
ISALFPKGPN NQTYVSFNGA VPIGSNGLVT RLDASHYRGN PSVDQTVLPN VQRTVINDKL
GLSASYPLML SNQRSLLGTV SGYASHSEDR YQNQSTGATI GMRSQVRVLQ MQFDYTSVQP
KQVQKLSFNV AKAFDILGAS KSGFTNLPGV IATNPASTTF VRTGATFVQT NEWPFKIGST
VQLTGQYSPD SLPSTEQISF GAQRFALGYQ PGETSGDSGW GASLELNRAF APGFTYLKNI
TPYIVYDMAR VYLHSGTPVP RRLSSAGFGV RLTDSRFYNL DVSIAKPVGD APIESASRSP
RVNASFSYQL Y