Gene BURPS668_2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2243 
Symbol 
ID4884906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2233094 
End bp2234887 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content68% 
IMG OID640128171 
ProductPhoH family protein 
Protein accessionYP_001059278 
Protein GI126439774 
COG category[T] Signal transduction mechanisms 
COG ID[COG1875] Predicted ATPase related to phosphate starvation-inducible protein PhoH 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTGC CTACTCCCCC CAGCAAGCTC GGCAGCCTGC TGCCGCCCGA CGAATACAAG 
GCGAAAGCGC GGCCCGCGAA AGCCGCGAAG CAATCCGCCT CCGGGGGCCC CGCTCATGCC
GCCGCGGCCG ACTACAGCCC GGCGAGCGTA GCCGAACCGA TGGTCGTCGC CGCGAACACC
ACCACGCCGC TGCGCGCCGT CGCGCCCGCC GCCGACGCGA GCGCCGGCGC GCCGCGCGCG
CGCCGGGCAA AGCAGACGGC CGCGCTGCTG CAGCCGGTGC CGGCCGCGCC CGCCCCCGTC
GCGCGCTCGT CGAAGACCGA TGCGGCGAAG CCCGCGGCAG CCGCGCCCGC CGCGCCGCGC
GCGGCCGCGA AAACGCGCGG CCGCAAGGAC GCCGAAGTCG AGATGCAAAA GCTCTTCGTG
CTCGACACGA ACGTGCTGAT GCATGATCCG AGCAGCCTGT TCCGCTTCGA GGAGCACGAC
GTCTATCTGC CGATGATGAC GCTCGAGGAG CTCGACAATC ACAAGAAGGG CATGTCGGAA
GTCGCGCGCA ACGCGCGCCA GGTGAGCCGC ACGCTCGACT CGCTCGTCGC CGATGCGGGC
CCGATCTCGG CCGGCATTCC GCTCGCGCGC CTCGGCAGCC GCGAGGCGCT CGGCCGCCTG
TATTTCCAGA CCACGCTCAC GAGCATCGAG CCTGTCGAGG GCCTGCCGCA GGGCAAGGCC
GACAACCAGA TCCTGGGCGT CGTGCGCGCG CTGCAGCGCG AGCGGCCCGA CCGGCAGGTC
GTGCTGGTGT CGAAAGACAT CAACATGCGG GTCAAGGCGC ACGCGCTCGG CCTGCCCGCC
GAGGACTACT TCAACGATCA GGTTCTCGAA GACAAGGATC TGCTCTACTC CGGCGTGCGC
GAACTGCCGC AGGACTTCTG GACGAAGCAC GCGAAGGGGA TGGAGAGCTG GCAGGACACG
AAGACGGGCA CGACGTACTA CCGCGTGACG GGCCCGCTCG TCGCGTCGAT GCTCGTCAAC
GAGTTCGCCT ATCTCGAGCC GCAGAACGGC GAGCCCGCGT TCCACGCGAT CGTGCGCGAG
CTGAACGGCA AGACGGCGCT GTTGCAGACG CTGCGCGACT ACAGCCACCA CAAGAACAAC
GTGTGGGGCA TCACCGCGCG CAACCGCGAG CAGAATTTCG CGCTGAACCT GCTGATGAAC
CCCGAGATCG ACTTCGTCAC GCTGCTCGGC CAGGCCGGCA CCGGCAAGAC GCTCGTCGCG
CTCGCGGCGG GCTTGGCGCA GGTGCTCGAC GACAAGCGCT ACAACGAGAT CATCGTCACG
CGCGCGACCG TGCCGGTGGG CGAGGACATC GGCTTCCTGC CCGGCACCGA GGAAGAGAAG
ATGCAGCCGT GGATGGGCGC GTTCGACGAC AACCTCGAAG TGCTGCAGAA AACCGACGAC
GCGGCGGGCG AATGGGGCCG CGCGGCGACG CAGGAGCTGA TCCGCTCGCG CCTGAAGGTC
AAGAGCATGA ACTTCATGCG CGGCCGCACG TTCGTCGACA AGTACGTGAT CATCGACGAG
GCGCAGAACC TCACGCCCAA GCAGATGAAA ACGCTCGTCA CGCGCGCGGG CCCCGGCACG
AAGATCATCT GCCTCGGCAA CATCGCGCAG ATCGACACGC CGTATCTGAC GGAAGGCAGC
TCGGGCCTCA CGTACGTCGT CGATCGCTTC AAGGGCTGGG GCCACAGCGG CCACGTGACG
CTCGCGCGCG GCGAGCGCTC ACGGCTGGCC GATTACGCGT CCGACATCCT GTAA
 
Protein sequence
MPLPTPPSKL GSLLPPDEYK AKARPAKAAK QSASGGPAHA AAADYSPASV AEPMVVAANT 
TTPLRAVAPA ADASAGAPRA RRAKQTAALL QPVPAAPAPV ARSSKTDAAK PAAAAPAAPR
AAAKTRGRKD AEVEMQKLFV LDTNVLMHDP SSLFRFEEHD VYLPMMTLEE LDNHKKGMSE
VARNARQVSR TLDSLVADAG PISAGIPLAR LGSREALGRL YFQTTLTSIE PVEGLPQGKA
DNQILGVVRA LQRERPDRQV VLVSKDINMR VKAHALGLPA EDYFNDQVLE DKDLLYSGVR
ELPQDFWTKH AKGMESWQDT KTGTTYYRVT GPLVASMLVN EFAYLEPQNG EPAFHAIVRE
LNGKTALLQT LRDYSHHKNN VWGITARNRE QNFALNLLMN PEIDFVTLLG QAGTGKTLVA
LAAGLAQVLD DKRYNEIIVT RATVPVGEDI GFLPGTEEEK MQPWMGAFDD NLEVLQKTDD
AAGEWGRAAT QELIRSRLKV KSMNFMRGRT FVDKYVIIDE AQNLTPKQMK TLVTRAGPGT
KIICLGNIAQ IDTPYLTEGS SGLTYVVDRF KGWGHSGHVT LARGERSRLA DYASDIL