Gene BURPS668_0878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0878 
Symbol 
ID4884313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp856260 
End bp857561 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID640126806 
Producthypothetical protein 
Protein accessionYP_001057929 
Protein GI126441313 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.717121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAG GACGTCGACA CTTCGTGCGC TCGGTTGCGA GCGCCTCGGC CGCGCTCGCG 
GCCGCCGCAT GGTCCCCGGC GCGCGCCGCA ATCGACGCGC CCGCCTCGCC CGCGACCGCG
CTGTCGCTCA CGCCCGGGCG CTGGTCGCCG AACAACGTCG CGCGGCTGCG CGCGGTGCTC
GCCGGGCACG GCGCGTCGAG CCCGCGCTAC CGCCCCGAGC ACCGCCCGTA CGCGGTGTTC
GACTGGGACA ACACGAGCAT CATGAACGAC TGCGAAGAAG CGCTGCTGAT GCACCAGATC
GACGGGCTGC ATTACCGGCT CACGCCCGAG CAGTTCTCGG CGATCCTGCG CCAAGGCGTG
CCCGACGGCC CGTTCGACGC GAAGCTCGGC TATACGAGCG TCGACGGCAA GCCCGTGCGG
ATGGAGGACA TCGCGGCCGA CGTCGACGCC GACTACCGGT GGCTGCATGC GAACTATCGC
GGCCTCGCGG GCGACAAGCC GCTCGACGAG ATCCACCGCA GCGAGCCGTT CCGGGATTTC
CGCGCGAAGC TGTACTTCAT GTACGACGCG ATCTGCGACA CGTATCCGGT CGAGATCGGC
TACAAGTGGA TCATGTACTG GTACGCGGGC ATGACGCGCG ACGAGTTGCA GGCGATGGCG
TTCGACAGCA ACGTCGCGAA CCTCGGCGAC GCGCTGCGCA AGGTGACCTA CGAAAGCTCG
CGCGCGCTGC CGGGCAAGGC GGGCGTCATC GCCGCGACGC ACTTCCACGG CATCCGCATC
CACGAGGAGA TCCGCGCGGT GATGGACACG CTGCGCTCGA ACGGCATCGA CGTGTACGTC
AGCACCGCAT CGCTCGACGA CGTCGTGCGC GTGTTCGCGG GCCATCCGGC GTTCGGCTAC
GGCGTGCCCG CCGAAAACGT GATCGGCATG CGGCTCACGA TGGCGGACGG CAAGTACATG
AACGAATACC TGCCGAACTG GCACTTCAAC TACGGGCCGG GCAAGACGGA CGGCATCCGC
CGCGAGCTCG AAGCGAAGAA GGGCTACGGG CCGCTGCTCG TGTTCGGCGA CAGCGACGGC
GACGCGTGGA TGCTGCGCGA CTTCGCCGAT ACCGCGGTCG GCGTGATCGT CAACCGGATG
AAGAAAGGCG AGATCGGTAT CGACAGCCGC AAGGCGGCCG AGCAGATCGG CGCGAAGGAC
GCGCGGCTCG TGCTGCAAGG GCGCGACGAG AACACCGGGC TGATGGTCGC CGACGAGCGC
TCGATCAAGT ACGGCAAGCG CGATCCCAAA CTGCTCGCGT GA
 
Protein sequence
MKTGRRHFVR SVASASAALA AAAWSPARAA IDAPASPATA LSLTPGRWSP NNVARLRAVL 
AGHGASSPRY RPEHRPYAVF DWDNTSIMND CEEALLMHQI DGLHYRLTPE QFSAILRQGV
PDGPFDAKLG YTSVDGKPVR MEDIAADVDA DYRWLHANYR GLAGDKPLDE IHRSEPFRDF
RAKLYFMYDA ICDTYPVEIG YKWIMYWYAG MTRDELQAMA FDSNVANLGD ALRKVTYESS
RALPGKAGVI AATHFHGIRI HEEIRAVMDT LRSNGIDVYV STASLDDVVR VFAGHPAFGY
GVPAENVIGM RLTMADGKYM NEYLPNWHFN YGPGKTDGIR RELEAKKGYG PLLVFGDSDG
DAWMLRDFAD TAVGVIVNRM KKGEIGIDSR KAAEQIGAKD ARLVLQGRDE NTGLMVADER
SIKYGKRDPK LLA